newsence

Maybe AI Agents Can Be Lawyers After All

TechCrunch

Recent advances in AI, particularly Anthropic's Opus 4.6 model, have significantly improved AI agents' performance on professional tasks such as legal work, suggesting that AI could eventually assist or even act as lawyers.

22 days ago

Maybe AI agents can be lawyers after all

Last month, I wrote about Mercor’s new benchmark measuring AI agents’ capabilities on professional tasks like law and corporate analysis. At the time, the scores were pretty dismal, with every major lab scoring under 25%, so we concluded lawyers were safe from AI displacement, at least for now.

But AI capabilities can change a lot in a couple of weeks.

This week’s release of Opus 4.6 shook up the leaderboards, with Anthropic’s new model scoring just shy of 30% in one-shot trials, and an average of 45% when given a few more cracks at the problem. Notably, the release included a bunch of new agentic features, including “agent swarms,” which may have helped with this kind of multi-step problem-solving.

Regardless, the score is a huge jump from the previous state-of-the-art, and a sign that progress on foundation models isn’t slowing down. Mercor CEO Brendan Foody, who was particularly impressed, said, “jumping from 18.4% to 29.8% in a few months is insane.”

Thirty percent is still a long way from 100%, so it’s not like lawyers need to be worried about getting replaced by machines next week. But they should be a lot less confident than they were last month!

© 2025 TechCrunch Media LLC.