News
Claude Opus 4.1 scores 74.5% on the SWE-bench Verified benchmark, indicating major improvements in real-world programming, bug detection, and agent-like problem solving.
Anthropic says Claude Opus 4.1 improves software engineering accuracy to 74.5%. That compares to 62.3% with Claude Sonnet 3.7 ...
Anthropic launches automated AI security tools for Claude Code that scan code for vulnerabilities and suggest fixes, ...
OpenAI CEO Sam Altman went so far as to call GPT-5 “the best model in the world.” That may be pride or hyperbole, as ...
Anthropic launched Claude Opus 4.1 today, an upgraded version of its flagship AI model that achieves 74.5% accuracy on ...
It’s Christmas in August – at least, for those tech-wonks who are interested in new model releases. Today’s news is a very ...
More improvements will be rolled out in the coming weeks, the company said. The price for Opus 4.1 remains the same as that ...
Anthropic has released Claude Opus 4.1, which is said to deliver better coding and agent performance with improved safety.
Anthropic's Claude Opus 4.1 achieves 74.5% on coding benchmarks, leading the AI market, but faces risk as nearly half its $3.1B API revenue depends on just two customers.
With GPT-5 on the horizon and Meta ramping up AI hiring, Anthropic’s new security-focused features aim to differentiate ...
Results that may be inaccessible to you are currently showing.
Hide inaccessible results