Anthropic Economic Index report: Learning curves
…Our sample covers February 5 to February 12, three months following the release of Claude Opus 4.5 and coincident with the release of Claude Opus 4.6. We first document how…
Users will find Opus 4.8 to be a modest but tangible improvement on its predecessor. There’s still more to be done: we’re working on developing and releasing models that provide many of the same capabilities as Opus at a lower cost. Not only that, but we plan to release a new class of model with even higher intelligence than Opus. As part of Project Glasswing, a small number of organizations are currently using Claude Mythos Preview for cybersecurity work. Models of this capability level require stronger cyber safeguards before they can be generally released. We’re making swift progress on dev
Introducing Claude Opus 4.8
Ever since Anthropic first made the earth-shaking disclosure of its incredibly capable Claude Mythos AI model back in April and the steps it was taking toward a safe release of that…
…We’ve therefore launched the model with safeguards that mean queries on some topics will instead receive a response from our next-most-capable model, Claude Opus 4.8. To release the…
…Queries will now fall back to Claude Opus 4.8, Anthropic’s previous flagship model, the company said in a post on X. Anthropic will prominently tell users too: “You will see…
…They often even prefer it to our smartest model from November 2025, Claude Opus 4.5. Performance that would have previously required reaching for an Opus-class model—including on real-world…
…As for those safeguards I mentioned earlier, Claude will automatically route prompts on some topics through the less-capable Opus 4.8. "To release [Fable 5] both safely and quickly, we've…
…And you know what, I've found one and it's every bit as capable as Claude's latest models for what I need.…
…improve security. Opus 4.6 is currently far better at identifying and fixing vulnerabilities than at exploiting them. This gives defenders the advantage. And with the recent release of Claude Code Security…
…says this: “The difference in capabilities between Mythos Preview and Claude Opus 4.6 is larger than the difference between previous releases.” So we presume there is some insight that makes a…
…Lawrence Keunho Jang. Abstract: MyPCBench evaluates computer-use agents as personal assistants in a simulated Linux desktop environment with real-world web applications, revealing that Claude Opus 4.6 achieves the highest…