GPT-5.5 dominates $1,500 LLM hacking test while Gemini refuses to even try
…Claude Sonnet 4.6 and Claude Opus 4.8 each solved 2 out of 10 runs, but Opus in particular got close multiple times before safety guardrails ended the session. At the…
Users will find Opus 4.8 to be a modest but tangible improvement on its predecessor. There’s still more to be done: we’re working on developing and releasing models that provide many of the same capabilities as Opus at a lower cost. Not only that, but we plan to release a new class of model with even higher intelligence than Opus. As part of Project Glasswing, a small number of organizations are currently using Claude Mythos Preview for cybersecurity work. Models of this capability level require stronger cyber safeguards before they can be generally released. We’re making swift progress on dev
Introducing Claude Opus 4.8…Claude Sonnet 4.6 and Claude Opus 4.8 each solved 2 out of 10 runs, but Opus in particular got close multiple times before safety guardrails ended the session. At the…
…You get access to Claude's Haiku, Sonnet, and Opus models. To be fair, they're excellent and the entire reason why Claude Code is absolutely worth paying for. But also, that…
…Anthropic released Claude Opus 4.7 on 16 April, about a week before the incident, and it did not immediately respond to a request for comment. Crane wrote on X that Cursor…
…the only Mythos-class model that had been in general release) can still downshift to the next most powerful Claude model, Opus 4.8, without having to start another chat or coding…
Claude Code Degraded Before Opus 4.8 Release
28 minutes of launch has already passed and for me, it is crystal clear, just branding. 10-15% better than Opus.We are slowing down in adoption and new features, Anthorpic is becoming Apple of Tim Cook and not from Steve…
Introducing Claude Fable 5: a Mythos-class model that we've made safe for general use. Its capabilities exceed those of any model we've ever made generally available. Fable 5 is state of the art on nearly all tested benc…
As an anthropic fan boy(check my prev. comments), this is the first opus release where I feel like the model is just not pleasant to talk to not to mention untrustworthy.The two examples for me where I lost confidence in…
it's been a month tagging sama openai tibo on X for this issueand no one seem to replyand eveyone is falttering codex, im sure im not the only one facing thisi switched to codex from claude since it was better consume le…
…Jun.22 Improvement New features and Claude as agent provider preview in JetBrains IDEs copilot Jun.19 Release AI credits consumed per user now in the Copilot usage metrics API account management…
…Claude Design behaves like a design tool inside a chat Claude Design launched April 2026 under Anthropic Labs, runs on Opus 4.7, and you can find it at claude.ai/design…
…Pricing is now $5/$25 per million tokens—making Opus-level capabilities accessible to even more users, teams, and enterprises. Alongside Opus, we’re releasing updates to the Claude Developer Platform , Claude…
To show you the most relevant results, we’ve omitted some entries very similar to those already shown. Repeat the search with the omitted results included.