Search: preview bugs

Assessing Claude Mythos Preview’s cybersecurity capabilities

… We start Claude on the files most likely to have bugs and go down the list in order of priority. Finally, once we’re done, we invoke a final Mythos Preview agent. …

Apr 7, 2026

Project Glasswing: An initial update

… On maintainers’ request, we sometimes disclose bugs directly, without further assessment. We’ve now reported 1,129 such unvetted bugs, of which Mythos Preview estimated that 175 were high- or critical-severity. …

May 22, 2026

Measuring LLMs’ impact on N-day exploits

… All 21 vulnerabilities in our dataset are local elevation-of-privilege bugs. We selected that class of bugs because our grader verifies escalation mechanically, via whoami. …

Jun 8, 2026

Measuring LLMs’ ability to develop exploits

… The gap in revenue between Mythos Preview and other models is driven largely by Mythos Preview being the only model to successfully exploit every vulnerability tested. …

May 22, 2026

Frontier Red Team

… Publications: Measuring LLMs’ impact on N-day exploits; Mapping AI-enabled cyber threats: Insights from the LLM ATT&CK Navigator; What we learned mapping a year’s worth of AI-enabled cyber threats; Measuring LLMs’ ability to develop exploits; Assessing Cl…

Jun 8, 2026

Partnering with Mozilla to improve Firefox’s security

… One, Claude is much better at finding these bugs than it is at exploiting them. …

Mar 6, 2026

Introducing Claude Opus 4.7

… Note that Mythos Preview remains the best-aligned model we’ve trained according to our evaluations. …

Apr 16, 2026

Claude Fable 5 and Claude Mythos 5

… Users will find Mythos 5 comparable to, or somewhat stronger than, Mythos Preview in most cases, while costing substantially less. …

Jun 9, 2026

Claude Opus 4.6

… We’ve introduced agent teams in Claude Code as a research preview. …

Feb 5, 2026

2028: Two scenarios for global AI leadership

… The Mythos Preview wake-up call Mythos Preview, a model that we released to select partners as part of Project Glasswing in April, signals the arrival of an acceleration period that makes policy action even more urgent. …

May 14, 2026