Search

Showing top 6 results for "preview bugs"

People also ask

Is Anthropic’s decision to withhold Mythos Preview a marketing stunt?

Withholding a model has been a long-standing lever in Frontier Lab safety plans. Before Mythos Preview and Glasswing, OpenAI launched their Trusted Access for Cyber program for GPT-5.3-Codex (their first model to reach “High” cybersecurity capability). Anthropic have now launched their similar Cyber Verification Program.

Mythos enters the chat
What’s unique in Mythos Preview?

The UK AI Security Institute (AISI) put it best with this summary, “Mythos Preview represents a step up over previous frontier models in a landscape where cyber performance was already rapidly improving”. The AISI recently created an evaluation which tests model capability on a network attack simulation spanning 32 stages of an attack chain (estimated to take a human 20 hours to complete). Mythos Preview is the first model to solve this challenge from start to finish, succeeding on 3 of 10 attempts with a 100 million token budget. AISI expect greater budget would improve results further. Mytho

Mythos enters the chat