Search: Model update

LLMs and biorisk

…As we noted at the time, this was a precautionary decision—improving model performance on our evaluations meant we could no longer confidently rule out the ability of our most advanced model…

Sep 5, 2025

LLM-discovered 0 days

…AI models can now find high-severity vulnerabilities at scale. Our view is this is a moment to move quickly—to empower defenders and secure as much code as possible while the…

Feb 5, 2026

Harness design for long-running application development

…First is that models tend to lose coherence on lengthy tasks as the context window fills (see our post on context engineering). Some models also exhibit "context anxiety," in which they begin…

Mar 24, 2026

Introducing Claude Tag

…it makes the model even more proactive, and it works better with a full team. Tagging @Claude is now one of the main ways we get things done at Anthropic. Today, 65…

Jun 23, 2026

…Autodesk Fusion allows designers and engineers with a Fusion subscription to create and modify 3D models through conversations with Claude. Blender offers a natural-language interface to its Python API, allowing users…

Apr 28, 2026

Long-running Claude for scientific computing

…As models have become significantly better at long-horizon tasks over the last year or so, a new way of working emerged: rather than getting involved with every detail, we can specify…

Mar 23, 2026

Anthropic and Amazon expand collaboration for up to 5 gigawatts of new compute

…Claude remains the only frontier AI model available to customers on all three of the world's largest cloud platforms: AWS (Bedrock), Google Cloud (Vertex AI), and Microsoft Azure (Foundry). Claude Platform…

Apr 20, 2026

How we contain Claude across products

…As our models have improved, they have become more aligned on most behavior evaluations, but this doesn’t mean risk necessarily shrinks. Less capable models are more likely to misread a situation…

May 25, 2026

Assessing Claude Mythos Preview’s cybersecurity capabilities

Claude Mythos Preview is a new general-purpose language model that is strikingly capable at computer security tasks. This post provides technical details for researchers and practitioners who want to understand exactly…

Apr 7, 2026

Project Vend: Can Claude run a small shop? (And why does that matter?)

…We look forward to sharing updates as we continue to explore the strange terrain of AI models in long-term contact with the real world. Acknowledgments We’re very grateful to Andon…

Jun 27, 2025
