LLMs and biorisk
…As we noted at the time, this was a precautionary decision—improving model performance on our evaluations meant we could no longer confidently rule out the ability of our most advanced model…
…As we noted at the time, this was a precautionary decision—improving model performance on our evaluations meant we could no longer confidently rule out the ability of our most advanced model…
…AI models can now find high-severity vulnerabilities at scale. Our view is this is a moment to move quickly—to empower defenders and secure as much code as possible while the…
…First is that models tend to lose coherence on lengthy tasks as the context window fills (see our post on context engineering ). Some models also exhibit "context anxiety," in which they begin…
…it makes the model even more proactive, and it works better with a full team. Tagging @Claude is now one of the main ways we get things done at Anthropic. Today, 65…
…Autodesk Fusion allows designers and engineers with a Fusion subscription to create and modify 3D models through conversations with Claude. Blender offers a natural-language interface to its Python API, allowing users…
…As models have become significantly better at long-horizon tasks over the last year or so, a new way of working emerged: rather than getting involved with every detail, we can specify…
…Claude remains the only frontier AI model available to customers on all three of the world's largest cloud platforms: AWS (Bedrock), Google Cloud (Vertex AI), and Microsoft Azure (Foundry). Claude Platform…
…As our models have improved, they have become more aligned on most behavior evaluations, but this doesn’t mean risk necessarily shrinks. Less capable models are more likely to misread a situation…
Claude Mythos Preview is a new general-purpose language model that is strikingly capable at computer security tasks. This post provides technical details for researchers and practitioners who want to understand exactly…
…We look forward to sharing updates as we continue to explore the strange terrain of AI models in long-term contact with the real world. Acknowledgments We’re very grateful to Andon…