Search: Safety and security

Redeploying Claude Fable 5

… We provide more context on how we use safety classifiers to detect potentially dangerous cybersecurity uses of our models. …

Jun 30, 2026

Announcing Anthropic's Responsible Scaling Policy

… The basic idea is to require safety, security, and operational standards appropriate to a model’s potential for catastrophic risk, with higher ASL levels requiring increasingly strict demonstrations of safety. …

Sep 19, 2023

Developing Nuclear Safeguards for AI

… This demonstrates a model of public-private partnerships that can be replicated in other national security domains. It also illustrates that there are steps that industry can take now to implement meaningful safety measures. …

Aug 21, 2025

Australian government and Anthropic sign MOU for AI safety and research

… Central to the MOU is a commitment to work with Australia’s AI Safety Institute. We will share our findings on emerging model capabilities and risks, participate in joint safety and security evaluations, and collaborate on research with Australian academic institutions. …

Mar 31, 2026

More details on Fable 5’s cyber safeguards and our jailbreak framework

… First, we provide more information on the cybersecurity safeguards —specifically, the safety classifiers —that we launched with the model. These are the AI systems that accompany the model that detect and block dangerous or potentially dangerous cybersecurity uses. …

Jul 2, 2026

2028: Two scenarios for global AI leadership

… Opportunities for engagement on AI safety Anthropic supports international AI safety dialogue with AI experts in China, when possible. …

May 14, 2026

Claude's extended thinking

… Claude 3.7 Sonnet’s safety mechanisms AI Safety Level. Anthropic’s Responsible Scaling Policy commits us not to train or deploy models unless we have implemented appropriate safety and security measures. …

Feb 24, 2025

Anthropic and NEC partner to build AI-native engineering at scale in Japan

… Together, we will develop secure, industry-specific AI products for the Japanese market, starting with tools for finance, manufacturing, and local government. “This long-term partnership with Anthropic enables NEC to maximize the potential of AI in the Japanese market,” said Toshifumi Yoshizaki, Ex… …

Apr 24, 2026

The Long-Term Benefit Trust

… We believe AI may create unprecedentedly large externalities , ranging from national security risks, to large-scale economic disruption, to fundamental threats to humanity, to enormous benefits to human safety and health. …

Sep 19, 2023