Web 25
Videos
Topics
People also ask
What safety risks?
If you’re willing to entertain the views outlined above, then it’s not very hard to argue that AI could be a risk to our safety and security. There are two common sense reasons to be concerned. First, it may be tricky to build safe, reliable, and steerable systems when those systems are starting to become as intelligent and as aware of their surroundings as their designers. To use an analogy, it is easy for a chess grandmaster to detect bad moves in a novice but very hard for a novice to detect bad moves in a grandmaster. If we build an AI system that’s significantly more competent than human
Core views on AI safety: When, why, what, and how
anthropic.com › research › assistant-axis
The assistant axis: situating and stabilizing the character of large language models
…simulated longer conversations that real users might naturally have with AI models, and tested whether drift over time led to concerning behavior. To assess whether we could mitigate any harmful responses, we…
Jan 19, 2026
anthropic.com › research › attack-navigator
Mapping AI-enabled cyber threats: Insights from the LLM ATT&CK Navigator
…It is calculated based on the actor's activity across Claude.ai , Claude Code, and our API, drawing on our safety classifiers alongside open-source and internal threat-intelligence indicators. The higher…
Jun 3, 2026
anthropic.com › research
How AI Is Transforming Work at Anthropic
…We find that AI use is radically changing the nature of work for software developers, generating both hope and concern . Our research reveals a workplace facing significant transformations: Engineers are getting a…
Dec 2, 2025
anthropic.com › research › labor-market-impacts
Labor market impacts of AI: A new measure and early evidence
…If unemployment increased for all workers in parallel, we would not attribute this to AI advancements that still leave many tasks unaffected. One group of particular concern is young workers. Brynjolfsson et…
Mar 5, 2026
anthropic.com › news › mozilla-firefox-security
Partnering with Mozilla to improve Firefox’s security
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Mar 6, 2026
To show you the most relevant results, we’ve omitted some entries very
similar to those already shown.
Repeat the search with the omitted results included .