Search

Showing top 54 results for "agent safety focus"

Claude Opus 4.6

…safety profile as good as, or better than, any other frontier model in the industry, with low rates of misaligned behavior across safety evaluations. In Claude Code, you can now assemble agent…

Feb 5, 2026

PwC is deploying Claude to build technology, execute deals, and reinvent enterprise functions for clients

…The collaboration focuses on three areas of highest leverage: agentic technology build, AI-native deal-making, and reinvention of the enterprise function. PwC is launching a new finance business group (Office of…

May 14, 2026

Anthropic raises $65 billion, nears $1T valuation ahead of IPO | TechCrunch

…released its new Claude Opus 4.8 model, which touts better capabilities in agentic tasks, advanced coding, and focus on honesty and self-correction. The AI startup is also reportedly planning to…

May 28, 2026 · Rebecca Bellan

Introducing Sonnet 4.6

…Our safety researchers concluded that Sonnet 4.6 has “a broadly warm, honest, prosocial, and at times funny character, very strong safety behaviors, and no signs of major concerns around high-stakes…

Feb 17, 2026

Trustworthy agents in practice

…Open protocols also keep competition focused on the quality and safety of the agent, rather than on who controls the integrations. None of these measures replace the work that model developers have…

Apr 9, 2026

I gave Claude Code control of my desktop for a week, and it automated things I didn't think were possible

…agentic variant Claude Code. Anthropic places a strong emphasis on AI safety in its model design. Not this time. Claude Code was built by Anthropic, a company focused on AI safety research…

Apr 19, 2026 · Simon Batt

Focus areas for The Anthropic Institute

…Our agenda focuses on four areas for research: Economic diffusion; Threats and resilience; AI systems in the wild; AI-driven R&D. In Core Views on AI Safety, we wrote that doing…

May 7, 2026

Introducing Claude Opus 4.5

…longer-running agents and new ways to use Claude in Excel, Chrome, and on desktop. In the Claude apps, lengthy conversations no longer hit a wall. See our product-focused section below…

Nov 24, 2025

Claude Code auto mode: a safer way to skip permissions

…We keep an internal incident log focused on agentic misbehaviors. Past examples include deleting remote git branches from a misinterpreted instruction, uploading an engineer's GitHub auth token to an internal compute…

Mar 25, 2026

I replaced the expensive Claude Pro subscription with these local models, and my productivity didn’t drop a bit

…Anthropic was founded in 2021 with a strong focus on AI safety research. 02 / 8 Safety What is the name of the safety and values framework Anthropic developed to guide Claude's…

Apr 21, 2026 · Parth Shah
