Search

Showing top 54 results for "agent safety focus"

All sources anthropic.com 27 xda-developers.com 12 developer.nvidia.com 3 theverge.com 3 wired.com 2 blog.cloudflare.com 2 techcrunch.com 1 huggingface.co 1 theregister.com 1 404media.co 1 en.wikipedia.org 1

Project Glasswing: what Mythos showed us

…Context - Coding agents are tuned for one focused stream of work: building a feature, fixing a bug, writing a refactor. They ingest a lot of source code, hold a single hypothesis at…

May 18, 2026 · Grant Bourzikas

Natural Language Autoencoders

…Related content Teaching Claude why New research on how we've reduced agentic misalignment. Donating our open-source alignment tool Focus areas for The Anthropic Institute At The Anthropic Institute (TAI), we…

May 7, 2026

Claude's newest model is a step forward and two steps back, and it's infuriating

…Finally, the model also shows significant improvement in agentic safety, meaning it's a lot better at recognizing and refusing prompt injection attacks when you're using it as an agent. Opus…

Apr 24, 2026 · Mahnoor Faisal

KPMG integrates Claude across its core business and workforce of more than 276,000 in strategic alliance

…With Claude Cowork and Managed Agents embedded inside it, KPMG professionals and their clients can build new AI capabilities directly in the platform—work that used to mean jumping between tools, chat…

May 19, 2026

Ronan Farrow on Sam Altman’s “unconstrained” relationship with the truth

…This is the reason this company was founded as a nonprofit focused on safety, and where things were being obscured in a way that credible people around this found it less than…

Apr 16, 2026 · Nilay Patel

Sydney will become Anthropic’s fourth office in Asia-Pacific

…built with respect for the unique goals, opportunities, and challenges of the region.” Our initial focus will be supporting our enterprise, startup, and research customers. Anthropic already works with some of Australia…

Mar 10, 2026

Claude Cowork freed up 60GB on my laptop by finding files I completely forgot about

…However, keep in mind that this feature is currently in research preview, and Anthropic is still working on agent safety. The feature is also exclusive to Claude's paid plans for now…

Mar 23, 2026 · Mahnoor Faisal

Long-running Claude for scientific computing

Science Long-running Claude for scientific computing Mar 23, 2026 In this post, Siddharth Mishra-Sharma , a researcher on the Discovery team, explains how to apply multi-day agentic coding workflows—test…

Mar 23, 2026

The AI Compute Crunch Is Here (and It's Affecting the Entire Economy)

…Similarly, the general cost of consumer electronics is increasing as chip manufacturers and production lines shift their focus to building more AI capacity. The largest consumer electronics manufacturer in the world, Apple…

Apr 24, 2026 · Jason Koebler

How we contain Claude across products

…1 The second approach to capping the blast radius—and the focus of much of this post—is containment. Rather than supervising what the agent does, we supervise what it’s able…

May 25, 2026

Followed topics