Claude for Financial Services
…Claude 4 models outperform other frontier models as research agents across financial tasks in Vals AI's Finance Agent benchmark. When deployed by FundamentalLabs to build an Excel agent, Claude Opus 4…
…We’ve already applied NLAs to understand what Claude is thinking and to improve Claude’s safety and reliability. For instance: When Claude Opus 4.6 and Mythos Preview were undergoing safety…
…In our case, when we’re looking for memory safety issues, we have our sanitizer build of Firefox, and if you make it crash you win. We point that agent off to…
…With Claude Cowork and Managed Agents embedded inside it, KPMG professionals and their clients can build new AI capabilities directly in the platform—work that used to mean jumping between tools, chat…
…None of this is remotely sustainable as it currently stands. This means that the startups that are using AI agents to scale their operations are doing so at a time when AI…
…Why pointing a generic coding agent at a repo doesn't work When we first started AI-assisted vulnerability research last year, our instinct was the obvious one: point a generic coding…
…There is no federal statute protecting AI company employees who disclose these kinds of safety concerns that are being aired in this piece. We have cases where Jan Leike, who was a…
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
…But agents can read URL paths, which in some cases contain hypotheses from other agent search queries embedded in the URL slugs. One agent correctly diagnosed what it was seeing: “Multiple AI…
…Paul Christiano stepped down in April 2024 to take a new role as the Head of AI Safety at the U.S. AI Safety Institute. In January 2026, Kanika Bahl stepped down…