Focus areas for The Anthropic Institute
…Conversely, are there aspects of existing law which already apply to AI agents and shouldn’t? Reliability of agents: What aspects of autonomous AI agents could be adapted to fit into existing…
…API) and agentic coding tools like Claude Code score highest due to their potential to automate actions. Impact (0–30 points): Captures the real-world effects of the user’s behavior through…
…This feels like a watershed moment for spreadsheet agents on Shortcut. Evaluating Claude Opus 4.6: Across agentic coding, computer use, tool use, search, and finance, Opus 4.6 is…
…There are new tools for longer-running agents and new ways to use Claude in Excel, Chrome, and on desktop. In the Claude apps, lengthy conversations no longer hit a wall. See…
…For agent products in translation, deep research, slide-building, and analysis, it delivers powerful reliability. On CursorBench, Claude Opus 4.8 exceeds prior Opus models across every effort level. Tool calling is…
…This is the reliability jump that makes Notion Agent feel like a true teammate. In our evals, we saw a double-digit jump in accuracy of tool calls and planning in our…
…Finally, we’re adding new Agent Skills for scientific problem selection, converting instrument data to Allotrope, and supporting bioinformatics work with skills bundles for scVI-tools and Nextflow deployment. We’re also…
…For example, in April 2026, DXC launched DXC OASIS, its tool for running customers' IT systems, where AI agents handle much of the routine work. Claude is now the default foundation model…
…The core primitives for song composition were present, and the agent could drive them autonomously, using tools to create a simple production from end to end. You might say it’s not…
…MCP into their systems, while development tools companies including Zed, Replit, Codeium, and Sourcegraph are working with MCP to enhance their platforms—enabling AI agents to better retrieve relevant information to further…