Coding agents in the social sciences
… Teaching Claude why New research on how we've reduced agentic misalignment.
… Teaching Claude why New research on how we've reduced agentic misalignment.
… We go into greater technical detail on this topic in our submission to NIST's Center for AI Standards and Innovation CAISI on agentic security. …
… This might mean not building agentic systems at all. Agentic systems often trade latency and cost for better task performance, and you should consider when this tradeoff makes sense. …
… For earlier LLMs, single-turn, non-agentic evals were the main evaluation method. …
… Next, we developed a collection of metrics that draw on data from both agentic uses of our public API and Claude Code , our own coding agent. These offer a tradeoff between breadth and depth: Our public API gives us broad visibility into agentic deployments across thousands of different customers. …
… Today, Anthropic is acquiring Stainless, a leader in SDKs and MCP server tooling, to extend that reach even further. …
… In a world where agents read everything, the investigation tooling is also an attack surface. …
…We virtualized the components of an agent: a session (the append-only log of everything that happened), a harness (the loop that calls Claude and routes Claude’s tool calls to the…
… We keep an internal incident log focused on agentic misbehaviors. …
…Agents with a filesystem and code execution tools don’t need to read the entirety of a skill into their context window when working on a particular task. This means that the…