Coding agents in the social sciences
… Agentic coding platforms like Claude Code and Codex can take a research idea and a dataset, write and run an analysis, interpret the output, and iterate autonomously. …
The human sciences are shifting: for the first time, core research tasks can be handed off to machines. AI chatbots increasingly contribute to scientific research, including in the most prestigious publications and in the social sciences. This has spurred optimism that AI could boost research productivity—while also stoking fears about overloaded peer review and a deluge of academic AI slop. But while turn-taking AI chatbots have primarily been used for writing assistance, coding agents could restructure social science research more radically. Agentic coding platforms like Claude Code and Code
… Next, we developed a collection of metrics that draw on data from both agentic uses of our public API and Claude Code, our own coding agent. These offer a tradeoff between breadth and depth: our public API gives us broad visibility into agentic deployments across thousands of different customers. …
… Over the past two years, we’ve shipped three primary agentic products: claude.ai, Claude Code, and Claude Cowork. …
… Claude Code, for example, can accomplish complex tasks across domains using local code execution and filesystems. But as these agents become more powerful, we need more composable, scalable, and portable ways to…
Science: Long-running Claude for scientific computing (Mar 23, 2026). In this post, Siddharth Mishra-Sharma, a researcher on the Discovery team, explains how to apply multi-day agentic coding workflows (test oracles, persistent memory, and orchestration patterns) to scientific computing tasks even outsi… …
… Carlyle has adopted Claude as a key part of our AI technology stack because of its strong coding capabilities, agentic reasoning, and continual advances in both the underlying models and key features. …
… Opus 4.8’s capabilities: The table below shows how Opus 4.8 compares to its predecessor and to other models on tests of coding, agentic skills, reasoning, and practical knowledge work tasks. …
… We keep an internal incident log focused on agentic misbehaviors. …
… This might mean not building agentic systems at all. Agentic systems often trade latency and cost for better task performance, and you should consider when this tradeoff makes sense. …
… We go into greater technical detail on this topic in our submission to NIST's Center for AI Standards and Innovation (CAISI) on agentic security. …