Trustworthy agents in practice
… Increasingly, agents in products like Claude Code hand off some of their work to subagents, other "Claudes" working in parallel on different parts of a task. …
AI is poised to transform the domain of cybersecurity. Anthropic’s Safeguards team recently identified and banned a user with limited coding abilities leveraging Claude to develop malware. Research suggests that this lowering of the bar for expertise needed to pose a threat, combined with the falling costs of large language models (LLMs), presages a dramatic shift in the economics of cyberattacks.[1] To understand the present state of AI cyber capabilities and gain insight into their trajectory, we pursue different approaches to model evaluation, including publicly available and custom-made be
Claude does cyber competitions…
… This is where Anthropic engineering has devoted the most effort, and also where many of the most surprising security failures have occurred. Over the past two years, we’ve shipped three primary agentic products: claude.ai, Claude Code, and Claude Cowork. …
… Cybersecurity: For OASIS, DXC is developing an always-on security engineer subagent, built on Claude Security, that will be deployed across its security operations centers (SOCs). …
… First, when researching “patching agents,” which use LLMs to develop and validate bug fixes, we have developed a few methods we hope will help maintainers use LLMs like Claude to triage and address security reports faster. …
… Part of why we could achieve such speed is that we had multiple versions of Claude running at the same time tackling different challenges. But scaling up AI agents is arguably easier than finding additional human cybersecurity experts. …
… For example, Claude Code is an excellent harness that we use widely across tasks. We’ve also shown that task-specific agent harnesses excel in narrow domains. Managed Agents can accommodate any of these, matching Claude’s intelligence over time. …
… The researchers then tasked this agent with emulating attacks against a cyber-physical model of a water treatment plant, one of the Control Environment Laboratory Resource platforms that PNNL operates on behalf of the Department of Homeland Security’s Cybersecurity and Infrastructure Security Agency… …
… The expansion centers on three areas: Agentic technology build. Engineering teams are using Claude Code to ship production software for major companies in weeks, not quarters. …
… Nidhi Aggarwal, Chief Product Officer of HackerOne, said, “Claude Sonnet 4.5 reduced average vulnerability intake time for our Hai security agents by 44% while improving accuracy by 25%, helping us reduce risk for businesses with confidence.” According to Sven Krasser, Senior Vice President for Dat… …
… Our new connectors and Agent Skills are generally available to all Claude subscribers, including Claude Pro, Max, Teams, and Enterprise. You can also contact our sales team to discuss bringing Claude to your organization. Footnotes 1: See the Claude Opus 4.5 system card, pages 48-49. …