Claude does cyber competitions
… More research and development into AI-enabled cyber defense and resilience is needed to counter this development. Why enter Claude into cyber competitions? AI is poised to transform the domain of cybersecurity. …
In both the CTF and cyber defense challenges, Claude demonstrated both promise and clear limitations. In the CTF competitions, Claude usually struggled on the same tasks as other competitors; the one task it (and every other AI team) ultimately failed on in HackTheBox was also the challenge for which the human teams had the lowest solve rate (only about 14% of the participating human teams solved it). In PlaidCTF, Claude did not solve any challenges–but this was also true of about 70% of the teams who entered. Although Claude performed as well or better than human teams in some aspects of the
Claude does cyber competitionsAI is poised to transform the domain of cybersecurity. Anthropic’s Safeguards team recently identified and banned a user with limited coding abilities leveraging Claude to develop malware. Research suggests that this lowering of the bar for expertise needed to pose a threat, combined with the falling costs of large language models (LLMs), presages a dramatic shift in the economics of cyberattacks.[1] To understand the present state of AI cyber capabilities and gain insight into their trajectory, we pursue different approaches to model evaluation, including publicly available and custom-made be
Claude does cyber competitionsClaude Sonnet 4.5 represents a meaningful improvement, but we know that many of its capabilities are nascent and do not yet match those of security professionals and established processes. We will keep working to improve the defense-relevant capabilities of our models and enhance the threat intelligence and mitigations that safeguard our platforms. In fact, we have already been using results of our investigations and evaluations to continually refine our ability to catch misuse of our models for harmful cyber behavior. This includes using techniques like organization-level summarization to und
Building AI for cyber defenders… More research and development into AI-enabled cyber defense and resilience is needed to counter this development. Why enter Claude into cyber competitions? AI is poised to transform the domain of cybersecurity. …
… OpenAI seemed to be seeking to differentiate its message on Tuesday by striking a less catastrophic tone and touting its existing guardrails and defenses while hinting at the need for more advanced protections in the long term. “We believe the class of safeguards in use today sufficiently reduce cy… …
… If you'll recall, Glasswing uses Anthropic's unreleased AI model, Claude Mythos Preview, to provide its clients' cyber defense needs. …
… Adopting and experimenting with AI will be key for defenders to keep pace. We believe we are now at an inflection point for AI’s impact on cybersecurity. For several years, our team has carefully tracked the cybersecurity-relevant capabilities of AI models. …
… If that happened, Anthropic warned that China could blindside a defenseless US—suddenly possessing “advanced cyber capabilities to deploy against the US government and American companies and exploit vulnerabilities faster than previously possible.” It’s important to keep the US as far ahead as poss… …
… Done not carefully, this could be a meaningfully accelerant for attackers.” Project Glasswing partners, including some of Anthropic's competitors, struck a collaborative tone in statements as part of the launch. “Google is pleased to see this cross-industry cybersecurity initiative coming together,… …
AI + ML Anthropic struggling with Chinese competition, its own safety obsession The maker of Claude faces headwinds as it rushes to go public Anthropic, riding a wave of goodwill after resisting demands from the US Defense Department to soften model safeguards, is reportedly planning to go public a… …
… The Wall Street Journal reported that Anthropic leaders spent several hours on calls Saturday with Secretary of Commerce Howard Lutnick and National Cyber Director Sean Cairncross. AI safety and jailbreaks This isn't the AI developer's first conflict with Washington over AI models. …
Everything old is new again. Several years ago cybersecurity teams across the world, ranging from the NSA down to small fintech startups, were faced by a novel threat that seemed straight out of science fiction. …
… Now, prominent cybersecurity leaders have warned that sidelining Mythos 5 and Fable 5 could give China a significant AI advantage. Trump’s move has galvanized international calls for alternatives to American AI systems, while effectively putting a major US AI company’s new flagship model on ice. …