Search

Showing top 127 results for "Claude Opus update" · filtered from 133 indexed

Videos

Measuring AI agent autonomy in practice

…One of the most widely cited capability assessments is METR’s “Measuring AI Ability to Complete Long Tasks,” which estimates that Claude Opus 4.5 can complete tasks with a 50% success…

Feb 18, 2026

Anthropic - Wikipedia

…Claude 3 Opus, Claude 3 Sonnet, and Claude 3 Haiku, in decreasing order of performance. [ 57 ] In June 2024, it released Claude 3.5 Sonnet. [ 58 ] In May 2025, Anthropic released Claude…

Apr 11, 2024 · Contributors to Wikimedia projects

Cybersecurity

…Richard Lawler May 13 AI cybersecurity updates for MDASH, Mythos, and GPT-5.5. On Wednesday, the AISI, which evaluates AI models for the British government, said both Anthropic’s Claude Mythos…

May 18, 2026

KI-Update kompakt: KI-Redenschreiber, Fable & Mythos-Sperre, KI-Sorgen, Apple-KI

…Im BIRD-Benchmark liegt das Tool deutlich vor OpenAIs GPT-5.5 und Anthropics Claude Opus 4.6. Die Aufgabe ist besonders anspruchsvoll, da Daten vielschichtig sind und komplexe geschäftliche Zusammenhänge berücksichtigt…

Jun 15, 2026 · Isabel Grünewald

Discussions and forums

Hacker News · u/davidvgilmore · 3w ago

Show HN: Rayline routes Claude Code subagents to on-device and cheaper models

Hi HN,I’m one of the builders of Rayline.Rayline is a Claude Code compatible LLM gateway. It intercepts and overrides claude code’s internal routing and lets you route subagent calls to different models instead. For exam…

10 8

Hacker News · u/mesmertech · Jun 1, 2026

Ask HN: Anyone else seeing serious degradation in DX with Opus 4.8?

As an anthropic fan boy(check my prev. comments), this is the first opus release where I feel like the model is just not pleasant to talk to not to mention untrustworthy.The two examples for me where I lost confidence in…

Hacker News · u/sminchev · May 18, 2026

I created a 126K line Android app with AI – the workflow that worked for me

I really wanted to see how far I can go. Can I create a meaningful and complex application, big enough, but without knowing the language.I have 18+ years of experience as software developer. But I have no experience with…

Claude Code installed my IDE, and now I can't go back to manual setup

…Challenge Complete Your Score / 8 Thanks for playing! The default is Sisyphus, the main orchestrator powered by Claude Opus 4.6. It can plan, delegate, and execute tasks with a 32K budget…

Apr 16, 2026 · Joe Rice-Jones

Coding agents in the social sciences

…Our survey came around two months after a flurry of discussion about Claude Code and Opus 4.6 that kicked off in late December of 2025. Yet even among interested respondents who…

May 27, 2026

Assessing Claude Mythos Preview’s cybersecurity capabilities

…Use generally available frontier models to strengthen defenses now . Current frontier models, like Claude Opus 4.6 (and those of other companies), remain extremely competent at finding vulnerabilities , even if they are…

Apr 7, 2026

How AI Is Transforming Work at Anthropic

…At the time this data was collected, Claude Sonnet 4 and Claude Opus 4 were the most capable models available, and capabilities have continued to advance. More capable AI brings productivity benefits…

Dec 2, 2025

How we contain Claude across products

…On Gray Swan's Agent Red Teaming benchmark, which tests susceptibility to prompt injection, Claude Opus 4.7 holds attack success to roughly 0.1% on single attempts, and around 5–6…

May 25, 2026

A cheat sheet to slash commands in GitHub Copilot CLI

…Claude Opus 4.5 (Preview) (1x) 3. Claude Haiku 4.5 (0.33x) 4. Claude Sonnet 4 (1x) 5. GPT-5.1 (1x) 6. GPT-5.1-Codex-Mini (0.33x) 7…

Jan 21, 2026 · Jacklyn Carroll

Followed topics