6 Months of AI Radio Went About as Badly as You'd Expect
…Andon Labs used the latest versions of four AI models over several months, but ultimately settled on Claude Opus 4.7, GPT-5.5, Gemini 3.1 Pro and Grok 4.3…
Users will find Opus 4.8 to be a modest but tangible improvement on its predecessor. There’s still more to be done: we’re working on developing and releasing models that provide many of the same capabilities as Opus at a lower cost. Not only that, but we plan to release a new class of model with even higher intelligence than Opus. As part of Project Glasswing, a small number of organizations are currently using Claude Mythos Preview for cybersecurity work. Models of this capability level require stronger cyber safeguards before they can be generally released. We’re making swift progress on dev
Introducing Claude Opus 4.8…Andon Labs used the latest versions of four AI models over several months, but ultimately settled on Claude Opus 4.7, GPT-5.5, Gemini 3.1 Pro and Grok 4.3…
…I was trying to replace cloud models with local models for the wrong reasons; there was no way a 12B local model could outrun Opus 4.7 in deep reasoning and large…
…Axios added that the Trump administration had previously tried to stop Anthropic from releasing the model, but failed. The report also noted that, under the export control directive, a license would be…
…OpenAI’s GPT-4o (before the highly sycophantic and since-sunset GPT-5), GPT-5.2, xAI’s Grok 4.1 Fast, Google’s Gemini 3 Pro, and Anthropic’s Claude Opus…
Claude Code Degraded Before Opus 4.8 Release
Claude Sonnet 5 Could Be Released Later Today, May Not Be Better Than Opus 4.8
Hello there,Long time listener, first time caller over here. I’ve been meaning to try A.I coding capabilities for some time now. I wanted to focus on a problem that is simple enough for me to execute all the way to the f…
Announcements Agents for financial services May 5, 2026 We’re releasing ten ready-to-run agent templates for the most time-consuming work in financial services: building pitchbooks, screening KYC files, and…
…It runs the same Claude models already available to everyone today (including Claude Opus 4.8), with no special access and no gating.” The workbench builds on Anthropic’s October 2025 launch…
…Based on Claude Opus 4.7, Anthropic's just-released, more costly model, Claude Design is accessible via the palette icon on the Claude.ai left-hand navigation frame to Pro, Max…
Alignment Teaching Claude why May 8, 2026 Last year, we released a case study on agentic misalignment . In experimental scenarios, we showed that AI models from many different developers sometimes took egregiously…
…Mythos scored 93.9% on the SWE-bench Verified (which is the industry-standard benchmark for autonomous software) compared to Claude Opus 4.6's 80.8%. For context, Google's flagship…
…Anthropic has faced mounting criticism in recent months over aggressive Claude rate limits, especially surrounding Claude Code and Opus usage. The SpaceX deal appears designed to directly address those bottlenecks while positioning…
To show you the most relevant results, we’ve omitted some entries very similar to those already shown. Repeat the search with the omitted results included.