Google says these AI models are best for coding Android apps
…72.4% Claude Opus 4.6: 66.6% GPT-5.2 Codex: 62.5% Claude Opus 4.5: 61.9% Gemini 3 Pro Preview: 60.4% Claude Sonnet 4.6: 58.4…
Users will find Opus 4.8 to be a modest but tangible improvement on its predecessor. There’s still more to be done: we’re working on developing and releasing models that provide many of the same capabilities as Opus at a lower cost. Not only that, but we plan to release a new class of model with even higher intelligence than Opus. As part of Project Glasswing, a small number of organizations are currently using Claude Mythos Preview for cybersecurity work. Models of this capability level require stronger cyber safeguards before they can be generally released. We’re making swift progress on dev
Introducing Claude Opus 4.8…72.4% Claude Opus 4.6: 66.6% GPT-5.2 Codex: 62.5% Claude Opus 4.5: 61.9% Gemini 3 Pro Preview: 60.4% Claude Sonnet 4.6: 58.4…
…Safeguards Alongside the release of Claude Opus 4.6, we're introducing a new layer of detection to support our Safeguards team in identifying and responding to cyber misuse of Claude. At…
…For example, every new session starts with similar sets of instructions, but I also instruct Claude to update the file on every new milestone, and by the end, the context becomes overloaded…
…update, which is an assessment of how safe Mythos is and how it might cause harm through its actions, says this: “The difference in capabilities between Mythos Preview and Claude Opus 4…
Hi HN,I’m one of the builders of Rayline.Rayline is a Claude Code compatible LLM gateway. It intercepts and overrides claude code’s internal routing and lets you route subagent calls to different models instead. For exam…
As an anthropic fan boy(check my prev. comments), this is the first opus release where I feel like the model is just not pleasant to talk to not to mention untrustworthy.The two examples for me where I lost confidence in…
I built adamsreview, a Claude Code plugin that runs deeper, multi-stage PR reviews using parallel sub-agents, validation passes, persistent JSON state, and optional ensemble review via Codex CLI and PR bot comments.On my…
Sharing a small Mac app I built around OpenAI’s gpt-realtime-2 model. You call up a voice coding agent and talk to it like you’d talk to a freelancer ("make the hero tighter, put a product image on the right, that one's …
I really wanted to see how far I can go. Can I create a meaningful and complex application, big enough, but without knowing the language.I have 18+ years of experience as software developer. But I have no experience with…
…Another factor is that the large one-million-token context window available on paid plans with the Claude Opus 4.6 or Sonnet 4.6 models increases costs, especially with cache misses…
…Claude Mythos goes public in new Fable 5 model that’s ‘safe for general use’ Claude Opus 4.8 launches today with agentic improvements, new features Google sues cybercrime network that used…
…Sign in to your XDA account I like Claude Code, but I don't like its limitations. You're largely restricted to Anthropic's models like Claude Sonnet and Opus. They're…
…Mythos is markedly different from Claude Opus 4.6, which Anthropic only recently said was not very skilled at developing working exploit code. Where Opus 4.6 managed an exploit development success…
…As a first step, we’ll be retiring Opus 4.6 Fast for Copilot Pro+ users, beginning today. We recommend using Opus 4.6 as an alternative model with similar capabilities. Jun…
…Anthropic Signs SpaceX Colossus 1 Data Center to Boost Capacity If you have been using Claude Opus over the past few weeks, you have likely noticed errors, rate limits, and many have…