Teaching Claude why
…Thus, after Claude 4, it was clear we needed to improve our safety training and, since then, we have made significant updates to our safety training. We use agentic misalignment as a…
Users will find Opus 4.8 to be a modest but tangible improvement on its predecessor. There’s still more to be done: we’re working on developing and releasing models that provide many of the same capabilities as Opus at a lower cost. Not only that, but we plan to release a new class of model with even higher intelligence than Opus. As part of Project Glasswing, a small number of organizations are currently using Claude Mythos Preview for cybersecurity work. Models of this capability level require stronger cyber safeguards before they can be generally released. We’re making swift progress on dev
Introducing Claude Opus 4.8
…For developers, platform engineers, and engineering leaders, this is not an incremental model update. Claude Fable 5 completes multi-step, goal-directed work that previous models could not sustain, and it does…
…started: * The AI-designed car is taking shape * OpenAI’s big Codex update is a direct shot at Claude Code * Microsoft and OpenAI’s famed AGI agreement is dead * OpenAI’s new…
…The results suggest that the new stories were able to effectively “update the prior around Claude’s baseline expectations for AI behavior outside of the Claude persona.” The researchers theorize that this…
Hi HN, I’m one of the builders of Rayline. Rayline is a Claude Code-compatible LLM gateway. It intercepts and overrides Claude Code’s internal routing and lets you route subagent calls to different models instead. For exam…
As an Anthropic fanboy (check my prev. comments), this is the first Opus release where I feel like the model is just not pleasant to talk to, not to mention untrustworthy. The two examples for me where I lost confidence in…
I built adamsreview, a Claude Code plugin that runs deeper, multi-stage PR reviews using parallel sub-agents, validation passes, persistent JSON state, and optional ensemble review via Codex CLI and PR bot comments. On my…
Sharing a small Mac app I built around OpenAI’s gpt-realtime-2 model. You call up a voice coding agent and talk to it like you’d talk to a freelancer ("make the hero tighter, put a product image on the right, that one's …
I really wanted to see how far I can go. Can I create a meaningful and complex application, big enough, but without knowing the language? I have 18+ years of experience as a software developer. But I have no experience with…
…After building a map of resources, server data was passed through OpenAI’s APIs to GPT-4.1 for analysis, producing ~2500 reports which were fed back into Claude Code for exploitation…
…Early Sunday morning, the company posted, “Anthropic’s Opus 4.7 and 4.8 models are experiencing degraded performance, which is causing a higher rate of failures for users selecting these models…
…GPT-4o, GPT-4 Turbo, GPT-4, GPT-3.5 (OpenAI) Claude 3 Opus, Claude 3 Sonnet, Claude 3 Haiku, Claude 2.1, Claude Instant 1.2 (Anthropic) Gemini Pro 1.5…
…Claude Opus 4.6
…When Claude Code briefly vanished from the Pro subscription: One fewer checkmark with no advance notice. On April 21, with no announcement, Anthropic updated its pricing page and the Claude Code support…
…bugs and security issues." To highlight Claude Code Security's bug-hunting potential, the company pointed to how its red team had used Claude Opus 4.6 to find "over 500 vulnerabilities…