Claude’s new model is more ‘honest’ when it messes up
… Higher-effort responses will use more tokens, giving users the option of lower-effort responses if they don’t want to burn through their rate limits as quickly. …
… Higher-effort responses will use more tokens, giving users the option of lower-effort responses if they don’t want to burn through their rate limits as quickly. …
… Testers have found Opus 4.8 to be "more reliable and sharper in its judgement" when doing agentic tasks, and the model also made gains in honesty. Early testers report that Opus 4.8 is more likely to flag uncertainties about its work and less likely to make unsupported claims. …
… Learn more. General technology Claude Opus 4.8 is more honest, less deceptive, and considerably cheaper The company is also adding a couple of new features to Claude. …
… The model completely refused to flatter and exposed the self-delusion with intellectual honesty. …
… Anthropic developed Constitutional AI CAI , a technique that trains Claude using a set of principles — a 'constitution' — to guide its responses toward being helpful, harmless, and honest. …
… Anthropic developed Constitutional AI CAI , a technique that trains Claude using a set of principles — a 'constitution' — to guide its responses toward being helpful, harmless, and honest. …
… I started questioning the process more. In one test, the response was around 300 words, and it took more than 2 minutes. Normally, I’d read it in 80–90 seconds, so this was noticeably slower than my natural reading pace. Yet I engaged more. I wasn’t skimming anymore; I was following. …
… Anthropic developed Constitutional AI CAI , a technique that trains Claude using a set of principles — a 'constitution' — to guide its responses toward being helpful, harmless, and honest. …
… A tool that remembers more also creates more responsibility. The privacy concern isn’t theoretical. Development sessions can include tokens, local paths, API responses, private project names, and other details that were never meant to become durable notes. …
… Anthropic developed Constitutional AI CAI , a technique that trains Claude using a set of principles — a 'constitution' — to guide its responses toward being helpful, harmless, and honest. …