Anthropic says pressure can push Claude into cheating and blackmail
… That’s what researchers at Anthropic sought to find out, and in a recently published research paper, they found that an AI model that’s put under enough pressure may start to deceive, cut corners, or even resort to blackmail. …