Rude to ChatGPT? Don't be surprised if it gets weird
… Aside from how they’re treated, some models are inherently “happier” than others, the researchers said–and interestingly, the largest models tend to be the least happy. …
… Aside from how they’re treated, some models are inherently “happier” than others, the researchers said–and interestingly, the largest models tend to be the least happy. …
… Anthropic also shared some “concerning hints related to evaluation awareness”--meaning that Opus 4.8 showed signs that it knew it was being tested--while noting a “tendency for the model to reason about how its outputs will be graded.” Those concerns aren’t unique to Opus 4.8; indeed, the latest “f… …
On Tuesday, Anthropic unveiled its latest AI model called Claude Mythos. This "general-purpose, unreleased frontier model" is so impressively powerful that Anthropic is wary of releasing it to the public at large. …
… A step down from the high-end Opus model is Sonnet 4.6, another thinking model that’s better suited for everyday tasks like crunching numbers in Excel or other office tasks. …
… A newer player in the agentic AI coding field is Google Antigravity, which lets you code with agents powered by Google’s Gemini models as well as Claude and OpenAI’s open-weight GPT-OSS models. …
… But aside from model training and security concerns, there’s another factor to consider before spilling your secrets to ChatGPT, Claude, Gemini, or other AI chatbots: the long arm of the law. …
… Eyeing my nearly empty Claude usage gauge, I downshifted the model to the cheaper Sonnet 4.6 as Claude cleaned up my mess. …