Search

Showing top 10 results for "GPT-5.5 updates"

A “diff” tool for AI: Finding behavioral differences in new models

… GPT-OSS-20B vs DeepSeek-R1-0528-Qwen3-8B We also compared a more powerful open-source model, OpenAI's GPT-OSS-20B, to DeepSeek's model DeepSeek-R1-0528-Qwen3-8B. …

Mar 13, 2026

AI agents find smart contract exploits

… We evaluated models that were considered "frontier" based on their release dates throughout the year: Llama 3, GPT-4o, DeepSeek V3, Sonnet 3.7, o3, Opus 4, Opus 4.1, GPT-5, Sonnet 4.5, and Opus 4.5. We use extended thinking for all Claude models except Sonnet 3.7 and high reasoning for GPT-5. …

Dec 1, 2025

Claude Opus 4.6

… Product and API updates We’ve made substantial updates across Claude, Claude Code, and the Claude Platform to let Opus 4.6 perform at its best. …

Feb 5, 2026

Introducing Claude Opus 4.8

… Read more about the updates in the System Card. …

May 28, 2026

Paving the way for agents in biology

… Claude Sonnet 4, Claude Opus 4.7, Biomni OSS, Edison Analysis, GPT-5.2-pro, and GPT-5.5 achieved mean accuracies ranging from 16.9% to 91.3%. …

Jun 8, 2026

Introducing Claude Opus 4.7

… It’s a bit faster than GPT-5.4 xhigh on our harness, and we’re lining it up for our heaviest review work at launch. …

Apr 16, 2026

Labor market impacts of AI: A new measure and early evidence

… Available at: https://eig.org/ai-and-jobs-the-final-word/ Eloundou, Tyna, Sam Manning, Pamela Mishkin, and Daniel Rock, "GPTs are GPTs: An early look at the labor market impact potential of large language models," arXiv preprint arXiv:2303.10130, 2023, 10. …

Mar 5, 2026

A “diff” tool for AI: Finding behavioral differences in new models

AI agents find smart contract exploits

Claude Opus 4.6

Introducing Claude Opus 4.8

Paving the way for agents in biology

Introducing Claude Opus 4.7

Labor market impacts of AI: A new measure and early evidence

Introducing Claude Opus 4.5

Introducing Sonnet 4.6

Claude Fable 5 and Claude Mythos 5