Search

Showing top 8 results for "AI safety safeguards"

techcrunch.com › 2026 › 06 › …

Anthropic's safety warnings may have just backfired — the government has pulled the plug on its most powerful AI | TechCrunch

… Anthropic’s broader argument is that its strongest safeguards operate through independent classifier systems that function separately from the model itself, meaning that even if someone convinces Fable to keep talking past a refusal, the underlying protections against the most dangerous outputs rem… …

Jun 13, 2026 · Connie Loizos
techcrunch.com › 2026 › 06 › …

Encryption, spyware, and now Mythos: History shows why cyber export control doesn't work | TechCrunch

… Amazon CEO Andy Jassy also reportedly alerted the administration after Amazon’s own researchers, he said, found a way around Fable 5’s safeguards. Anthropic disputes the “jailbreak” label, calling it a narrow, already-patched issue rather than a wholesale defeat of the model’s safety measures. …

Jun 19, 2026 · Lorenzo Franceschi-Bicchierai