Search: AI safety safeguards

OpenAI introduces new 'Trusted Contact' safeguard for cases of possible self-harm | TechCrunch

… The Trusted Contact feature follows the safeguards the company introduced last September that gave parents the power to have some oversight of their teens’ accounts, including receiving safety notifications designed to alert the parent if OpenAI’s system believes their child is facing a “serious sa… …

May 7, 2026 · Lucas Ropek

Anthropic's safety warnings may have just backfired — the government has pulled the plug on its most powerful AI | TechCrunch

… Anthropic’s broader argument is that its strongest safeguards operate through independent classifier systems that function separately from the model itself, meaning that even if someone convinces Fable to keep talking past a refusal, the underlying protections against the most dangerous outputs rem… …

Jun 13, 2026 · Connie Loizos

xAI fired an engineer who raised alarms about Grok safety, new lawsuit claims | TechCrunch

… Instead the claim targets Kim’s supervisor, xAI co-founder Jimmy Ba — who left the company earlier this year — saying that Ba ignored Musk’s directives and retaliated against Kim for pushing for safeguards, in an effort to “silence his repeated complaints about AI safety and biases.” The lawsuit po… …

Jun 10, 2026 · Rebecca Bellan

Meta will use AI to analyze height and bone structure to identify if users are underage | TechCrunch

… Aisha holds an honours bachelor’s degree from University of Toronto and a master’s degree in journalism from Western University. You can contact or verify outreach from Aisha by emailing aisha@techcrunch.com or via encrypted message at aisha malik.01 on Signal. …

May 5, 2026 · Aisha Malik

Trump Admin releases Anthropic Mythos to be used by more than 100 US companies, agencies | TechCrunch

… OpenAI anymore AI The White House is asking OpenAI to slow roll the release of its new model over safety concerns AI Anthropic’s Claude is winning over paid consumers, a market owned by ChatGPT Latest in AI In Brief Trump Admin releases Anthropic Mythos to be used by more than 100 US companies, age…

Jun 27, 2026 · Julie Bort

Waymo issues recall to deal with a flooding problem | TechCrunch

… "We are working to implement additional software safeguards and have put mitigations in place, including refining our extreme weather operations during periods of intense rain, limiting access to areas where flash flooding might occur." Topics autonomous vehicles , avs , robotaxis , self-driving ca… …

May 12, 2026 · Sean O'Kane

OpenAI faces investigation from state attorneys general | TechCrunch

… Earlier this month, Florida Attorney General James Uthmeier sued OpenAI and its CEO Sam Altman , claiming that OpenAI and Altman “ignored internal and external safety warnings, put children at great risk, and allowed a dangerous product to reach millions of Floridians.” Altman recently apologized t… …

Jun 13, 2026 · Anthony Ha

Encryption, spyware, and now Mythos: History shows why cyber export control doesn't work | TechCrunch

… Amazon CEO Andy Jassy also reportedly alerted the administration after Amazon’s own researchers, he said, found a way around Fable 5’s safeguards. Anthropic disputes the “jailbreak” label, calling it a narrow, already-patched issue rather than a wholesale defeat of the model’s safety measures. …

Jun 19, 2026 · Lorenzo Franceschi-Bicchierai

Followed topics