Search

Showing top 122 results for "AI safety safeguards"

All sources xda-developers.com 12 wired.com 9 cnet.com 9 engadget.com 8 theverge.com 7 techcrunch.com 7 anthropic.com 6 tomsguide.com 6 macrumors.com 5 fudzilla.com 4 arstechnica.com 4 androidauthority.com 4

Keeping Google Play & Android app ecosystems safe in 2025

…Upgrading Google Play’s AI-powered, multi-layered user protections We’ve seen a clear impact from these safety efforts on Google Play. In 2025, we prevented over 1.75 million policy…

Feb 19, 2026 · Vijaya Kaza

Open models stripped bare by safety-busting tools – Fudzilla.com

AI News May 26, 2026 by Nick Farrell Open models stripped bare by safety-busting tools AI safety guardrails are being ripped off open models, leaving policymakers staring at a fresh and…

May 26, 2026 · Nick Farrell

Roblox Will Roll Out Age-Based Accounts Amid Child-Safety Push

Evolving global regulations aimed at limiting child access to harmful content, as well as incidents and lawsuits related to youth safety online , have prompted Roblox to revise how it handles accounts for…

Apr 13, 2026 · See full bio

Exposed Charging Pins on Valve Steam Controller Puck Raise Safety Questions

Valve is investigating a reported safety issue involving the charging puck bundled with its Steam Controller after a Reddit user claimed the accessory overheated and nearly caused a fire following an accidental…

May 25, 2026 · Hilbert Hagedoorn

Discussions and forums

Hacker News · u/netfortius · May 20, 2026

"An (important) message from Infomaniak's founder"

Hello ...,I'm writing to you as the founder and strategic director of Infomaniak because something important has just happened, and it concerns you directly.I no longer control InfomaniakIt's not a multinational that has…

7 1

Gemini 3.5 Flash can now see your screen, use your computer, take actions — all on its own

…raises questions around safety, especially for enterprise consumers. To mitigate those risks, Google has used targeted adversarial training for the model. It is also introducing two new safeguards built into computer use…

Jun 25, 2026 · Akshay Gangwar

Securing internal systems against increasingly capable and imperfectly aligned AI

June 18, 2026 Responsibility & Safety Securing the future of AI agents Rohin Shah and Four Flynn AI agents are transforming our relationship with technology. By autonomously executing complex tasks — from cyber defence…

Jun 18, 2026 · Rohin Shah and Four Flynn

Strengthening biosecurity in the era of AI

…By stress-testing existing screening systems against AI-designed biological sequences, the project showed both where safeguards could fail and how they could be improved. The effort followed a familiar model from…

Jun 4, 2026 · Eric Horvitz

Anthropic lets the scary AI out – Fudzilla.com

…safeguards to the broader biology and life sciences community so these capabilities can be used to accelerate biomedical research and drug discovery,” a spokesman said Wednesday. TOPICS: AI models · AI safety · anthropic…

Jun 11, 2026 · Nick Farrell

Microsoft restricts Claude Fable for employees over data retention concerns

Anthropic released Claude Fable, its first Mythos-class AI model, yesterday and it’s already causing concerns inside Microsoft. Sources tell me that Microsoft is limiting the use of Claude Fable 5…

Jun 10, 2026 · Tom Warren

Apple Defends Google Against EU Proposal to Give AI Rivals Access to Services

Apple has stepped in to warn that EU proposals to force Google to open Android to competing AI services pose serious risks to user privacy, security, and safety. Apple's latest submission…

May 13, 2026 · Hartley Charlton

Followed topics