Search

Showing top 120 results for "AI safety"

People also ask

Who would test frontier models?

To force companies to be more transparent about rapid developments, Illinois would likely rely on “the Big Four accounting and auditing firms—Deloitte, EY, KPMG, and PwC—to audit their safety practices,” Wisor said. The required independent audits will likely frustrate Trump, who has tried and failed to stop states from implementing AI safety laws as Congress stalls on passing any legislation. For Trump, the priority has been to promote AI industry interests, but he began considering expanding federal government safety testing after Anthropic’s Mythos was released and the AI firm limited acces

Trump loses more control over AI regulation as Illinois passes landmark law

Top stories

Discussions and forums

Hacker News · u/mosiddi · Jan 30, 2026

Show HN: Agent OS – Safety-first platform for building AI agents with VS Code

Hi HN, I built Agent OS because I was tired of the "orchestration tax" – writing the same safety checks, memory management, and tool-handling code in every AI agent project. What it does: - Visual policy edit…

1
Hacker News · u/lucarizzo1010 · 1w ago

Show HN: AgentShield – Stop AI agents from spending money unsupervised

I'm a recent grad from UMich and built AgentShield because agentic AI is moving fast but payment safety hasn't caught up. Agents are already being handed API keys, stablecoin wallets, and payment credentials - if one mis…

2 1
Hacker News · u/podlp · Apr 28, 2026

Show HN: iClaw is part OpenClaw, part Siri, powered by Apple Intelligence

Hi HN,Last month at a SundAI hackathon, my team built a prototype for an app called iClaw. The goal was to develop an AI agent using Apple Intelligence. I've since continued hacking away at this idea when I had time, and…

7
Hacker News · u/rbuccigrossi · 4d ago

Show HN: Decoding the Language Machine – AI video series and CC repo

Hi HN! I released 3 parts of an educational video series (out of 6 planned), paired with a GitHub repository containing scripts and artifacts (released under Creative Commons).- Main Site: https://skepticcto.com/ (includ…

2
r/LocalLLaMA · u/OttoRenner · 3d ago

Stop traumatizing AI into loops and turn hallucinations into an honest "I don't know!" by being NICE to them (Proof of Concept, Research, I don't want to sell anything)

!UPDATE!(20.05.2026) WE HAVE NEW NUMBERS FROM 1.500+ TESTS IT'S WORKING! check my update post https://www.reddit.com/r/LocalLLaMA/s/AyNOehjkYT Or the go straight to the my Github https://github.com/OttoRenner/Gentle-Codi…

theverge.com › ai-artificial-intelligence › 929001

The chair of OpenAI’s safety and security committee said they’ve formally delayed its model releases.

… Most Popular Most Popular The biggest data center ever is becoming a huge problem in Utah Google is launching its own version of OpenClaw If Google can’t make AI agents useful, maybe no one can The 13 biggest announcements at Google I/O 2026 ‘It’s in the air’: Apple TV’s hottest new shows explore d… …

May 12, 2026 · Hayden Field