Can AI Really Build Better AI?
…Jeff Clune , a computer scientist at the University of British Columbia who worked on both DGMs and the AI Scientist, says that improving AI with AI is “one of hottest topics in…
Current guardrails are a mix of industry best practices, voluntary screening and government regulation. Many synthesis companies willingly participate in sequence screening initiatives that flag orders matching known pathogens or dangerous sequences. Some generative AI companies also apply content and model-safety policies that block prompts about biological-harm instructions. But in the open letter, experts across the fields of science, technology, public policy, academia and law are proposing tougher measures. The public letter shows a consensus that the risks of AI-developed bioweapons des
AI Leaders Call for Rules on Synthetic DNA to Limit Bioweapons RiskIf an AI system is too cautious, it might refuse legitimate nuclear engineering coursework. Too permissive, and it could inadvertently assist bad actors. Our classifier appears to strike the right balance. In preliminary testing with synthetic data, we achieved a 94.8% detection rate for nuclear weapons queries and zero false positives (overall, 96.2% of the classifier’s labels in this test were accurate as shown in Figure 2), suggesting this system would not flag legitimate educational, medical, or research discussions as concerning. This precision matters because nuclear conversations in AI
Developing Nuclear Safeguards for AI…Jeff Clune , a computer scientist at the University of British Columbia who worked on both DGMs and the AI Scientist, says that improving AI with AI is “one of hottest topics in…
…But it’s likely, and some say inevitable, that future AI-powered weapons will eventually be able to operate with complete autonomy, leading to a watershed moment in the history of warfare…
…If your government tends to be a little too authoritarian, it could be used for bad things.” LeCun has grappled with issues related to AI safety and security before. He notes that…
…As part of this, she's explored how AI Darth Vader sets a dangerous precedent , and was one of the first reporters to look into generative AI dating companions . Sign in to…
Last Friday, citing unspecified national security concerns, the White House ordered Anthropic to restrict the export of its powerful AI models Fable and Mythos to anyone outside of the United States, as…
…Just having all these people in the room talking about AI safety, security, fairness, governance, and other challenges which come with designing, building and evaluating AI technologies, forming partnerships and collaborations was…
…an AI executive order. It was pretty toothless in the end. It just said they had to talk about what their models were capable of and release some safety testing. And then…
…Between Black & White and Google DeepMind, the trajectory is clear: AI escaped containment - from the relative safety of video game worlds to the real one where the risks are existential. Writing for…
…Despite how difficult this is there are thankfully few major leaks and exploits, especially in banking and finance. Frontier AI -the new models that haven’t been deployed to the general public…
…Related Anthropic just dropped its core AI safety promise, and that should worry you History doesn't repeat itself, but AI companies sure do. Why is Anthropic keeping Mythos under wraps? For…