Search

Showing top 49 results for "agent safety focus"

All sources anthropic.com 26 xda-developers.com 12 developer.nvidia.com 3 wired.com 2 theverge.com 2 huggingface.co 1 theregister.com 1 404media.co 1 en.wikipedia.org 1

I used Claude Code, Google Antigravity, and Codex for a month and I have a clear winner for you

… Anthropic was founded in 2021 with a strong focus on AI safety research. 02 / 8 Safety What is the name of the safety and values framework Anthropic developed to guide Claude's behavior? …

May 11, 2026 · Parth Shah

Teaching Claude why

… Thus, after Claude 4, it was clear we needed to improve our safety training and, since then, we have made significant updates to our safety training. We use agentic misalignment as a case study to highlight some of the techniques we found to be surprisingly effective. …

May 8, 2026

Transform Video Into Instantly Searchable, Actionable Intelligence with AI Agents and Skills | NVIDIA Technical Blog

May 13, 2026 · Samuel Ochoa

In the Wake of Anthropic’s Mythos, OpenAI Has a New Cybersecurity Model—and Strategy

… Anthropic also announced an industry coalition, including competitors like Google, focused on how advances in generative AI across the field will impact cybersecurity. …

Apr 14, 2026 · Lily Hay Newman

Australian government and Anthropic sign MOU for AI safety and research

… Central to the MOU is a commitment to work with Australia’s AI Safety Institute. We will share our findings on emerging model capabilities and risks, participate in joint safety and security evaluations, and collaborate on research with Australian academic institutions. …

Mar 31, 2026

Researchers gaslit Claude into giving instructions to build explosives

… While Garraghan says other chatbots are equally vulnerable to the kind of social attack the researchers used on Claude, they focused on Anthropic given the company’s self-proclaimed attention to safety and strong performance in other red-teaming efforts, including a study testing whether chatbots w… …

May 5, 2026 · Robert Hart

Inside OpenAI’s Race to Catch Up to Claude Code

… By December 2024, several small groups inside of OpenAI were starting to focus on AI coding agents. …

Mar 11, 2026 · Maxwell Zeff

Advancing Claude in healthcare and the life sciences

… A selection of our partners describe their experiences using Claude below: We were drawn to Anthropic's focus on AI safety and Claude's Constitutional AI approach to creating more helpful, harmless, and honest AI systems. …

Jan 11, 2026

I use OpenCode over Claude Code, and it's every bit as good

… Anthropic was founded in 2021 with a strong focus on AI safety research. 02 / 8 Safety What is the name of the safety and values framework Anthropic developed to guide Claude's behavior? …

May 2, 2026 · Mahnoor Faisal

Run Autonomous, Self-Evolving Agents More Safely with NVIDIA OpenShell | NVIDIA Technical Blog

… NVIDIA NemoClaw uses open source models—like NVIDIA Nemotron —alongside the NVIDIA OpenShell runtime , which is part of the NVIDIA Agent Toolkit. By combining powerful open source models with built-in safety measures, NemoClaw simplifies and secures AI agent deployment. …

Mar 16, 2026 · Ali Golshan

Followed topics