Search

Showing top 122 results for "AI safety safeguards"

People also ask

What’s next?

As noted above, we have deployed the classifier as an experimental addition to our Safeguards framework, monitoring a percentage of Claude traffic. Its real-world performance has confirmed that the classifier works effectively beyond our testing environment. Whereas our synthetic test data provided clear examples of harmful and benign exchanges, the distribution of actual user traffic proved more complex and surprising, yet the classifier still performed well. One example of how real-world deployment differs from testing is that the classifier flagged certain conversations about nuclear weapon

Developing Nuclear Safeguards for AI

Discussions and forums

news.microsoft.com › source › 2025 › …

Microsoft Dragon Copilot provides the healthcare industry’s first unified voice AI assistant that enables clinicians to streamline clinical documentation, surface information and automate tasks - Source

…and compliance safeguards for accurate and safe AI outputs. They also align to Microsoft’s responsible AI principles to help guide AI development and use —transparency, reliability and safety, fairness, inclusiveness, accountability…

Mar 3, 2025 · Microsoft Source