Search: Benchmarks and reliability

NVIDIA and ServiceNow Partner on New Autonomous AI Agents for Enterprises

… NOWAI-Bench includes EnterpriseOps-Gym, one of the industry’s most challenging enterprise agent benchmarks, where Nemotron 3 Super currently ranks No. …

May 5, 2026 · Kari Briski

OpenAI’s New GPT-5.5 Powers Codex on NVIDIA Infrastructure — and NVIDIA Is Already Putting It to Work

… The cluster completed multiple large-scale training runs and set a new benchmark for system-level reliability at frontier scale. …

Apr 23, 2026 · Justin Boitano

National Robotics Week — Latest Physical AI Research, Breakthroughs and Resources

… This integration lets developers easily develop and deploy embodied AI techniques for underwater applications. RoboLab: Benchmarking the Next Generation of Generalist Robots. RoboLab is a high-fidelity simulation benchmark for developing and …

Apr 10, 2026 · NVIDIA Writers

What’s the Difference Between Deep Learning Training and Inference?

… Complementing these traditional benchmarks, open-source initiatives like InferenceMAX provide an additional layer of transparency and reliability. …

Jul 29, 2016 · Kyle Aubrey

New NVIDIA Nemotron 3 Super Delivers 5x Higher Throughput for Agentic AI

… 1 position on DeepResearch Bench and DeepResearch Bench II leaderboards, benchmarks that measure an AI system’s ability to conduct thorough, multistep research across large document sets while maintaining reasoning coherence. …

Mar 11, 2026 · Kari Briski

Enterprises Onboard AI Teammates Faster With NVIDIA NeMo Tools to Scale Employee Productivity

… NeMo Evaluator simplifies the evaluation of AI models and workflows on custom and industry benchmarks with just five application programming interface (API) calls. …

Apr 23, 2025 · Joey Conway

NVIDIA and Partners Show That Software-Defined AI-RAN Is the Next Wireless Generation

… New benchmarking results from partners like SynaXG showed that AI-RAN running on NVIDIA platforms delivers high-speed, carrier-grade performance — meaning extreme reliability — across multiple 5G spectrum bands. …

Mar 1, 2026 · Kanika Atri

From Simulation to Production: How to Build Robots With AI

… Isaac Lab-Arena connects to industrial and academic benchmarks such as LIBERO, RoboTwin and NIST so developers can easily evaluate their progress. …

Mar 18, 2026 · Katie Washabaugh

NVIDIA Launches Earth-2 Family of Open Models — the World’s First Fully Open, Accelerated Set of Models and Tools for AI Weather

… On standard benchmarks, it outperforms leading open models on the most common forecasting variables measured by the industry. …

Jan 26, 2026 · Mike Pritchard

NVIDIA GTC 2026: Live Updates on What’s Next in AI

… EPRI is using and testing it to advance AI-powered weather forecasting to strengthen grid reliability. …

Mar 20, 2026 · NVIDIA Writers
