OSMO Platform
…Prompt to running pipeline, from synthetic data and training to SIL and HIL evaluation, no infrastructure expertise required. How It Works Define your entire physical AI pipeline in a single YAML file…
DOCA Argus is the runtime threat detection microservice that provides real-time visibility and situational awareness across the AI factory. Argus is the foundation of the DOCA security stack. Running on BlueField data and storage processors, DOCA Argus continuously observes workload behavior at runtime using advanced memory analysis, enabling organizations to detect threats, monitor integrity, and understand operational state without impacting AI workload performance. Unlike traditional host-based security approaches, DOCA Argus operates independently from the compute node it protects. By leve
Advancing AI Infrastructure for Agentic AI with NVIDIA DOCA In-Silicon Security | NVIDIA Technical BlogPurpose-built for AI infrastructure, NVIDIA BlueField DPUs combine high-performance networking, programmable compute, hardware acceleration, and advanced security capabilities into a single platform embedded into every AI factory compute node. Unlike traditional security approaches that rely on host system software, BlueField establishes a hardware-enforced, in-silicon, and workload-independent security layer. Operating within its own trusted execution domain, BlueField isolates infrastructure and security services from the host system. Monitoring, policy enforcement, and telemetry operate eve
Advancing AI Infrastructure for Agentic AI with NVIDIA DOCA In-Silicon Security | NVIDIA Technical Blog…Prompt to running pipeline, from synthetic data and training to SIL and HIL evaluation, no infrastructure expertise required. How It Works Define your entire physical AI pipeline in a single YAML file…
…Oracle is a key early adopter, integrating the MCG toolkit into its OCI AI infrastructure to enhance model documentation and GPU resource optimization within dedicated AI clusters and cloud environments. AI-generated…
…This generational increase allows AI factories to scale pods, services, and tenants while also advancing infrastructure operations, efficiency, and cybersecurity. Infrastructure acceleration at AI factory scale In traditional systems, infrastructure services run…
…AI factories powered by NVIDIA bring industrial-grade discipline to AI, changing infrastructure into a strategic engine for speed, reliability, and accelerated innovation. Infrastructure is one of the five layers of AI…
…Updated on March 16, 2026, with new AI infrastructure. Discuss (0) Discuss (0) Tags Networking / Communications | General | BlueField DPU | DOCA | Dynamo | Spectrum-X Ethernet | Intermediate Technical | Deep dive | AI Agent | AI Factory…
…We plan to continue investing in making it simple to run large-scale AI training infrastructure. If you have questions or want to share what you’re building, visit SlinkyProject on GitHub…
…Previously he has held several senior engineering roles at Meta Platforms (Facebook), including leading AI infrastructure resource management and data infrastructure initiatives. With over a decade of experience in building and scaling…
…NVIDIA Vera Rubin DSX AI factory platform NVIDIA Vera Rubin DSX is the AI factory platform that provides a blueprint and reference design for co-designed AI infrastructure from chip to grid…
…Learn more AI is evolving, and reasoning models are increasing token demand, placing new requirements on every layer of AI infrastructure. More than ever, compute must scale efficiently to maximize token production…
…View all posts by Jill Foster View all posts by Jill Foster About Daniel Kim Daniel Kim is a senior AI infrastructure engineer at NVIDIA on the Global Compute Infrastructure team. He…