Search

Showing top 139 results for "AI coding models"

CUDA-X

…NVIDIA PhysicsNeMo An open source Python framework for building, training, and fine-tuning AI physics models at scale. NVIDIA Earth-2 A comprehensive family of open models, libraries, and frameworks that democratize…

Post-Training Quantization of LLMs with NVIDIA NeMo and NVIDIA TensorRT Model Optimizer | NVIDIA Technical Blog

Agentic AI / Generative AI Post-Training Quantization of LLMs with NVIDIA NeMo and NVIDIA TensorRT Model Optimizer Sep 10, 2024 By Jan Lasek , Onur Yilmaz , Chenjie Luo and Chenhan Yu Discuss (0…

Sep 10, 2024 · Jan Lasek

Building NVIDIA Nemotron 3 Agents for Reasoning, Multimodal RAG, Voice, and Safety | NVIDIA Technical Blog

…Learn more Agentic AI is an ecosystem where specialized models work together to handle planning, reasoning, retrieval, and safety guardrailing. As these systems scale, developers need models that can understand real-world…

Mar 24, 2026 · Chintan Patel

CUDA Tile Programming Now Available for BASIC! | NVIDIA Technical Blog

…Relish in the opportunity to port your modern AI or scientific computing code base to a historically pivotal language while retaining the ability to run on the most powerful hardware available! Just…

Apr 1, 2026 · Rob Armstrong

Using NVFP4 Low-Precision Model Training for Higher Throughput Without Losing Accuracy | NVIDIA Technical Blog

…NVIDIA NeMo Megatron Bridge provides production-ready low-precision training recipes that allow seamless switching between precision formats, supporting efficient large-scale model training with minimal code modifications. AI-generated content may…

Feb 23, 2026 · Aditya Vavre

Inside NVIDIA Groq 3 LPX: The Low-Latency Inference Accelerator for the NVIDIA Vera Rubin Platform | NVIDIA Technical Blog

…As generation speeds approach 1,000 tokens per second per user, models move beyond conversation-speed interaction toward speed of thought computing. At that rate, AI systems can reason, simulate, and respond…

Mar 16, 2026 · Kyle Aubrey

Followed topics

Search

People also ask

CUDA-X

Post-Training Quantization of LLMs with NVIDIA NeMo and NVIDIA TensorRT Model Optimizer | NVIDIA Technical Blog

Building NVIDIA Nemotron 3 Agents for Reasoning, Multimodal RAG, Voice, and Safety | NVIDIA Technical Blog

CUDA Tile Programming Now Available for BASIC! | NVIDIA Technical Blog

Using NVFP4 Low-Precision Model Training for Higher Throughput Without Losing Accuracy | NVIDIA Technical Blog

Inside NVIDIA Groq 3 LPX: The Low-Latency Inference Accelerator for the NVIDIA Vera Rubin Platform | NVIDIA Technical Blog

Mastering Agentic Techniques: AI Agent Evaluation | NVIDIA Technical Blog

Using Accelerated Computing to Live-Steer Scientific Experiments at Massive Research Facilities | NVIDIA Technical Blog

NVIDIA Brev

Automating Inference Optimizations with NVIDIA TensorRT LLM AutoDeploy | NVIDIA Technical Blog