Search: Apple uses Nvidia Blackwell

Automating Inference Optimizations with NVIDIA TensorRT LLM AutoDeploy | NVIDIA Technical Blog

…On a single NVIDIA Blackwell DGX B200 GPU, AutoDeploy performed on par with the manually optimized baseline in TensorRT LLM (Figure 4). It delivered up to 350 tokens per second per user…

Feb 9, 2026 · Lucas Liebenwein

Introducing Nemotron 3 Super: An Open Hybrid Mamba-Transformer MoE for Agentic Reasoning | NVIDIA Technical Blog

…Pretraining: Super is pretrained on 25 trillion tokens using NVFP4, the NVIDIA 4-bit floating-point format optimized for NVIDIA Blackwell. Rather than quantizing a full-precision model after the fact, Super…

Mar 11, 2026 · Chris Alexiuk

Run Step 3.7 Flash on NVIDIA GPUs with Enterprise-Ready Multimodal AI | NVIDIA Technical Blog

…training, teams can also use the NeMo Megatron-Bridge fine-tuning recipe, which provides additional performance optimizations. From data center deployments on NVIDIA Blackwell to deskside with NVIDIA DGX Station to managed…

May 29, 2026 · Anu Srivastava

AR / VR – NVIDIA Technical Blog

…Scaling the AI-Ready Data Center with NVIDIA RTX PRO 4500 Blackwell Server Edition and NVIDIA vGPU 20 (Apr 22, 2026 · 8 min read): AI integration is redefining mainstream enterprise applications, from…

May 22, 2026

Developer Tools & Techniques – NVIDIA Technical Blog

…Scaling the AI-Ready Data Center with NVIDIA RTX PRO 4500 Blackwell Server Edition and NVIDIA vGPU 20 (Apr 22, 2026 · 8 min read): AI integration is redefining mainstream enterprise applications, from…

May 22, 2026

NVIDIA Achieves Leading Agentic Coding Performance on First Agentic AI Benchmark | NVIDIA Technical Blog

…The benchmark uniquely captures agentic workload complexity, including non-deterministic sequences, tool call latencies, and variable sequence lengths, using private, representative test sets to avoid benchmark-specific optimization. At launch, NVIDIA GB300…

Jun 12, 2026 · Eduardo Alvarez

NVIDIA Nemotron 3 Ultra Powers Faster, More Efficient Reasoning for Long-Running Agents | NVIDIA Technical Blog

…NVFP4 precision: The same NVFP4 checkpoint runs on NVIDIA Hopper, NVIDIA Blackwell, and Ampere GPUs. Developers can use one checkpoint across all NVIDIA GPU architectures thanks to specialized NVFP4 quantization kernels. NVFP4…

Jun 4, 2026 · Chris Alexiuk

How to Integrate Computer Vision Pipelines with Generative AI and Reasoning | NVIDIA Technical Blog

…used as an intelligent add-on to computer vision pipelines for low-latency alerts and direct VLM Q&A on video segments, making it suitable for edge deployments on NVIDIA Blackwell platforms…

Sep 25, 2025 · Samuel Ochoa

cuTile.jl Brings NVIDIA CUDA Tile-Based Programming to Julia | NVIDIA Technical Blog

…Getting started: Just like cuTile Python, cuTile.jl requires an NVIDIA Ada, NVIDIA Ampere, or NVIDIA Blackwell GPU and an NVIDIA driver for CUDA 13.1 or higher. The package also requires…

Mar 3, 2026 · Tim Besard

Build with Kimi K2.5 Multimodal VLM Using NVIDIA GPU-Accelerated Endpoints | NVIDIA Technical Blog

…in the NVIDIA Developer Program. import requests invoke_url = "https://integrate.api.nvidia.com/v1/chat/completions" headers = { "Authorization": "Bearer $NVIDIA_API_KEY", "Accept": "application/json", } payload = { "messages": [ { "role": "user", "content": "" } ], "model…

Feb 4, 2026 · Anu Srivastava

Top stories

How to Optimize Transformer-Based Models for Low-Precision Training | NVIDIA Technical Blog

Fine-Tuning Biological Foundation Models with LoRA Using NVIDIA BioNeMo Recipes | NVIDIA Technical Blog

Deploy Long-Context Reasoning and Agentic Workflows with MiniMax M3 on NVIDIA Accelerated Infrastructure | NVIDIA Technical Blog

Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell | NVIDIA Technical Blog