
Showing top 119 results for "Community support"

Accelerating Long-Context Model Training in JAX and XLA | NVIDIA Technical Blog

…supporting sequences of 128K tokens, 256K tokens, and beyond. However, training these models with extended context lengths presents significant computational and communication challenges. As context lengths grow, the memory and communication overhead…

Feb 3, 2026 · Sevin Fide Varoglu

Running Large-Scale GPU Workloads on Kubernetes with Slurm | NVIDIA Technical Blog

…Slurm is configured with SwitchType = switch/nvidia_imex to use NVIDIA IMEX for cross-node GPU communication. Topology-aware scheduling: Slurm 25.11 supports TopologyParam=BlockAsNodeRank with TopologyPlugin=topology/block, ensuring allocations…

Apr 9, 2026 · Anton Polyakov

Nsight Compute 2026.1 - New Features

…The Save As dialog now supports the .ncu-repz file extension for compressed reports. NVIDIA Nsight Compute CLI: Mandatory concurrent kernels (e.g., NCCL communication kernels) can now be profiled across processes…

Removing the Guesswork from Disaggregated Serving | NVIDIA Technical Blog

…How the SGLang community is contributing. Mooncake: Initial SGLang support in AIConfigurator. AIConfigurator initially supported only TensorRT LLM, reserving interfaces for SGLang and vLLM without full implementation. Contributors from Mooncake (an open…

Mar 9, 2026 · Tianhao Xu

Inside NVIDIA Groq 3 LPX: The Low-Latency Inference Accelerator for the NVIDIA Vera Rubin Platform | NVIDIA Technical Blog

…Individual agents can be powerful on their own, but coordinated groups of agents can accomplish far more, much like human societies scale their capability through collective intelligence and coordination. Supporting these emerging…

Mar 16, 2026 · Kyle Aubrey

Networking / Communications – NVIDIA Technical Blog

…expanding their context windows, with recent models supporting sequences of 128K tokens, 256K tokens, and beyond… (9 min read, Feb 02, 2026) · Optimizing Communication for Mixture-of-Experts Training with Hybrid Expert…

May 12, 2026

NVIDIA RTX Innovations Are Powering the Next Era of Game Development | NVIDIA Technical Blog

…This collaboration will boost GPU efficiency, eliminate context-switching for smoother gameplay, and provide a unified workflow for the broader development community. Building on last summer’s preview of Cooperative Vector support…

Mar 10, 2026 · Ike Nnoli

Design, Simulate, and Scale AI Factory Infrastructure with NVIDIA DSX Air | NVIDIA Technical Blog

By Ranga Maddipudi, Avi Alkobi, and Taylor Allison…

Mar 16, 2026 · Ranga Maddipudi

Speeding Up Variable-Length Training with Dynamic Context Parallelism and NVIDIA Megatron Core | NVIDIA Technical Blog

…Even though these sequences fit on a single GPU, they’re partitioned due to a longer sequence in the same batch, resulting in unnecessary CP communication overhead. Usually, computation hides CP communication…

Jan 28, 2026 · Kunlun Li

DOCA Software Framework

…unified communications and collaboration (UCC) and Unified Communication X (UCX), RDMA verbs, GPUDirect® Network acceleration SDK: NVIDIA Accelerated Switching and Packet Processing (ASAP²)™ software-defined networking (SDN), emulated VirtIO, Firefly time synchronization…
