Optimizing Communication for Mixture-of-Experts Training with Hybrid Expert Parallel | NVIDIA Technical Blog
… Low-precision native support for FP8/ BF16 dispatch operators and BF16 combine operators. …
In addition to Muon, NVIDIA also supports many other optimizers for the research community to explore, including: The ultimate form of orthogonalized optimizer MOP (Momentum Orthogonalized by Polar decomposition) An advanced SOAP variant that updates eigen basis per step with eigen decomposition plus KL correction in REKLS
Advancing Emerging Optimizers for Accelerated LLM Training with NVIDIA Megatron | NVIDIA Technical Blog… Low-precision native support for FP8/ BF16 dispatch operators and BF16 combine operators. …
… Key Features Offloads collectives communications from MPI onto NVIDIA Quantum InfiniBand networking hardware Multiple transport support, including Reliable Connection RC , Dynamic Connected DC , and Unreliable Datagram UD Intra-node shared memory communication Receive-side tag matching Native suppo… …
… In addition to Muon, NVIDIA also supports many other optimizers for the research community to explore, including: The ultimate form of orthogonalized optimizer MOP Momentum Orthogonalized by Polar decomposition An advanced SOAP variant that updates eigen basis per step with eigen decomposition plus… …
… Future releases of cuTile will support fully automated cross-platform autotuning. …
… UCX is a community-driven networking library and is widely tested internally. …
… Maximize uptime and optimize performance with Enterprise Support. …
… The software infrastructure needs to evolve to support this new paradigm. The Slurm topology/block plugin provides the foundation and the commitment to continue working with the Slurm community to make it easier to deploy, understand, optimize, and operate at scale. …
… How the SGLang community is contributing Mooncake: Initial SGLang support in AIConfigurator AIConfigurator initially supported only TensorRT LLM, reserving interfaces for SGLang and vLLM without full implementation. …
… Access community benchmarks on common core. …
… Partner Supported Linux NVIDIA partners offer Enterprise-grade Linux with long-term support for Jetson. …