Search: Paralives

HPC-X

…HPC-X OpenSHMEM: The HPC-X OpenSHMEM programming library is a one-sided communications library that supports a unique set of parallel programming features, including point-to-point and collective routines, synchronizations…

MiniMax M2.7 Advances Scalable Agentic Workflows on NVIDIA Platforms for Complex AI Applications | NVIDIA Technical Blog

…For more information, see the vLLM guide.
$ vllm serve MiniMaxAI/MiniMax-M2.7 \
    --tensor-parallel-size 4 \
    --tool-call-parser minimax_m2 \
    --reasoning-parser minimax_m2_append_think \
    --enable-auto-tool-choice…

Apr 12, 2026 · Anu Srivastava

Accelerating Vision AI Pipelines with Batch Mode VC-6 and NVIDIA Nsight | NVIDIA Technical Blog

…From N to a single decoder. Execution model redesign: algorithmic changes to decode multiple images simultaneously with a single decoder. Improved parallelization: leveraging the new work dimension (images) next to existing parallelization…

Apr 2, 2026 · Andreas Kieslinger

R²D²: Scaling Multimodal Robot Learning with NVIDIA Isaac Lab | NVIDIA Technical Blog

…Scaling simulation to thousands of parallel environments to overcome the slow training times of CPU-bound tools; integrating multiple sensor modalities (vision, force, and proprioception) into synchronized, high-fidelity data streams; modeling…

Feb 10, 2026 · Oyindamola Omotuyi

NVIDIA Performance Libraries (NVPL)

…NVPL ScaLAPACK: A LAPACK extension designed for distributed memory parallel computing environments. Resources: NVPL Documentation; NVPL Samples (GitHub); Unlock the Power of NVIDIA Grace and NVIDIA Hopper™ Architectures with Foundational HPC Software…

CUDA Tile Programming Now Available for BASIC! | NVIDIA Technical Blog

…cuTile BASIC lets developers write tile-based GPU kernels in BASIC with minimal syntax, handling parallelism and data partitioning automatically, as shown with simple vector addition and matrix multiplication examples. Running cuTile…

Apr 1, 2026 · Rob Armstrong

Newton Adds Contact-Rich Manipulation and Locomotion Capabilities for Industrial Robotics | NVIDIA Technical Blog

…MuJoCo 3.5 (MJWarp) builds on the stability and accuracy the robotics community already trusts in MuJoCo, developed by Google DeepMind, now extended with GPU-scale throughput for thousands of parallel training…

Mar 16, 2026 · Philipp Reist

Building Autonomous Vehicles That Reason with NVIDIA Alpamayo | NVIDIA Technical Blog

…AlpaSim leverages a scalable, microservice-based architecture with modular APIs and pipeline parallelism, allowing efficient closed-loop simulation, flexible integration of user-defined policies, and high-throughput evaluation of end-to-end…

Jan 5, 2026 · Marco Pavone

Pruning and Distilling LLMs Using NVIDIA TensorRT Model Optimizer | NVIDIA Technical Blog

…Further details found in the NeMo distillation notebook. The script for this process is provided below, showing how to distill using a single-node eight-GPU tensor parallel setup. In practice, we…

Oct 7, 2025 · Max Xu

5 New Digital Twin Products Developers Can Use to Build 6G Networks | NVIDIA Technical Blog

…Now available on AWS Cloud, AI RSG provides scalable, on-demand access to high-fidelity RAN testing—for teams to parallelize experiments, automate benchmarking, and accelerate AI-RAN validation cycles. Calibration is…

Mar 1, 2026 · Cindy Goh
