Nvidia Software Pushes MLPerf Inference Benchmarks To New Highs
… The company also noted speed improvements with the GB300 NVL72 v6.0 over v5.1, ranging from 1.21 times in the Llama 3.1 405B offline benchmark to 2.77 times for DeepSeek-R1 server test. “In just the past six months, we have been able to nearly triple our performance on DeepSeek-R1, which is a very … …