Nvidia Software Pushes MLPerf Inference Benchmarks To New Highs
…The new tests included DeepSeek-R1 Interactive, which tested token delivery speed and reduced time to first token; GPT-OSS-120B, a mixture-of-experts reasoning model; and Qwen3-VL-235B…