Achieving Single-Digit Microsecond Latency Inference for Capital Markets | NVIDIA Technical Blog
… Comparison to previous submissions NVIDIA previously submitted optimized results for both throughput and latency Sumaco and Tacana benchmarks , as detailed in NVIDIA A100 Aces Throughput, Latency Results in Key Inference Benchmark for Financial Services Industry . …