Achieving Single-Digit Microsecond Latency Inference for Capital Markets | NVIDIA Technical Blog
…The STAC-ML (Markets) Inference benchmark measures LSTM model latency—the time between receiving new input and generating the output. It includes three models of increasing complexity (LSTM_A, LSTM_B, and…
