DynoSim: Simulating the Pareto Frontier | NVIDIA Technical Blog
…On an Apple M4 MacBook Air, the single-threaded Rust offline replay simulated the full 23,608-request Mooncake trace with eight round-robin workers and 512-token trace and engine blocks…