Building Token‑Metered AI Services on Telco AI Factories | NVIDIA Technical Blog
… NVIDIA GB200 NVL72 delivers order‑of‑magnitude improvements in tokens‑per‑second and cost‑per‑million‑tokens versus the previous generation, and leading inference providers report up to 10x lower cost‑per‑token on real workloads when they pair Blackwell with optimized stacks. …