NVIDIA GTC 2026: Live Updates on What’s Next in AI
… If agents are to reason, plan and iterate at scale, inference speed becomes a first‑order design constraint. Much of today’s latency, he explained, doesn’t come from computation at all — but from communication. …