Introducing NVIDIA BlueField-4-Powered CMX Context Memory Storage Platform for the Next Frontier of AI | NVIDIA Technical Blog
… As context grows, KV cache quickly exhausts local storage capacity G1-G3 , while pushing it down to enterprise storage G4 , which introduces unacceptable overheads and drives up both cost and power consumption. …