Paper page - Memory-Bound but Not Bandwidth-Limited: The Physical AI Inference Gap in Batch-1 LLM Decode
…View arXiv page View PDF Project page Add to collection Community Physical AI isn't compute-poor, it's runtime-poor. A robot serves one token at a time, not a crowd…
