AMD PACE - A vLLM Plugin for CPU Inference
…especially in data centers where 5th Gen AMD EPYC™ processors are already powering the underlying infrastructure. AMD PACE (AMD Platform Aware Compute Engine), a research framework was built to push CPU inference…