Running Large-Scale GPU Workloads on Kubernetes with Slurm | NVIDIA Technical Blog
…Production deployments at NVIDIA have demonstrated that Slinky slurm-operator scales to over 8,000 GPUs, supports nondisruptive rolling updates, maintains unified observability via Prometheus and Grafana, and achieves performance parity with…
