Experimenting with TPUs, GKE Managed DRANET, and Multi-cluster Inference Gateway | Google Cloud Blog
… Use a temporary Kubernetes Job to download the Gemma 3 model weights (gemma-3-27b-it) directly into your Cloud Storage bucket. Define a ResourceClaimTemplate that explicitly requests the managed DRANET device class (deviceClassName: netdev.google.com) with the allocation mode set to "All". …
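
As a rough illustration of the ResourceClaimTemplate described above, the following Python sketch builds such a manifest as a plain dictionary and prints it as JSON, which kubectl accepts alongside YAML. Only the device class netdev.google.com and the "All" allocation mode come from the text. The object name all-netdev, the request name req-netdev, the helper function, and the choice of API version are assumptions made for illustration; check which resource.k8s.io version your cluster serves before applying anything like this.

```python
import json


def build_resource_claim_template(name, api_version="resource.k8s.io/v1"):
    """Return a ResourceClaimTemplate manifest as a dict (hypothetical helper)."""
    # These two values are the ones named in the text.
    exact = {
        "deviceClassName": "netdev.google.com",
        "allocationMode": "All",
    }
    if api_version == "resource.k8s.io/v1beta1":
        # In v1beta1 the class and allocation mode sit directly on the request.
        request = {"name": "req-netdev", **exact}
    else:
        # In v1beta2 and v1 they are nested under an "exactly" field.
        request = {"name": "req-netdev", "exactly": exact}
    return {
        "apiVersion": api_version,
        "kind": "ResourceClaimTemplate",
        "metadata": {"name": name},  # assumed name, not from the source
        "spec": {"spec": {"devices": {"requests": [request]}}},
    }


manifest = build_resource_claim_template("all-netdev")
print(json.dumps(manifest, indent=2))
```

The printed JSON can be saved to a file and passed to kubectl apply -f. A Pod would then reference the template by name under its resourceClaims; that wiring is not shown here because the excerpt does not describe it.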
