Search: Hardware/support requests

GKE Inference Gateway prefix caching accelerates AI inference | Google Cloud Blog

… By ensuring requests land on the exact accelerator that is primed to process them right away, GKE transforms how you can serve your large language models LLMs , with excellent hardware utilization and ultra-fast response times. …

Jun 9, 2026 · Bob Tian

Experimenting with TPUs, GKE Managed DRANET, and Multi-cluster Inference Gateway | Google Cloud Blog

… This supports GPUs , and TPUs . …

Jun 2, 2026 · Ammett Williams

Enhancements to Managed Service for Apache Spark clusters | Google Cloud Blog

… Managed Service for Apache Spark pairs this preference with automated regional zone placement, dynamically scanning the entire region to fulfill your capacity requests using the best available hardware layout. …

Jun 4, 2026 · Qiqi Wu

Serverless Managed Service for Apache Spark runtime 3.0 features | Google Cloud Blog

… Enhanced multi-zonal support To protect global enterprise workloads from zonal outages or hardware stockouts, the serverless Spark 3.0 runtime introduces enhanced multi-zonal support by default. …

Jun 3, 2026 · Vinay Londhe

Google Cloud latest news and announcements | Google Cloud Blog

… Jan 26 - Jan 30 Simplify API Governance with Native OpenAPI v3 Support Eliminate integration debt and accelerate deployment velocity with the General Availability of OpenAPI v3 OASv3 support for API Gateway and Cloud Endpoints. …

Jun 5, 2026

Followed topics

GKE Inference Gateway prefix caching accelerates AI inference | Google Cloud Blog

Experimenting with TPUs, GKE Managed DRANET, and Multi-cluster Inference Gateway | Google Cloud Blog

Enhancements to Managed Service for Apache Spark clusters | Google Cloud Blog

Serverless Managed Service for Apache Spark runtime 3.0 features | Google Cloud Blog

Google Cloud latest news and announcements | Google Cloud Blog