GPU autoscaling on Kubernetes with KEDA: Building an external scaler
… This is the same pattern Kubernetes uses for device plugins and the metrics server — a per-node agent that collects local hardware data. …
In a typical Kubernetes setup, every cluster ships with its own dedicated control plane — meaning at least 3 nodes per cluster just for the control plane itself. Multiply that across dev, staging, and production environments, and you’re burning through resources before your first workload even lands. This is the problem Hosted Control Planes were built to solve. Instead of running the API server, etcd, and controllers on dedicated nodes per cluster, HCP runs all of them inside a central management cluster. The result is fewer VMs, lower costs, simpler upgrades, and a single pane of control acr
Kubernetes for platform teams: Leveraging k0s and k0rdent
… The combination of k0s lightweight Kubernetes, k0rdent multi-cluster orchestration, and OpenStack private cloud infrastructure creates a powerful platform engineering stack. 1. Prepare your environment: Before you touch Kubernetes, make sure your base environment is correct. …
… You weren’t just locked into a cloud; you were locked into its opinions about managing Kubernetes. CAPI changed the game. Your cluster is now a set of Kubernetes-native resources — Cluster, MachineDeployment, MachinePool — and a cloud-specific provider translates them into infrastructure. …
… In a typical Kubernetes workflow, validation can occur at multiple stages: Most governance strategies concentrate on the last two. …
Auto-diagnosing Kubernetes alerts with HolmesGPT and CNCF tools
… If you zoom out and look at what it really costs to run Kubernetes, the data shows where time, money, and effort add up: Komodor’s 2025 Enterprise Kubernetes Report found teams lose roughly 34 workdays per year resolving Kubernetes incidents, with nearly 80% of production issues tied to recent syst… …
… External Secrets Operator running within each Kubernetes cluster. Kubernetes Secrets generated and maintained by ESO for application consumption. …
… The Kubernetes irony: The irony is sharpest in the Kubernetes ecosystem. Kubernetes is the SRE moment for infrastructure – the most successful embodiment of “design for failure” ever built. …
… Networking and traffic management The networking layer combines Kubernetes-native primitives with Istio’s service mesh capabilities to provide secure, observable, and policy-driven traffic management: Kubernetes Services expose workloads internally with stable DNS-based discovery Azure Load Balance… …
… Our AI platform, Tmax, runs on Kubernetes and supports the full ML lifecycle, from notebook-based development to training and inference deployment. …