New GKE active buffer minimizes scale-out latency

19 hours ago google cloudblog

In dynamic cloud environments, unexpected traffic spikes or scheduled scaling events can easily strain user workloads. Whether you’re running a retail application during a flash sale or a gaming platform during peak player activity, your business-critical workloads need to scale up quickly and smoothly to handle new load. Compute capacity that is available the moment you need it is essential for maintaining consistent performance and meeting end-user latency SLOs.

While the Kubernetes Cluster Autoscaler (CA) is excellent at adding capacity when needed, provisioning new nodes takes time. Today, we’re excited to announce the preview of active buffer for Google Kubernetes Engine (GKE), a GKE-native implementation of the open-source Kubernetes CapacityBuffer API. It is designed to eliminate scale-out latency by keeping capacity on hand so that workloads can use it almost instantaneously.

The current challenge

Traditional cluster autoscaling often comes with significant ...
