k3s-demo

Kubernetes on a single k3s node - press the button and watch the load drive autoscaling, live.

Active app pods
- / 6
CPU vs 70% target
-%
Redis counter
-
Load & autoscaling (last ~90s) CPU %pods70% target

Pressing the button sends many concurrent /burn requests, which makes the app pods work hard. Watch CPU spike past the 70% line on the chart, and the pod count rise behind it as the autoscaler reacts. After load stops, the pods scale back to 2 in about 30-60s (this HPA's scale-down is tuned for the demo; Kubernetes defaults to a cautious 5 minutes to avoid flapping). It's one node, so it caps at what the node can hold.

What am I looking at?

Seeing more than 2 pods before pressing the button? A recent load test is still scaling back down - here that takes about 30-60s (Kubernetes' default is a cautious 5 minutes; this HPA is tuned faster for the demo). Briefly seeing more than 6? During a code deploy Kubernetes runs the old and new pods at once (a zero-downtime rolling update with maxSurge), so the count can momentarily exceed the 6-pod max before settling.