Description:
- Horizontal Scaling
- HPA updates a workload resource to match demand
- by depploying more/less pods
- doesnt apply for objects that cant be scaled, ex: Kubernetes DaemonSet
- runs inside Kubernetes Control Plane
- Controlled by HPA controller
kubectl get hpa
How does HPA work:
- the interval is set by
--horizontal-pod-autoscaler-sync-periodparameter to the kube-controller-manager (and the default interval is 15 seconds). - metrics-server is required
- hpa checks with metrics server for data
Pod readiness and autoscaling metrics:
- .
- HPA controller
- Key behaviours for pod readiness
- good practice for pod readiness