With Horizontal Pod Autoscaling, Kubernetes automatically scales the number of pods in a replication controller, deployment or replica set based on observed CPU utilization (or, with alpha support, on some other, application-provided metrics).
The current stable version, which only includes support for CPU autoscaling, can
be found in the autoscaling/v1
API version. The alpha version, which includes
support for scaling on memory and custom metrics, can be found in
autoscaling/v2alpha1
in 1.7 and autoscaling/v2beta1
1.8 and 1.9.
Kops can assist in setting up HPA and recommends Kubernetes 1.7.x
to 1.9.x
and Kops >=1.7
. Relevant reading you will need to go through:
- Extending the Kubernetes API with the aggregation layer
- Configure The Aggregation Layer
- Horizontal Pod Autoscaling
While the above links go into details on how Kubernetes needs to be configured to work with HPA, a lot of that work is already done for you by Kops. Specifically:
- Enable the Aggregation Layer via the following
kube-apiserver flags:
-
--requestheader-client-ca-file=<path to aggregator CA cert>
-
--requestheader-allowed-names=aggregator
-
--requestheader-extra-headers-prefix=X-Remote-Extra-
-
--requestheader-group-headers=X-Remote-Group
-
--requestheader-username-headers=X-Remote-User
-
--proxy-client-cert-file=<path to aggregator proxy cert>
-
--proxy-client-key-file=<path to aggregator proxy key>
-
- Enable Horizontal Pod Scaling ... set the appropriate flags for
kube-controller-manager
:-
--horizontal-pod-autoscaler-use-rest-clients
should be true. -
--kubeconfig <path-to-kubeconfig>
-
Ensure that you use 1.8.5 at minimum for Kubernetes 1.8.x due to a nasty bug affecting the API aggregation layer. It was fixed in kubernetes/kubernetes#55259.
Enable API versions required to support scaling on cpu, memory and custom metrics:
# On K8s 1.7
spec:
kubeAPIServer:
runtimeConfig:
autoscaling/v2alpha1: "true"
# On K8s 1.8 and 1.9
spec:
kubeAPIServer:
runtimeConfig:
autoscaling/v2beta1: "true"
If you've set the above configuration, your cluster is now ready for the resource metrics API (installation instruction here). The compatibility matrix is as follows:
Metrics Server | Metrics API group/version | Supported Kubernetes version |
---|---|---|
0.2.x | metrics.k8s.io/v1beta1 |
1.8+ |
0.1.x | metrics/v1alpha1 |
1.7 |
Enable gathering custom metrics:
spec:
kubelet:
enableCustomMetrics: true
Enable the horizontal pod autoscaler REST client:
spec:
kubeControllerManager:
horizontalPodAutoscalerUseRestClients: true
If you've set the above configuration, your cluster is now ready for the custom metrics API. Register it via the API aggregation layer. If you're using Prometheus, checkout the custom metrics adapter for Prometheus.
These are the PRs that enable the required configuration:
- kubernetes/kops#3679 - sets
--requestheader-xxx
kube-apiserver flags required to enable aggregation layer--requestheader-client-ca-file=<path to aggregator CA cert> --requestheader-allowed-names=aggregator --requestheader-extra-headers-prefix=X-Remote-Extra- --requestheader-group-headers=X-Remote-Group --requestheader-username-headers=X-Remote-User
- kubernetes/kops#3165 - sets
--proxy-client-xxx
kube-apiserver flags required to enable aggregation layer--proxy-client-cert-file=<path to aggregator proxy cert> --proxy-client-key-file=<path to aggregator proxy key>
- kubernetes/kops#3939 - add config option to set
--horizontal-pod- autoscaler-use-rest-clients
kube-controller-manager flag required to enable custom metrics - kubernetes/kops#1574 - add config options to set
--enable-custom- metrics
flag on master and node kubelets required to enable custom metrics