Custom metrics in HPA doc

2016-04-28 20:30:48 +02:00 · 2016-04-28 20:30:48 +02:00 · 3ae0ef3444
parent 327aae6b65
commit 3ae0ef3444
1 changed files with 50 additions and 3 deletions
--- a/docs/user-guide/horizontal-pod-autoscaling/index.md
+++ b/docs/user-guide/horizontal-pod-autoscaling/index.md
@ -3,18 +3,17 @@

 This document describes the current state of Horizontal Pod Autoscaling in Kubernetes.

-
 ## What is Horizontal Pod Autoscaling?

 With Horizontal Pod Autoscaling, Kubernetes automatically scales the number of pods
-in a replication controller, deployment or replica set based on observed CPU utilization.
+in a replication controller, deployment or replica set based on observed CPU utilization
+(or, with alpha support, on some other, application-provided metrics).

 The Horizontal Pod Autoscaler is implemented as a Kubernetes API resource and a controller.
 The resource determines the behavior of the controller.
 The controller periodically adjusts the number of replicas in a replication controller or deployment
 to match the observed average CPU utilization to the target specified by user.

-
 ## How does the Horizontal Pod Autoscaler work?

 ![Horizontal Pod Autoscaler diagram](/images/docs/horizontal-pod-autoscaler.svg)
@ -76,6 +75,54 @@ i.e. you cannot bind a Horizontal Pod Autoscaler to a replication controller and
 The reason this doesn't work is that when rolling update creates a new replication controller,
 the Horizontal Pod Autoscaler will not be bound to the new replication controller.

+## Support for custom metrics
+
+Kubernetes 1.2 adds alpha support for scaling based on application-specific metrics like QPS (queries per second) or average request latency.
+
+### Prerequisites
+
+The cluster has to be started with `ENABLE_CUSTOM_METRICS` environment variable set to `true`.
+
+### Pod configuration
+
+The pods to be scaled must have cAdvisor-specific custom (aka application) metrics endpoint configured. The configuration format is described [here](https://github.com/google/cadvisor/blob/master/docs/application_metrics.md). Kubernetes expects the configuration to 
+  be placed in `definition.json` mounted via a [config map](/docs/user-guide/horizontal-pod-autoscaling/configmap/) in `/etc/custom-metrics`. A sample config map may look like this:
+
+```yaml
+apiVersion: v1
+kind: ConfigMap
+metadata:
+  name: cm-config
+data:
+  definition.json: "{\"endpoint\" : \"http://localhost:8080/metrics\"}"
+``` 
+
+**Warning**
+Due to the way cAdvisor currently works `localhost` refers to the node itself, not to the running pod. Thus the appropriate container in the pod must ask for a node port. Example:
+
+```yaml
+    ports:
+    - hostPort: 8080
+      containerPort: 8080
+```
+
+### Specifying target
+
+HPA for custom metrics is configured via an annotation. The value in the annotation is interpreted as a target metric value averaged over
+all running pods. Example: 
+
+```yaml
+    annotations:
+      alpha/target.custom-metrics.podautoscaler.kubernetes.io: '{"items":[{"name":"qps", "value": "10"}]}'
+```
+
+In this case if there are 4 pods running and each of them reports qps metric to be equal to 15 HPA will start 2 additional pods so there will be 6 pods in total. If there are multiple metrics passed in the annotation or CPU is configured as well then HPA will use the biggest 
+number of replicas that comes from the calculations.
+
+At this moment even if target CPU utilization is not specified a default of 80% will be used. 
+To calculate number of desired replicas based only on custom metrics CPU utilization
+target should be set to a very large value (e.g. 100000%). Then CPU-related logic 
+will want only 1 replica, leaving the decision about higher replica count to cusom metrics (and min/max limits).

 ## Further reading