website/content/en/docs/tasks/administer-cluster/sysctl-cluster.md

---
title: Using Sysctls in a Kubernetes Cluster
reviewers:
- sttts
content_template: templates/task
---

{{% capture overview %}}

This document describes how sysctls are used within a Kubernetes cluster.

{{% /capture %}}

{{% capture prerequisites %}}

{{< include "task-tutorial-prereqs.md" >}} {{< version-check >}}

{{% /capture %}}

{{% capture steps %}}

## Listing all Sysctl Parameters

In Linux, the sysctl interface allows an administrator to modify kernel
parameters at runtime. Parameters are available via the `/proc/sys/` virtual
process file system. The parameters cover various subsystems such as:

- kernel (common prefix: `kernel.`)
- networking (common prefix: `net.`)
- virtual memory (common prefix: `vm.`)
- MDADM (common prefix: `dev.`)
- More subsystems are described in [Kernel docs](https://www.kernel.org/doc/Documentation/sysctl/README).

To get a list of all parameters, you can run

```shell
$ sudo sysctl -a
```

## Enabling Unsafe Sysctls

Sysctls are grouped into _safe_  and _unsafe_ sysctls. In addition to proper
namespacing a _safe_ sysctl must be properly _isolated_ between pods on the same
node. This means that setting a _safe_ sysctl for one pod

- must not have any influence on any other pod on the node
- must not allow to harm the node's health
- must not allow to gain CPU or memory resources outside of the resource limits
  of a pod.

By far, most of the _namespaced_ sysctls are not necessarily considered _safe_.
The following sysctls are supported in the _safe_ set:

- `kernel.shm_rmid_forced`,
- `net.ipv4.ip_local_port_range`,
- `net.ipv4.tcp_syncookies`.

{{< note >}}
**Note**: The example `net.ipv4.tcp_syncookies` is not namespaced on Linux kernel version 4.4 or lower.
{{< /note >}}

This list will be extended in future Kubernetes versions when the kubelet
supports better isolation mechanisms.

All _safe_ sysctls are enabled by default.

All _unsafe_ sysctls are disabled by default and must be allowed manually by the
cluster admin on a per-node basis. Pods with disabled unsafe sysctls will be
scheduled, but will fail to launch.

With the warning above in mind, the cluster admin can allow certain _unsafe_
sysctls for very special situations like e.g. high-performance or real-time
application tuning. _Unsafe_ sysctls are enabled on a node-by-node basis with a
flag of the kubelet, e.g.:

```shell
$ kubelet --experimental-allowed-unsafe-sysctls \
  'kernel.msg*,net.ipv4.route.min_pmtu' ...
```

For minikube, this can be done via the `extra-config` flag:

```shell
$ minikube start --extra-config="kubelet.AllowedUnsafeSysctls=kernel.msg*,net.ipv4.route.min_pmtu"...
```

Only _namespaced_ sysctls can be enabled this way.

## Setting Sysctls for a Pod

A number of sysctls are _namespaced_ in today's Linux kernels. This means that
they can be set independently for each pod on a node. Being namespaced is a
requirement for sysctls to be accessible in a pod context within Kubernetes.

The following sysctls are known to be _namespaced_:

- `kernel.shm*`,
- `kernel.msg*`,
- `kernel.sem`,
- `fs.mqueue.*`,
- `net.*`.

Sysctls which are not namespaced are called _node-level_ and must be set
manually by the cluster admin, either by means of the underlying Linux
distribution of the nodes (e.g. via `/etc/sysctls.conf`) or using a DaemonSet
with privileged containers.

The sysctl feature is an alpha API. Therefore, sysctls are set using annotations
on pods. They apply to all containers in the same pod.

Here is an example, with different annotations for _safe_ and _unsafe_ sysctls:

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: sysctl-example
  annotations:
    security.alpha.kubernetes.io/sysctls: kernel.shm_rmid_forced=1
    security.alpha.kubernetes.io/unsafe-sysctls: net.ipv4.route.min_pmtu=1000,kernel.msgmax=1 2 3
spec:
  ...
```
{{% /capture %}}

{{% capture discussion %}}

{{< warning >}}
**Warning**: Due to their nature of being _unsafe_, the use of _unsafe_ sysctls
is at-your-own-risk and can lead to severe problems like wrong behavior of
containers, resource shortage or complete breakage of a node.
{{< /warning >}}

It is good practice to consider nodes with special sysctl settings as
_tainted_ within a cluster, and only schedule pods onto them which need those
sysctl settings. It is suggested to use the Kubernetes [_taints and toleration_
feature](/docs/reference/generated/kubectl/kubectl-commands/#taint) to implement this.

A pod with the _unsafe_ sysctls will fail to launch on any node which has not
enabled those two _unsafe_ sysctls explicitly. As with _node-level_ sysctls it
is recommended to use
[_taints and toleration_ feature](/docs/reference/generated/kubectl/kubectl-commands/#taint) or
[taints on nodes](/docs/concepts/configuration/taint-and-toleration/)
to schedule those pods onto the right nodes.

## PodSecurityPolicy Annotations

The use of sysctl in pods can be controlled via annotation on the PodSecurityPolicy.

Sysctl annotation represents a whitelist of allowed safe and unsafe sysctls
in a pod spec. It's a comma-separated list of plain sysctl names or sysctl patterns
(which end in `*`). The string `*` matches all sysctls.

Here is an example, it authorizes binding user creating pod with corresponding sysctls.

```yaml
apiVersion: policy/v1beta1
kind: PodSecurityPolicy
metadata:
  name: sysctl-psp
  annotations:
    security.alpha.kubernetes.io/sysctls: 'net.ipv4.route.*,kernel.msg*'
spec:
 ...
```

{{% /capture %}}
Move a batch of cluster admin topics. (#2813) 2017-03-14 21:09:54 +00:00			`---`
Make using sysctls a task instead of a concept (#6808) Closes: #4505 2018-03-22 01:28:04 +00:00			`title: Using Sysctls in a Kubernetes Cluster`
In concepts, in front matter, change approvers to reviewers. (#7442) 2018-02-27 18:51:46 +00:00			`reviewers:`
Move a batch of cluster admin topics. (#2813) 2017-03-14 21:09:54 +00:00			`- sttts`
Convert site to Hugo (#8316) This commit converts content and layout to use Hugo. 2018-05-05 16:00:51 +00:00			`content_template: templates/task`
Move a batch of cluster admin topics. (#2813) 2017-03-14 21:09:54 +00:00			`---`

Convert site to Hugo (#8316) This commit converts content and layout to use Hugo. 2018-05-05 16:00:51 +00:00			`{{% capture overview %}}`
Move a batch of cluster admin topics. (#2813) 2017-03-14 21:09:54 +00:00
			`This document describes how sysctls are used within a Kubernetes cluster.`

Convert site to Hugo (#8316) This commit converts content and layout to use Hugo. 2018-05-05 16:00:51 +00:00			`{{% /capture %}}`
Make using sysctls a task instead of a concept (#6808) Closes: #4505 2018-03-22 01:28:04 +00:00
Convert site to Hugo (#8316) This commit converts content and layout to use Hugo. 2018-05-05 16:00:51 +00:00			`{{% capture prerequisites %}}`
Make using sysctls a task instead of a concept (#6808) Closes: #4505 2018-03-22 01:28:04 +00:00
Convert site to Hugo (#8316) This commit converts content and layout to use Hugo. 2018-05-05 16:00:51 +00:00			`{{< include "task-tutorial-prereqs.md" >}} {{< version-check >}}`
Make using sysctls a task instead of a concept (#6808) Closes: #4505 2018-03-22 01:28:04 +00:00
Convert site to Hugo (#8316) This commit converts content and layout to use Hugo. 2018-05-05 16:00:51 +00:00			`{{% /capture %}}`
Make using sysctls a task instead of a concept (#6808) Closes: #4505 2018-03-22 01:28:04 +00:00
Convert site to Hugo (#8316) This commit converts content and layout to use Hugo. 2018-05-05 16:00:51 +00:00			`{{% capture steps %}}`
Make using sysctls a task instead of a concept (#6808) Closes: #4505 2018-03-22 01:28:04 +00:00
			`## Listing all Sysctl Parameters`
Move a batch of cluster admin topics. (#2813) 2017-03-14 21:09:54 +00:00
			`In Linux, the sysctl interface allows an administrator to modify kernel`
			parameters at runtime. Parameters are available via the `/proc/sys/` virtual
			`process file system. The parameters cover various subsystems such as:`

			- kernel (common prefix: `kernel.`)
			- networking (common prefix: `net.`)
			- virtual memory (common prefix: `vm.`)
			- MDADM (common prefix: `dev.`)
			`- More subsystems are described in [Kernel docs](https://www.kernel.org/doc/Documentation/sysctl/README).`

			`To get a list of all parameters, you can run`

Make using sysctls a task instead of a concept (#6808) Closes: #4505 2018-03-22 01:28:04 +00:00			```shell
Move a batch of cluster admin topics. (#2813) 2017-03-14 21:09:54 +00:00			`$ sudo sysctl -a`
			```

Make using sysctls a task instead of a concept (#6808) Closes: #4505 2018-03-22 01:28:04 +00:00			`## Enabling Unsafe Sysctls`
Move a batch of cluster admin topics. (#2813) 2017-03-14 21:09:54 +00:00
			`Sysctls are grouped into _safe_ and _unsafe_ sysctls. In addition to proper`
			`namespacing a _safe_ sysctl must be properly _isolated_ between pods on the same`
			`node. This means that setting a _safe_ sysctl for one pod`

			`- must not have any influence on any other pod on the node`
			`- must not allow to harm the node's health`
			`- must not allow to gain CPU or memory resources outside of the resource limits`
			`of a pod.`

			`By far, most of the _namespaced_ sysctls are not necessarily considered _safe_.`
Make using sysctls a task instead of a concept (#6808) Closes: #4505 2018-03-22 01:28:04 +00:00			`The following sysctls are supported in the _safe_ set:`
Move a batch of cluster admin topics. (#2813) 2017-03-14 21:09:54 +00:00
			- `kernel.shm_rmid_forced`,
			- `net.ipv4.ip_local_port_range`,
			- `net.ipv4.tcp_syncookies`.

Convert site to Hugo (#8316) This commit converts content and layout to use Hugo. 2018-05-05 16:00:51 +00:00			`{{< note >}}`
Add a note on availability of tcp_syncookies syctl (#7627) * Add a note on availability of tcp_syncookies syctl Sysctl `net.ipv4.tcp_syncookies` is not availalbe on 4.4 kernel as it's not namespaced yet. * updating to use {: .note} notation per https://kubernetes.io/docs/home/contribute/style-guide/#note 2018-03-04 20:26:52 +00:00			Note: The example `net.ipv4.tcp_syncookies` is not namespaced on Linux kernel version 4.4 or lower.
Convert site to Hugo (#8316) This commit converts content and layout to use Hugo. 2018-05-05 16:00:51 +00:00			`{{< /note >}}`
Add a note on availability of tcp_syncookies syctl (#7627) * Add a note on availability of tcp_syncookies syctl Sysctl `net.ipv4.tcp_syncookies` is not availalbe on 4.4 kernel as it's not namespaced yet. * updating to use {: .note} notation per https://kubernetes.io/docs/home/contribute/style-guide/#note 2018-03-04 20:26:52 +00:00
Move a batch of cluster admin topics. (#2813) 2017-03-14 21:09:54 +00:00			`This list will be extended in future Kubernetes versions when the kubelet`
			`supports better isolation mechanisms.`

			`All _safe_ sysctls are enabled by default.`

			`All _unsafe_ sysctls are disabled by default and must be allowed manually by the`
			`cluster admin on a per-node basis. Pods with disabled unsafe sysctls will be`
			`scheduled, but will fail to launch.`

			`With the warning above in mind, the cluster admin can allow certain _unsafe_`
			`sysctls for very special situations like e.g. high-performance or real-time`
			`application tuning. _Unsafe_ sysctls are enabled on a node-by-node basis with a`
			`flag of the kubelet, e.g.:`

			```shell
Make using sysctls a task instead of a concept (#6808) Closes: #4505 2018-03-22 01:28:04 +00:00			`$ kubelet --experimental-allowed-unsafe-sysctls \`
			`'kernel.msg*,net.ipv4.route.min_pmtu' ...`
Move a batch of cluster admin topics. (#2813) 2017-03-14 21:09:54 +00:00			```
Make using sysctls a task instead of a concept (#6808) Closes: #4505 2018-03-22 01:28:04 +00:00
Update sysctl-cluster.md (#5894) Include guide on enabling unsafe sysctls in minikube 2017-10-23 18:50:18 +00:00			For minikube, this can be done via the `extra-config` flag:
Move a batch of cluster admin topics. (#2813) 2017-03-14 21:09:54 +00:00
Update sysctl-cluster.md (#5894) Include guide on enabling unsafe sysctls in minikube 2017-10-23 18:50:18 +00:00			```shell
			`$ minikube start --extra-config="kubelet.AllowedUnsafeSysctls=kernel.msg*,net.ipv4.route.min_pmtu"...`
			```
Make using sysctls a task instead of a concept (#6808) Closes: #4505 2018-03-22 01:28:04 +00:00
Move a batch of cluster admin topics. (#2813) 2017-03-14 21:09:54 +00:00			`Only _namespaced_ sysctls can be enabled this way.`

			`## Setting Sysctls for a Pod`

Make using sysctls a task instead of a concept (#6808) Closes: #4505 2018-03-22 01:28:04 +00:00			`A number of sysctls are _namespaced_ in today's Linux kernels. This means that`
			`they can be set independently for each pod on a node. Being namespaced is a`
			`requirement for sysctls to be accessible in a pod context within Kubernetes.`

			`The following sysctls are known to be _namespaced_:`

			- `kernel.shm*`,
			- `kernel.msg*`,
			- `kernel.sem`,
			- `fs.mqueue.*`,
			- `net.*`.

			`Sysctls which are not namespaced are called _node-level_ and must be set`
			`manually by the cluster admin, either by means of the underlying Linux`
			distribution of the nodes (e.g. via `/etc/sysctls.conf`) or using a DaemonSet
			`with privileged containers.`

			`The sysctl feature is an alpha API. Therefore, sysctls are set using annotations`
			`on pods. They apply to all containers in the same pod.`
Move a batch of cluster admin topics. (#2813) 2017-03-14 21:09:54 +00:00
			`Here is an example, with different annotations for _safe_ and _unsafe_ sysctls:`

			```yaml
			`apiVersion: v1`
			`kind: Pod`
			`metadata:`
			`name: sysctl-example`
			`annotations:`
			`security.alpha.kubernetes.io/sysctls: kernel.shm_rmid_forced=1`
			`security.alpha.kubernetes.io/unsafe-sysctls: net.ipv4.route.min_pmtu=1000,kernel.msgmax=1 2 3`
			`spec:`
			`...`
			```
Convert site to Hugo (#8316) This commit converts content and layout to use Hugo. 2018-05-05 16:00:51 +00:00			`{{% /capture %}}`
Make using sysctls a task instead of a concept (#6808) Closes: #4505 2018-03-22 01:28:04 +00:00
Convert site to Hugo (#8316) This commit converts content and layout to use Hugo. 2018-05-05 16:00:51 +00:00			`{{% capture discussion %}}`
Move a batch of cluster admin topics. (#2813) 2017-03-14 21:09:54 +00:00
Convert site to Hugo (#8316) This commit converts content and layout to use Hugo. 2018-05-05 16:00:51 +00:00			`{{< warning >}}`
Make using sysctls a task instead of a concept (#6808) Closes: #4505 2018-03-22 01:28:04 +00:00			`Warning: Due to their nature of being _unsafe_, the use of _unsafe_ sysctls`
			`is at-your-own-risk and can lead to severe problems like wrong behavior of`
			`containers, resource shortage or complete breakage of a node.`
Convert site to Hugo (#8316) This commit converts content and layout to use Hugo. 2018-05-05 16:00:51 +00:00			`{{< /warning >}}`
Make using sysctls a task instead of a concept (#6808) Closes: #4505 2018-03-22 01:28:04 +00:00
			`It is good practice to consider nodes with special sysctl settings as`
			`_tainted_ within a cluster, and only schedule pods onto them which need those`
			`sysctl settings. It is suggested to use the Kubernetes [_taints and toleration_`
Remove redirect and point to the generated kubectl doc directly (#8208) Since we now generate the kubectl doc for each release, it is no longer necessary to use the old redirects. Cleaning up the references throughout the documents. 2018-04-27 22:02:19 +00:00			`feature](/docs/reference/generated/kubectl/kubectl-commands/#taint) to implement this.`
Make using sysctls a task instead of a concept (#6808) Closes: #4505 2018-03-22 01:28:04 +00:00
			`A pod with the _unsafe_ sysctls will fail to launch on any node which has not`
			`enabled those two _unsafe_ sysctls explicitly. As with _node-level_ sysctls it`
			`is recommended to use`
Remove redirect and point to the generated kubectl doc directly (#8208) Since we now generate the kubectl doc for each release, it is no longer necessary to use the old redirects. Cleaning up the references throughout the documents. 2018-04-27 22:02:19 +00:00			`[_taints and toleration_ feature](/docs/reference/generated/kubectl/kubectl-commands/#taint) or`
Make using sysctls a task instead of a concept (#6808) Closes: #4505 2018-03-22 01:28:04 +00:00			`[taints on nodes](/docs/concepts/configuration/taint-and-toleration/)`
Improve taint and toleration documentation 2017-08-18 03:55:08 +00:00			`to schedule those pods onto the right nodes.`
fix sysctl miss in podsecuritypolicy descriptions. (#7600) modified: docs/concepts/cluster-administration/sysctl-cluster.md modified: docs/concepts/policy/pod-security-policy.md 2018-03-06 16:13:53 +00:00
			`## PodSecurityPolicy Annotations`

fix a desription error in sysctl file. (#7666) modified: docs/concepts/cluster-administration/sysctl-cluster.md 2018-03-09 05:57:11 +00:00			`The use of sysctl in pods can be controlled via annotation on the PodSecurityPolicy.`
fix sysctl miss in podsecuritypolicy descriptions. (#7600) modified: docs/concepts/cluster-administration/sysctl-cluster.md modified: docs/concepts/policy/pod-security-policy.md 2018-03-06 16:13:53 +00:00
fix a desription error in sysctl file. (#7666) modified: docs/concepts/cluster-administration/sysctl-cluster.md 2018-03-09 05:57:11 +00:00			`Sysctl annotation represents a whitelist of allowed safe and unsafe sysctls`
			`in a pod spec. It's a comma-separated list of plain sysctl names or sysctl patterns`
			(which end in ``). The string `` matches all sysctls.

			`Here is an example, it authorizes binding user creating pod with corresponding sysctls.`
fix sysctl miss in podsecuritypolicy descriptions. (#7600) modified: docs/concepts/cluster-administration/sysctl-cluster.md modified: docs/concepts/policy/pod-security-policy.md 2018-03-06 16:13:53 +00:00
			```yaml
docs/tasks/administer-cluster/sysctl-cluster.md: use PSP from policy API group. (#7998) 2018-04-06 13:55:10 +00:00			`apiVersion: policy/v1beta1`
fix sysctl miss in podsecuritypolicy descriptions. (#7600) modified: docs/concepts/cluster-administration/sysctl-cluster.md modified: docs/concepts/policy/pod-security-policy.md 2018-03-06 16:13:53 +00:00			`kind: PodSecurityPolicy`
			`metadata:`
			`name: sysctl-psp`
			`annotations:`
fix a desription error in sysctl file. (#7666) modified: docs/concepts/cluster-administration/sysctl-cluster.md 2018-03-09 05:57:11 +00:00			`security.alpha.kubernetes.io/sysctls: 'net.ipv4.route.,kernel.msg'`
fix sysctl miss in podsecuritypolicy descriptions. (#7600) modified: docs/concepts/cluster-administration/sysctl-cluster.md modified: docs/concepts/policy/pod-security-policy.md 2018-03-06 16:13:53 +00:00			`spec:`
			`...`
			```
Make using sysctls a task instead of a concept (#6808) Closes: #4505 2018-03-22 01:28:04 +00:00
Convert site to Hugo (#8316) This commit converts content and layout to use Hugo. 2018-05-05 16:00:51 +00:00			`{{% /capture %}}`

Make using sysctls a task instead of a concept (#6808) Closes: #4505 2018-03-22 01:28:04 +00:00