Feature request: option to configure kubelet to reserve resources for system daemons #795

Closed
arielvinas opened this issue May 10, 2019 · 9 comments · Fixed by #886

@arielvinas

Why do you want this feature?
In EKS, nodes that are starved of resources needed by system daemons go into the NotReady state and stay there until someone manually deletes the node in the EC2 UI. The feature where the master times out a node that stays NotReady for too long because it can't communicate will only come with Kubernetes 1.15 (in 2020). Reserving resources is very important: https://kubernetes.io/docs/tasks/administer-cluster/reserve-compute-resources/

What feature/behavior/change do you want?
There should be a field in the cluster config YAML to configure these settings and apply them to the userdata in the CloudFormation template, something like extraKubeletFlags or appendToKubeletConfig.

it would look like:

appendToKubeletConfig:
  kubeReserved:
    cpu: "300m"
    memory: "300Mi"
    ephemeral-storage: "1Gi"
  kubeReservedCgroup: "/kube-reserved"
  systemReserved:
    cpu: "300m"
    memory: "300Mi"
    ephemeral-storage: "1Gi"
  evictionHard:
    memory.available: "200Mi"
    nodefs.available: "5%"
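
For comparison, these fields correspond closely to what the kubelet itself accepts via its --config file, as described in the reserve-compute-resources doc linked above. A minimal sketch of that KubeletConfiguration (values copied from the example above, not recommendations):

kind: KubeletConfiguration
apiVersion: kubelet.config.k8s.io/v1beta1
# Resources set aside for Kubernetes system daemons (kubelet, container runtime)
kubeReserved:
  cpu: "300m"
  memory: "300Mi"
  ephemeral-storage: "1Gi"
kubeReservedCgroup: "/kube-reserved"
# Resources set aside for OS system daemons (sshd, udev, ...)
systemReserved:
  cpu: "300m"
  memory: "300Mi"
  ephemeral-storage: "1Gi"
# Hard eviction thresholds: the kubelet starts evicting pods once a node drops below these
evictionHard:
  memory.available: "200Mi"
  nodefs.available: "5%"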
@whereisaaron

whereisaaron commented May 11, 2019

uh oh, you mean EKS/eksctl is not reserving system resources already? I thought this was default/built-in behavior of current k8s, but sounds like it is optional 😢

The reserved resource for kubelets is extremely important where you use overcommitted workloads (collections of spikey workloads) i.e. any time where Limits >= Requests. Under node resource exhaustion you want some workloads to be rescheduled, not entire nodes to go down.

If you deploy any workloads without resource Requests, or with Requests but without identical Limits, then you need to ‘thumbs up’ this now 😄
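
For anyone new to those terms, a minimal illustrative container spec for the overcommit case described above, where Limits exceed Requests (all names and values are placeholders):

apiVersion: v1
kind: Pod
metadata:
  name: spiky-app                     # placeholder name
spec:
  containers:
    - name: app
      image: example.com/app:latest   # placeholder image
      resources:
        requests:
          cpu: "100m"                 # what the scheduler counts against node capacity
          memory: "128Mi"
        limits:
          cpu: "500m"                 # allowed burst; many such pods can overcommit the node
          memory: "512Mi"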

@whereisaaron

Here is a thread of victims who didn’t know EKS doesn’t handle this by default.
awslabs/amazon-eks-ami#79

As well as being able to specify these settings, it would be great to have some sensible defaults in eksctl, to give the kubelet on overloaded nodes a fighting chance to handle the overload gracefully. The same users who don't know about this probably also don't appreciate the importance of pod resource requests/limits (yet 😜).

@arielvinas
Author

> Here is a thread of victims who didn't know EKS doesn't handle this by default.
> awslabs/amazon-eks-ami#79
>
> As well as being able to specify these settings, it would be great to have some sensible defaults in eksctl, to give the kubelet on overloaded nodes a fighting chance to handle the overload gracefully. The same users who don't know about this probably also don't appreciate the importance of pod resource requests/limits (yet 😜).

Exactly... I'm one of those who struggled for a couple of months (coming from Swarm) until realizing what was wrong with my nodes.

@errordeveloper errordeveloper self-assigned this Jun 5, 2019
@errordeveloper errordeveloper modified the milestone: 0.1.35 Jun 5, 2019
@Jeffwan
Contributor

Jeffwan commented Jun 6, 2019

I don't quite understand the case here. What's the expectation for the AMI? Do you think it's better to reserve memory by default? The AMI doesn't know the size of the instance or the user's workloads, so it's hard to reserve the right amount of resources, right?

@whereisaaron

@Jeffwan this is just to reserve resources for the kubelet process itself, so that it keeps running if the node gets overloaded by its workload. If the kubelet can keep running it can evict the excess workload. If it gets slammed, then the whole node goes down.

@errordeveloper errordeveloper modified the milestones: 0.1.35, 0.1.36 Jun 6, 2019
@Jeffwan
Contributor

Jeffwan commented Jun 6, 2019

@whereisaaron I agree. The challenge would be figuring out the proper size to reserve.

  1. Use different configs for different instance types?
  2. Use a fixed amount of memory. (Since so many instance types are supported, we have to make sure every node still has enough resources left after the reservation. This would be hard for small instances.)
     For example, t3.nano only has about 500 MiB; if 100 MiB is reserved for the kubelet, it is left with only 80% of its resources.

Do you have other ideas how to support this by default?

@errordeveloper
Contributor

To begin with, let's just expose the option to the user. The default will have to remain what it is now. In the future, we may devise an algorithm that takes a best guess based on the instance type(s), but that will need to be discussed separately.
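
For the sake of discussion, a sketch of how such an exposed option might look in a ClusterConfig, modeled on the proposal at the top of this issue (the field name and its placement under the nodegroup are hypothetical here, to be settled in the PR):

apiVersion: eksctl.io/v1alpha5
kind: ClusterConfig
metadata:
  name: example-cluster       # placeholder
  region: us-west-2           # placeholder
nodeGroups:
  - name: ng-1
    instanceType: m5.large
    # Hypothetical field name; would be passed through to the kubelet config in userdata.
    kubeletExtraConfig:
      kubeReserved:
        cpu: "300m"
        memory: "300Mi"
        ephemeral-storage: "1Gi"
      evictionHard:
        memory.available: "200Mi"
        nodefs.available: "5%"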

@whereisaaron

@Jeffwan I'm not sure the instance type makes much difference, since the kubelet has the same requirements regardless of instance size. Remember, this is to reserve resources for the kubelet process itself. Perhaps larger instances have an indirect impact, since there tend to be more containers per instance.

But as @errordeveloper proposed, if we expose the options to the user, we can't go far wrong.

@Jeffwan
Contributor

Jeffwan commented Jun 11, 2019

Agreed on the current solution. We will discuss the best numbers to use in a separate thread.
