Add wildcard tolerations to kube-proxy #56589

rohitagarwal003 · 2017-11-29T20:49:41Z

Add wildcard tolerations to kube-proxy.
Add nvidia.com/gpu toleration to nvidia-gpu-device-plugin.

/kind bug
/priority critical-urgent
/sig scheduling

Release note:

kube-proxy addon tolerates all NoExecute and NoSchedule taints by default.

/assign @davidopp @bsalamat @vishh @jiayingz

It is expected that nodes with extended resources attached will be tainted with the resouce name, so that we can create dedicated nodes. If ExtendedResourceToleration admission controller is enabled, pods requesting such resources will automatically tolerate such taints. nvidia-gpu-device-plugin daemonset doesn't request such resources but still needs to run on such nodes, so it needs this toleration.

fluend-gcp already has these tolerations. kube-proxy when it runs as a static pod gets wildcard `NoExecute` toleration (all static pods get that). So, added the same toleration to kube-proxy when it runs as a daemonset. Also added wildcard `NoSchedule` toleration to kube-proxy.

rohitagarwal003 · 2017-11-29T20:50:38Z

/cc @MrHohn @bowei @gmarek

vishh · 2017-11-29T20:55:22Z

/lgtm
/approve

vishh · 2017-11-29T20:56:08Z

cc @mikedanese for approval

MrHohn

kube-proxy part LGTM.
/lgtm

rohitagarwal003 · 2017-11-29T21:50:25Z

/retest

jiayingz · 2017-11-29T21:54:05Z

/lgtm

mikedanese · 2017-11-29T23:13:59Z

squash?

mikedanese · 2017-11-29T23:18:21Z

cluster/addons/fluentd-gcp/fluentd-gcp-ds.yaml

@@ -107,7 +107,6 @@ spec:
        effect: "NoSchedule"
      - operator: "Exists"
        effect: "NoExecute"
-      #TODO: remove this toleration once #44445 is properly fixed.


Why remove this comment? The issue hasn't closed.

We don't want this toleration to be removed.

We don't is different than we never will. TODO indicates we don't but we might want to eventually. Is this issue obsolete? Should we close it?

#44445 contains multiple issues.

The title of the issue: Improvement: fluentd-gcp to get same toleration as kube-proxy will be fixed by this PR.

There was a regression in the daemonset controller mentioned in the bug which was fixed a while back.

I guess the only thing that is not fixed are the three comments starting from Improvement: fluentd-gcp to get same toleration as kube-proxy #44445 (comment) (users not being able to modify system addons on managed services like GKE and so can't use NoSchedule or NoExecute taints if these addons don't tolerate them because adding such taints make the nodes not have these "required" addons)

But even if we fix that (allow users to modify the toleration of system addons). I think the default should still be that these addons tolerate all taints. If users really want they can use the ability to modify the toleration of system addons to remove these wildcard tolerations.

Also, we need system addons that run on every node to have wildcard NoSchedule toleration for issue #55080, PR #55839.

cc @vishh

+1. The expected behavior is that all system addons that are expected to run on all GKE nodes should tolerate all taints and effects. Certain addons like GPU plugins need to tolerate GPU specific taints only.
This I feel is probably slightly different from #44445 where if a user taints all GKE nodes then cluster level system addons will not run at all. This is a separate feature and is not tied to the comment at all.
@mikedanese thoughts?

rohitagarwal003 · 2017-11-29T23:22:36Z

squash?

Those commits are doing two different things. The commit messages have more detail.

k8s-github-robot · 2017-11-30T06:06:41Z

[MILESTONENOTIFIER] Milestone Pull Request Needs Attention

@MrHohn @bsalamat @davidopp @jiayingz @mikedanese @mindprince @vishh @kubernetes/sig-scheduling-misc

Action required: During code freeze, pull requests in the milestone should be in progress.
If this pull request is not being actively worked on, please remove it from the milestone.
If it is being worked on, please add the status/in-progress label so it can be tracked with other in-flight pull requests.

Note: This pull request is marked as priority/critical-urgent, and must be updated every 1 day during code freeze.

Example update:

ACK.  In progress
ETA: DD/MM/YYYY
Risks: Complicated fix required

Pull Request Labels

sig/scheduling: Pull Request will be escalated to these SIGs if needed.
priority/critical-urgent: Never automatically move pull request out of a release milestone; continually escalate to contributor and SIG through all available channels.
kind/bug: Fixes a bug discovered during the current release.

Help

mikedanese · 2017-11-30T19:02:49Z

/approve

k8s-github-robot · 2017-11-30T19:03:23Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: jiayingz, mikedanese, mindprince, MrHohn, vishh

Associated issue: 55080

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these OWNERS Files:

~~cluster/OWNERS~~ [mikedanese]

You can indicate your approval by writing /approve in a comment
You can cancel your approval by writing /approve cancel in a comment

k8s-github-robot · 2017-11-30T19:16:56Z

/test all [submit-queue is verifying that this PR is safe to merge]

k8s-github-robot · 2017-11-30T20:02:19Z

Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions here.

rohitagarwal003 added 2 commits November 29, 2017 11:31

k8s-ci-robot assigned bsalamat, davidopp, jiayingz and vishh Nov 29, 2017

k8s-ci-robot requested review from bowei, gmarek and MrHohn November 29, 2017 20:50

vishh added the kind/bug Categorizes issue or PR as related to a bug. label Nov 29, 2017

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Nov 29, 2017

vishh assigned mikedanese Nov 29, 2017

k8s-ci-robot assigned MrHohn Nov 29, 2017

MrHohn approved these changes Nov 29, 2017

View reviewed changes

mikedanese reviewed Nov 29, 2017

View reviewed changes

davidopp added this to the v1.9 milestone Nov 30, 2017

davidopp added the status/approved-for-milestone label Nov 30, 2017

k8s-github-robot added the milestone/needs-attention label Nov 30, 2017

k8s-github-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Nov 30, 2017

k8s-github-robot merged commit d88ce26 into kubernetes:master Nov 30, 2017

derekperkins mentioned this pull request Jul 12, 2018

How to add tolerations to kube-proxy / kube-svc-redirect Azure/AKS#363

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add wildcard tolerations to kube-proxy #56589

Add wildcard tolerations to kube-proxy #56589

rohitagarwal003 commented Nov 29, 2017 •

edited

rohitagarwal003 commented Nov 29, 2017

vishh commented Nov 29, 2017

vishh commented Nov 29, 2017

MrHohn left a comment

rohitagarwal003 commented Nov 29, 2017

jiayingz commented Nov 29, 2017

mikedanese commented Nov 29, 2017

mikedanese Nov 29, 2017

rohitagarwal003 Nov 29, 2017

mikedanese Nov 30, 2017

rohitagarwal003 Nov 30, 2017

vishh Nov 30, 2017

rohitagarwal003 commented Nov 29, 2017 •

edited

k8s-github-robot commented Nov 30, 2017

mikedanese commented Nov 30, 2017

k8s-github-robot commented Nov 30, 2017

k8s-github-robot commented Nov 30, 2017

k8s-github-robot commented Nov 30, 2017

Add wildcard tolerations to kube-proxy #56589

Add wildcard tolerations to kube-proxy #56589

Conversation

rohitagarwal003 commented Nov 29, 2017 • edited

rohitagarwal003 commented Nov 29, 2017

vishh commented Nov 29, 2017

vishh commented Nov 29, 2017

MrHohn left a comment

Choose a reason for hiding this comment

rohitagarwal003 commented Nov 29, 2017

jiayingz commented Nov 29, 2017

mikedanese commented Nov 29, 2017

mikedanese Nov 29, 2017

Choose a reason for hiding this comment

rohitagarwal003 Nov 29, 2017

Choose a reason for hiding this comment

mikedanese Nov 30, 2017

Choose a reason for hiding this comment

rohitagarwal003 Nov 30, 2017

Choose a reason for hiding this comment

vishh Nov 30, 2017

Choose a reason for hiding this comment

rohitagarwal003 commented Nov 29, 2017 • edited

k8s-github-robot commented Nov 30, 2017

mikedanese commented Nov 30, 2017

k8s-github-robot commented Nov 30, 2017

k8s-github-robot commented Nov 30, 2017

k8s-github-robot commented Nov 30, 2017

rohitagarwal003 commented Nov 29, 2017 •

edited

rohitagarwal003 commented Nov 29, 2017 •

edited