move "kubectl drain" into the server #25625

davidopp · 2016-05-15T06:57:58Z

random note: @roberthbailey had suggested it might be useful to have a "dry run" mode where you just ask which is the best node to drain, or best N of some set, but don't drain it. not sure how you express that using a REST API though. also can't remember what the use case was (it might have been for autoscaling scale-down, so you know which node is best to remove?)
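
To make the "dry run" idea concrete, here is a minimal sketch of ranking candidate nodes without draining any of them. Nothing in it comes from Kubernetes itself: `NodeInfo`, `best_nodes_to_drain`, and the "fewest pods to move" ranking are all hypothetical, chosen only to illustrate asking for the best N of some set.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class NodeInfo:
    """Hypothetical summary of a node; not a Kubernetes API type."""
    name: str
    evictable_pods: int  # pods that would have to be rescheduled
    blocked_pods: int    # pods that could not be evicted right now


def best_nodes_to_drain(nodes: List[NodeInfo], n: int = 1) -> List[str]:
    """Dry run: return the names of the n cheapest nodes to drain.

    Assumed policy: skip nodes with blocked pods, then prefer nodes with
    the fewest pods to move. No node is modified.
    """
    candidates = [node for node in nodes if node.blocked_pods == 0]
    ranked = sorted(candidates, key=lambda node: (node.evictable_pods, node.name))
    return [node.name for node in ranked[:n]]
```

A caller such as an autoscaler could use the returned names to decide which node to remove and then manage the drain itself.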

as an aside, we need to consolidate all the issues related to this: #7351, #6080, #6079, #3885, ...

roberthbailey · 2016-05-16T07:13:21Z

I think that applies to both autoscaling down and node upgrades.

/cc @mwielgus @ihmccreery

resouer · 2016-06-19T14:19:29Z

@roberthbailey I'm wondering whether this is really a common use case that is worth doing. Could you explain in more detail, for example, how it applies to autoscaling down?

roberthbailey · 2016-06-20T18:35:07Z

When removing a node from a cluster (either because an autoscaler decides to free up space or a user requests the release of resources), we should drain the node before deleting it from the cluster. If we want to build this into automation (e.g. autoscaling), then we don't want to rely on the drain command only existing in the kubectl client that is intended to be used by a human.
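
The ordering described above (drain first, then delete) can be sketched as follows. This is only an illustration against an in-memory stand-in: `FakeCluster` and `remove_node` are hypothetical names, not a real Kubernetes client, and the sketch assumes that a drain means cordoning the node and then evicting its pods.

```python
class FakeCluster:
    """In-memory stand-in for a cluster; not a real Kubernetes client."""

    def __init__(self, pods_by_node):
        self.pods_by_node = {node: list(pods) for node, pods in pods_by_node.items()}
        self.unschedulable = set()
        self.log = []

    def cordon(self, node):
        # Mark the node so that no new pods are scheduled onto it.
        self.unschedulable.add(node)
        self.log.append(("cordon", node))

    def evict(self, node, pod):
        self.pods_by_node[node].remove(pod)
        self.log.append(("evict", pod))

    def delete_node(self, node):
        # Refuse to delete a node that has not been drained.
        if self.pods_by_node[node]:
            raise RuntimeError(f"node {node} still has pods")
        del self.pods_by_node[node]
        self.log.append(("delete", node))


def remove_node(cluster, node):
    """Cordon, evict every pod, and only then delete the node."""
    cluster.cordon(node)
    for pod in list(cluster.pods_by_node[node]):
        cluster.evict(node, pod)
    cluster.delete_node(node)
```

Automation that skips the drain step would hit the `RuntimeError` in this sketch, which is the failure a server-side drain is meant to rule out.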

davidopp · 2016-06-21T05:40:23Z

@roberthbailey I agree with that, but I think the question might have been why the dry-run mode / ability to just ask which is the best node to drain but not actually drain it is useful. TBH I can't remember why you suggested it. Presumably it's because the client (e.g. autoscaler) wants to manage the drain itself?

roberthbailey · 2016-06-21T06:47:06Z

I think it was for upgrades. But I don't recall why that would be better than just asking the server to do it.

hjacobs · 2017-02-11T09:25:24Z

I would like to see kubectl drain on the server side. kubectl drain is currently unreliable for us and fails in many cases (e.g. when hitting Job resources on some node). We are evaluating the best approach to get proper "safe" autoscaling on AWS (we are already doing it in a non-safe manner with https://github.com/hjacobs/kube-aws-autoscaler).

davidopp · 2017-03-04T08:37:37Z

One model for this: you ask the system to drain N nodes and give it some parameters controlling the choice of nodes, the number drained simultaneously, etc. It picks the nodes, does the drains, and gives a callback to an HTTP endpoint you specify when it is done.
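
A rough sketch of that request model, under stated assumptions: `DrainRequest`, `plan_batches`, and `run_drains` are hypothetical names, the alphabetical selection is a placeholder policy, and a plain Python callable stands in for the HTTP callback.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class DrainRequest:
    """Hypothetical request shape; not an existing Kubernetes API object."""
    count: int                            # N: how many nodes to drain
    max_simultaneous: int                 # how many drains may run at once
    on_done: Callable[[List[str]], None]  # stands in for the HTTP callback


def plan_batches(candidates: List[str], request: DrainRequest) -> List[List[str]]:
    """Pick the nodes and group them into batches of at most max_simultaneous."""
    if request.max_simultaneous < 1:
        raise ValueError("max_simultaneous must be at least 1")
    chosen = sorted(candidates)[:request.count]  # placeholder selection policy
    size = request.max_simultaneous
    return [chosen[i:i + size] for i in range(0, len(chosen), size)]


def run_drains(candidates: List[str], request: DrainRequest,
               drain_node: Callable[[str], None]) -> List[str]:
    """Drain batch by batch, then report the drained nodes via the callback."""
    drained: List[str] = []
    for batch in plan_batches(candidates, request):
        for node in batch:  # a real system would drain a batch concurrently
            drain_node(node)
            drained.append(node)
    request.on_done(list(drained))
    return drained
```

The caller supplies only the parameters and the callback; which nodes are picked and in what order is left to the system, which is the point of the model.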

BTW, the Mesos machine maintenance model is described here:
https://github.com/apache/mesos/blob/master/docs/maintenance.md

erictune · 2017-04-05T17:47:07Z

There would need to be a timeout or a way to cancel a drain that is taking too long.
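
One way to picture both requirements is a drain loop that checks a deadline and a cancellation flag before every eviction attempt. This is a sketch only: `drain_with_deadline`, its parameters, and the `evict` callable (returning `True` once a pod has been evicted) are hypothetical, not part of any existing API.

```python
import threading
import time


def drain_with_deadline(pods, evict, timeout_seconds, cancel=None, retry_delay=0.01):
    """Evict pods one at a time until done, cancelled, or out of time.

    Returns (status, remaining_pods), where status is "drained",
    "cancelled", or "timed_out".
    """
    cancel = cancel or threading.Event()
    deadline = time.monotonic() + timeout_seconds
    remaining = list(pods)
    while remaining:
        if cancel.is_set():
            return "cancelled", remaining
        if time.monotonic() >= deadline:
            return "timed_out", remaining
        if evict(remaining[0]):
            remaining.pop(0)
        else:
            # Eviction refused for now; wait briefly and try again.
            time.sleep(retry_delay)
    return "drained", remaining
```

Returning the pods that are still on the node lets the caller decide whether to retry, force the removal, or give up and uncordon.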

davidopp · 2017-04-15T20:31:28Z

Related:
https://github.com/jamiehannaford/coreos-reboot-operator

mml · 2017-09-08T17:19:14Z

One new compelling reason to do this is:

We expect the operational definition of drain and cordon to change, e.g. 'Make "kubectl drain" use taint instead of Unschedulable' (#44944), but N.B. that at the moment there is neither a clear roadmap nor a timeline.
This logic is embedded in kubectl right now (mea culpa).
Authors of other clients (reasonably) want to treat drain and cordon as cluster primitives, e.g. "Add drain and cordon functions" (kubernetes-client/python-base#32).

We now have to choose between responsibility for the correctness of these implementations (including version skew, matrix testing), disallowing them entirely, or deferring the correctness problems until later with disclaimers (i.e. create tech debt). None of these is satisfactory.

mml · 2017-09-08T20:04:59Z

@davidopp or @timothysc per my last message, is there any chance this could be prioritized for 1.9 or maybe next year?

timothysc · 2017-09-11T16:05:41Z

It's entirely based on resources and folks willing to show up and do the work.

mml · 2017-09-12T22:22:21Z

@timothysc Can I extrapolate that as of right now, no one has shown up and indicated they wish to do this work?

timothysc · 2017-09-14T14:17:39Z

> @timothysc Can I extrapolate that as of right now, no one has shown up and indicated they wish to do this work?

Yes.

fejta-bot · 2018-01-05T17:14:38Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

Prevent issues from auto-closing with an /lifecycle frozen comment.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or @fejta.
/lifecycle stale

redbaron · 2018-01-07T18:08:35Z

/remove-lifecycle stale

Danil-Grigorev · 2021-07-07T12:28:51Z

@fabiand In sig-cloud-provider - cc @andrewsykim @cheftako

k8s-triage-robot · 2021-10-05T13:19:43Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue or PR as fresh with /remove-lifecycle stale
Mark this issue or PR as rotten with /lifecycle rotten
Close this issue or PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

florianstoeber · 2021-10-10T18:05:47Z

/remove-lifecycle stale

k8s-triage-robot · 2022-01-08T18:26:10Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue or PR as fresh with /remove-lifecycle stale
Mark this issue or PR as rotten with /lifecycle rotten
Close this issue or PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

redbaron · 2022-01-08T18:32:41Z

/remove-lifecycle stale

k8s-triage-robot · 2022-04-08T18:35:03Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue or PR as fresh with /remove-lifecycle stale
Mark this issue or PR as rotten with /lifecycle rotten
Close this issue or PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

k8s-triage-robot · 2022-05-08T19:33:26Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue or PR as fresh with /remove-lifecycle rotten
Close this issue or PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

k8s-triage-robot · 2022-06-07T20:14:26Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Reopen this issue or PR with /reopen
Mark this issue or PR as fresh with /remove-lifecycle rotten
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close

k8s-ci-robot · 2022-06-07T20:14:42Z

@k8s-triage-robot: Closing this issue.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

redbaron · 2022-06-07T20:16:10Z

/reopen

k8s-ci-robot · 2022-06-07T20:16:25Z

@redbaron: You can't reopen an issue/PR unless you authored it or you are a collaborator.

In response to this:

/reopen

k8s-ci-robot · 2022-06-08T07:17:25Z

@yanirq: You can't reopen an issue/PR unless you authored it or you are a collaborator.

In response to this:

/reopen

redbaron · 2022-06-08T07:43:42Z

OK, @k8s-ci-robot , you won. Of course lack of spamming on a well understood issue just waiting to be implemented is a clear sign that issue is not relevant anymore, I am with you. Good work.

Abirdcfly · 2022-06-08T07:52:20Z

/reopen

k8s-ci-robot · 2022-06-08T07:52:35Z

@Abirdcfly: Reopened this issue.

In response to this:

/reopen

k8s-ci-robot · 2022-06-08T07:52:41Z

@davidopp: This issue is currently awaiting triage.

If a SIG or subproject determines this is a relevant issue, they will accept it by applying the triage/accepted label and provide further guidance.

The triage/accepted label can be added by org members by writing /triage accepted in a comment.

Abirdcfly · 2022-06-08T07:55:25Z

@redbaron Please go on...😂
PS: I think the collaborator here is confusing🤔️

k8s-triage-robot · 2022-07-08T10:33:57Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Reopen this issue or PR with /reopen
Mark this issue or PR as fresh with /remove-lifecycle rotten
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close

k8s-ci-robot · 2022-07-08T10:34:16Z

@k8s-triage-robot: Closing this issue.

davidopp added priority/backlog Higher priority than priority/awaiting-more-evidence. team/control-plane labels May 15, 2016

hjacobs mentioned this issue Feb 11, 2017

Check out kube-node-drainer zalando-incubator/kubernetes-on-aws#133

Closed

davidopp added the sig/scheduling Categorizes an issue or PR as relevant to SIG Scheduling. label Feb 11, 2017

0xmichalis mentioned this issue Mar 10, 2017

Move kubectl client logic into server #12143

Closed

bgrant0607 added area/node-lifecycle Issues or PRs related to Node lifecycle and removed team/control-plane (deprecated - do not use) labels Mar 23, 2017

ahmetb mentioned this issue May 22, 2017

Support node selectors for node, drain, taint and apply to new nodes #46231

Closed

mbohlool mentioned this issue Sep 8, 2017

Add drain and cordon functions. kubernetes-client/python-base#32

Closed

timothysc added the help wanted Denotes an issue that needs help from a contributor. Must meet "help wanted" guidelines. label Sep 14, 2017

timothysc added this to the next-candidate milestone Sep 14, 2017

mvladev mentioned this issue Nov 3, 2017

Minimalistic Machines API proposal. kubernetes-retired/kube-deploy#298

Merged

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 5, 2018

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 7, 2018

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Oct 5, 2021

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Oct 10, 2021

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 8, 2022

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 8, 2022

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Apr 8, 2022

k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels May 8, 2022

k8s-ci-robot closed this as completed Jun 7, 2022

k8s-ci-robot reopened this Jun 8, 2022

k8s-ci-robot added the needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. label Jun 8, 2022

k8s-ci-robot closed this as completed Jul 8, 2022
