Detect when drain operation kills kured pod #14

awh · 2018-04-16T10:54:59Z

In theory, kubectl drain should not kill the kured pod, since it belongs to a DaemonSet. We rely on this behaviour because we need to command the reboot after the drain operation completes. However, we have seen the kured pod killed during drain when the embedded version of kubectl differs too much from the server (specifically, kubectl 1.7.x against server 1.9.x), resulting in a never-ending cycle of lock/drain/restart/unlock without the reboot ever occurring.

Possible fixes:

- Detect client/server version mismatch on startup and refuse to operate (we should probably warn on this anyway)
- Ignore TERM signals after drain commences, and have a long enough terminationGracePeriodSeconds that we can complete (problem: how long is long enough?)
- Catch TERM during drain and print a warning message
- Catch TERM during drain, and stash some information in the lock so that we don't cycle endlessly on restart
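
As a rough illustration of the first option, here is a minimal sketch of a startup check that compares the minor versions of the embedded kubectl and the server. It is not taken from kured: the names minorVersion, minorSkew, checkSkew and maxSkew are hypothetical, the version strings are assumed to have been obtained elsewhere, and the threshold of one minor version is an assumption borrowed from the documented kubectl skew policy. Major versions are ignored for brevity.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// maxSkew is an assumed threshold: how many minor versions the embedded
// kubectl may differ from the server before we refuse to drain.
const maxSkew = 1

// minorVersion extracts the minor component from a version string such as
// "v1.9.3", "1.7.x" or "1.9+". (Hypothetical helper, not part of kured.)
func minorVersion(v string) (int, error) {
	parts := strings.Split(strings.TrimPrefix(v, "v"), ".")
	if len(parts) < 2 {
		return 0, fmt.Errorf("cannot parse version %q", v)
	}
	return strconv.Atoi(strings.TrimSuffix(parts[1], "+"))
}

// minorSkew returns the absolute difference between the client and server
// minor versions.
func minorSkew(client, server string) (int, error) {
	c, err := minorVersion(client)
	if err != nil {
		return 0, err
	}
	s, err := minorVersion(server)
	if err != nil {
		return 0, err
	}
	if c > s {
		return c - s, nil
	}
	return s - c, nil
}

// checkSkew returns an error when the skew exceeds maxSkew, so the caller
// can warn or refuse to operate.
func checkSkew(client, server string) error {
	skew, err := minorSkew(client, server)
	if err != nil {
		return err
	}
	if skew > maxSkew {
		return fmt.Errorf("kubectl %s is %d minor versions away from server %s",
			client, skew, server)
	}
	return nil
}

func main() {
	// The combination reported in this issue.
	if err := checkSkew("1.7.0", "1.9.0"); err != nil {
		fmt.Println("refusing to drain:", err)
	}
}
```

Whether to refuse outright or only warn is a policy choice; the check itself is the same either way.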


neumanndaniel · 2018-04-24T10:55:07Z

Regarding this: it would be good if you could maintain a section in the documentation listing which image version runs which kubectl version.

Additionally, it would be great if the latest image were tagged with latest.

Thanks!

evrardjp · 2020-03-05T08:48:16Z

I am curious: does this still happen? Should we try to reproduce it nowadays?

github-actions · 2020-11-27T01:46:35Z

This issue was automatically considered stale due to lack of activity. Please update it and/or join our slack channels to promote it, before it automatically closes (in 7 days).

neumanndaniel mentioned this issue Apr 24, 2018

Node not rebooted, pods are drained and restarted #17

Closed

awh added the enhancement label Oct 30, 2018

github-actions bot added the no-issue-activity label Nov 27, 2020

github-actions bot closed this as completed Dec 4, 2020
