I'm using Kured on Azure with an ACS Engine-generated cluster, and I can see that nodes are being drained and refilled, but it looks like they are not being rebooted.
For example, a reboot-required flag was set at 23:43 on April 13th for node k8s-agents-27478824-4:
$ ls -al
...
-rw-r--r-- 1 root root 0 Apr 13 23:43 reboot-required
...
And I can see Kured triggering, draining and refilling nodes with pods:
$ kubectl get pods --all-namespaces -o wide
NAMESPACE NAME READY STATUS RESTARTS AGE IP NODE
cassandra cassandra-cassandra-0 1/1 Running 1 3d 10.30.0.37 k8s-agents-27478824-3
cassandra cassandra-cassandra-1 0/1 Pending 0 6s
...
Sadly, this happens EVERY hour without fail. Digging into it, the cause appears to be that the nodes are never actually rebooted:
$ last reboot
reboot system boot 4.13.0-1011-azur Fri Apr 13 23:17 still running
reboot system boot 4.13.0-1011-azur Sun Apr 8 19:21 still running
(Note that the last reboot time is before the timestamp of the reboot-required file.)
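The same comparison can be done directly on the node. The sketch below is a hypothetical diagnostic, not part of Kured itself; it assumes the Ubuntu/Debian default sentinel path and a Linux /proc filesystem, deriving the boot time from /proc/uptime and comparing it with the sentinel's mtime:

```shell
# Hypothetical diagnostic: did this node boot *after* the reboot-required
# flag appeared? Compares the sentinel's modification time (Ubuntu's
# default path) against the boot time derived from /proc/uptime.
sentinel=/var/run/reboot-required
boot_epoch=$(( $(date +%s) - $(cut -d. -f1 /proc/uptime) ))
if [ ! -f "$sentinel" ]; then
  echo "no reboot required"
elif [ "$boot_epoch" -gt "$(stat -c %Y "$sentinel")" ]; then
  echo "rebooted since flag was set"
else
  echo "reboot still pending"
fi
```

In the situation above this would print "reboot still pending": the boot recorded by `last reboot` (23:17) predates the flag (23:43).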
Is there something I need to do with Kured in order to tell it how to reboot nodes etc.? Or is this a bug?
Using the latest image version master-b27aaa1 resolves the issue. As described in #14, this comes from the kubectl version drift between server and client; master-b27aaa1 bundles kubectl version 1.9.6.
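Kubernetes only supports a kubectl client within one minor version of the API server, which is why the bundled 1.9.6 client fixes the drain behaviour here. A small helper (hypothetical, not part of Kured) to flag that kind of skew from two version strings:

```shell
# Hypothetical skew check: compare the minor numbers of a client and a
# server version string, and fail when they differ by more than one
# minor release (the officially supported skew for kubectl).
skew_ok() {
  c_minor=$(echo "$1" | cut -d. -f2)
  s_minor=$(echo "$2" | cut -d. -f2)
  diff=$(( c_minor - s_minor ))
  [ "${diff#-}" -le 1 ]   # ${diff#-} strips a leading minus: absolute value
}
skew_ok "1.9.6" "1.9.7" && echo "skew OK" || echo "skew too large"
```

Running `kubectl version` against the cluster and feeding the reported client and server versions to a check like this would have surfaced the mismatch before the hourly drain loop started.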