Single node "best-effort drain" during upgrade #3445
Labels
kind:bug
Something isn't working
release:blocker
An issue that blocks a release until resolved
topic:lifecycle
Issues related to upgrade or downgrade of MetalK8s
Milestone
Component:
'lifecycle'
What happened:
Due to a "bug" (:question:) in kubelet, upgrading a single-node cluster with a bunch of Pods running fails because restarting the static Pods (like the apiserver) takes too much time.
See: kubernetes/kubernetes#103658
What was expected:
Single-node upgrade should work properly.
Resolution proposal (optional):
To avoid this kind of issue, let's drain the node during the upgrade, even though it is not strictly needed: since this is a single-node cluster, every service may/will have downtime during the upgrade process anyway.
A full drain may not be possible on a single node, since there is no other node to reschedule the required Pods onto (e.g. if some PodDisruptionBudget is configured in the cluster).
Add a "best-effort" drain, used for single-node clusters during the upgrade: this drain just cordons the node and evicts every Pod it can, and does not retry or fail when a Pod cannot be evicted; it simply continues with the "classic upgrade process".
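Roughly, the proposed flow could look like the sketch below. The helper names (`cordon`, `evict`) are placeholders, not the actual MetalK8s/Salt implementation; in practice eviction would go through the Kubernetes Eviction API, and a refusal (e.g. a PodDisruptionBudget violation) is what gets skipped instead of retried:

```python
def best_effort_drain(node, pods, cordon, evict):
    """Cordon `node`, then try to evict each Pod exactly once.

    Unlike a regular drain, an eviction failure (e.g. blocked by a
    PodDisruptionBudget) is recorded and skipped rather than retried,
    so the upgrade can always proceed. Returns the Pods left running.
    """
    cordon(node)  # mark the node unschedulable
    skipped = []
    for pod in pods:
        try:
            evict(pod)
        except Exception:
            # Eviction refused: do not retry, do not fail the upgrade.
            skipped.append(pod)
    return skipped


# Toy usage: one Pod is "protected" by a PDB and cannot be evicted.
def fake_evict(pod):
    if pod == "prometheus-0":
        raise RuntimeError("would violate PodDisruptionBudget")

left_over = best_effort_drain(
    "bootstrap",
    ["nginx-abc", "prometheus-0"],
    cordon=lambda node: None,
    evict=fake_evict,
)
print(left_over)  # the upgrade continues with these Pods still running
```

The key design point is that eviction errors are swallowed on purpose: on a single node there is nowhere else to schedule the Pod, so retrying would only reproduce the timeout this issue describes.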