Rolling node upgrade #7938
Conversation
+cc @roberthbailey
@@ -102,6 +100,10 @@ function wait-for-master() {
function prepare-upgrade() {
  ensure-temp-dir
  detect-project
  # TODO(mbforbes): Do we still need get-password?
  get-password
  # TODO(mbforbes): Do we need get-bearer-token here?
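For context on the TODOs above, here is a rough sketch of what get-password and get-bearer-token style helpers do; the bodies below are illustrative guesses, not the actual cluster/gce implementations (which may reuse credentials from an existing kubeconfig rather than regenerating them):

```bash
# Illustrative only: hypothetical stand-ins for the real helpers referenced
# in the TODOs above.
function get-password() {
  # Assumed behavior: pick a user and generate a random password.
  KUBE_USER="admin"
  KUBE_PASSWORD="$(head -c 32 /dev/urandom | base64 | tr -dc 'a-zA-Z0-9' | head -c 16)"
}

function get-bearer-token() {
  # Assumed behavior: generate a random token for API access.
  KUBE_BEARER_TOKEN="$(head -c 128 /dev/urandom | base64 | tr -d '=+/' | head -c 32)"
}
```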
I don't see why you'd need the kubectl (client) bearer token to upgrade nodes.
KUBE_BEARER_TOKEN is required by build-kube-env:
https://github.com/GoogleCloudPlatform/kubernetes/blob/9b2a0df811db637442daf93bcffefacaedea6f2a/cluster/gce/debian/helper.sh#L46

However, this comment in configure-vm.sh:
https://github.com/GoogleCloudPlatform/kubernetes/blob/9b2a0df811db637442daf93bcffefacaedea6f2a/cluster/gce/configure-vm.sh#L256

says that we don't need these data (KUBE_PASSWORD, KUBE_USER, KUBE_BEARER_TOKEN, KUBELET_TOKEN, KUBE_PROXY_TOKEN) except on init, and specifically not on an upgrade. However, it might be assuming that an upgrade happens in place and retains the data written at cluster init; with the upgrade process introduced in this PR, that isn't true, because we blow away the node.

The call to the function discussed in the previous paragraph, create-salt-auth, happens for both master and nodes:
https://github.com/GoogleCloudPlatform/kubernetes/blob/9b2a0df811db637442daf93bcffefacaedea6f2a/cluster/gce/configure-vm.sh#L456

so I don't know whether we can just remove these vars. @zmerlynn should probably chime in here.
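To make the distinction concrete, here is a rough sketch, under my own assumptions, of the kind of check being debated: strict on initial bring-up, relaxed on upgrade. The function name, flag, and YAML keys are illustrative; this is not the real build-kube-env.

```bash
# Hypothetical sketch, not the actual build-kube-env: require the auth vars
# only on initial cluster bring-up, and tolerate empty values on an upgrade,
# where the node is expected to re-read the real tokens from existing metadata.
function build-kube-env-sketch() {
  local is_initial_boot="$1"   # assumed flag: "true" on kube-up, "false" on upgrade

  if [[ "${is_initial_boot}" == "true" ]]; then
    # On init these must all be present, or we fail fast.
    for var in KUBE_USER KUBE_PASSWORD KUBE_BEARER_TOKEN KUBELET_TOKEN KUBE_PROXY_TOKEN; do
      [[ -n "${!var:-}" ]] || { echo "missing required ${var}" >&2; return 1; }
    done
  fi

  # Emit whatever we have; empty values are tolerated on upgrade.
  cat <<EOF
KUBE_USER: '${KUBE_USER:-}'
KUBE_PASSWORD: '${KUBE_PASSWORD:-}'
KUBE_BEARER_TOKEN: '${KUBE_BEARER_TOKEN:-}'
KUBELET_TOKEN: '${KUBELET_TOKEN:-}'
KUBE_PROXY_TOKEN: '${KUBE_PROXY_TOKEN:-}'
EOF
}
```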
We shouldn't need auth data when updating the master, since the master stores the auth data on a PD which survives rebuilding the VM (and if we lose that data PD we are pretty well screwed anyway since it contains the cluster state).
For the nodes, you are already scraping the kubelet token and kube proxy token from the existing metadata, so that can be used to fill in those two fields.
Unless I'm reading this incorrectly, this is within upgrade.sh so it shouldn't affect cluster provisioning to remove these here.
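As a hedged illustration of what "scraping the kubelet token and kube proxy token from the existing metadata" could look like: the node name, zone, kube-env metadata key, and the grep/cut parsing below are assumptions, not necessarily how this PR does it.

```bash
# Illustration only: read the instance's kube-env metadata (key name assumed)
# via the GCE metadata server from the node itself, then pick out the two
# token lines we want to preserve across the rebuild.
NODE="kubernetes-minion-abcd"   # hypothetical node name
ZONE="us-central1-b"            # hypothetical zone

kube_env="$(gcloud compute ssh "${NODE}" --zone "${ZONE}" --command \
  "curl -s -H 'Metadata-Flavor: Google' http://metadata.google.internal/computeMetadata/v1/instance/attributes/kube-env")"

# kube-env is a list of KEY: 'value' lines; extract the values.
KUBELET_TOKEN="$(echo "${kube_env}" | grep '^KUBELET_TOKEN' | cut -d"'" -f2)"
KUBE_PROXY_TOKEN="$(echo "${kube_env}" | grep '^KUBE_PROXY_TOKEN' | cut -d"'" -f2)"
```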
I've moved this to the master-only section since #8009 has been merged.
Can you explain what happens during the rolling update? Do the new VMs have different names than the original VMs? Does a new VM get created (and healthchecked in any way) before deleting an existing VM? How does the rolling update command know when to start updating the second VM?
@roberthbailey these answers are from my current understanding of MIG rolling-updates.

Do the new VMs have different names than the original VMs? Does a new VM get created (and healthchecked in any way) before deleting an existing VM?

A rolling-update ensures that the state of the new VM is "running," but it is not healthchecked in any other way, it does not support pluggable healthchecking, and it will not try to repair a VM if it is not "running"; it will simply pause the rolling-update.

How does the rolling update command know when to start updating the second VM?

It proceeds to the second as soon as the first is finished. It determines that the first instance is finished as soon as it is "running". It will wait up to …
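A minimal sketch of the "finished as soon as it is running" check described above; the instance name, zone, and polling interval are assumptions, not the rolling-update implementation itself.

```bash
# Poll an instance's status until it reports RUNNING, which is the only
# condition the rolling-update waits for before moving on (per the answer above).
INSTANCE="kubernetes-minion-abcd"   # hypothetical
ZONE="us-central1-b"                # hypothetical

until [[ "$(gcloud compute instances describe "${INSTANCE}" --zone "${ZONE}" \
            --format='value(status)')" == "RUNNING" ]]; do
  echo "waiting for ${INSTANCE} to reach RUNNING..."
  sleep 5
done
echo "${INSTANCE} is RUNNING; the rolling-update would move on to the next VM."
```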
LGTM
I think this might actually be ready to get in as a draft. I've addressed @roberthbailey's shell comments and pushed my newer version. There are still a handful of cleanup TODOs, but the core of what we care about is in. This also fixes master upgrade, which has drifted. @roberthbailey I've updated my comment response above that answers the questions you posed; let me know if it's adequate. @zmerlynn LGTY?
Well, that was fast.
Fixes #6088
After doing this and waiting a bit, kubectl get nodes reports nodes as Unknown and then back to Ready. Also, kubectl get pods reports the pods on the minions as Pending and then eventually Running.

I have some confidence this might actually be the true state of the world, because I screwed up the kube/kube-proxy tokens in an earlier implementation, and neither of the above were true.
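A rough way to watch the behavior described above after kicking off the upgrade; the loop and sleep interval are just an illustration, not part of the PR.

```bash
# Periodically check node and pod status while the rolling upgrade runs:
# nodes go Ready -> Unknown -> Ready, pods on replaced minions go Pending -> Running.
watch_upgrade() {
  for i in $(seq 1 30); do
    echo "--- check ${i} ---"
    kubectl get nodes
    kubectl get pods
    sleep 20
  done
}
```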
BTW, the validate-cluster step doesn't actually do what's intended right now, I think, because it returned successfully well before my cluster was working again.
Comments / feedback welcome. And yes, definitely want to bake this into a(n e2e) test ASAP.