
Allow Kubernetes downgrade when restoring etcd snapshot #22232

Closed

dnoland1 opened this issue Aug 16, 2019 · 7 comments
Labels: kind/feature (Issues that represent larger new pieces of functionality, not enhancements to existing functionality)

Comments
@dnoland1
Contributor

What kind of request is this (question/bug/enhancement/feature request):
Feature request

Description
The Kubernetes upgrade documentation at https://rancher.com/docs/rancher/v2.x/en/cluster-admin/editing-clusters/#upgrading-kubernetes recommends backing up (taking an etcd snapshot) before doing an upgrade. This implies that if you want to revert an upgrade, you should be able to restore the etcd snapshot and return your Kubernetes cluster to the version it was running before the upgrade. However, if you attempt to restore the etcd snapshot, it will not revert the upgrade; it only restores the state of the etcd data, and Rancher will still attempt to upgrade the Kubernetes cluster.

Users should be able to revert an upgrade and return their cluster to the Kubernetes version that corresponds to when the etcd snapshot was taken. For example, a user is on Kubernetes v1.14.0, takes an etcd snapshot, then upgrades to v1.14.5. The user should be able to restore the etcd snapshot and downgrade Kubernetes to v1.14.0. This would involve reverting all Kubernetes components (kube-apiserver, kube-controller-manager, kube-proxy, kube-scheduler, and kubelet) to the previous version.
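For concreteness, a minimal way to check whether a restore actually rolled the cluster back, using only standard kubectl commands (nothing Rancher-specific; version numbers follow the example above):

```sh
# Control-plane view: after restoring a snapshot taken on v1.14.0, the
# expectation in this issue is that the server version reads v1.14.0
# again rather than the upgraded v1.14.5.
kubectl version --short

# Node view: the VERSION column reflects each node's kubelet version,
# which should also be rolled back if all components are reverted.
kubectl get nodes
```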

@ajfriesen

Ran into this today.

We wanted to test an upgrade. The upgrade failed due to a bug with kube-dns.
We had a snapshot and tried to restore it.

This did not work properly.

For the first few seconds, kubectl get nodes showed the Kubernetes version from the snapshot taken before the upgrade, but then the view suddenly switched to the new Kubernetes version we had wanted to upgrade to (for testing, at least).

Also, kubectl get nodes and the Rancher UI showed us different workers, which I wonder about.
We had workers in kubectl get nodes which did not appear in the Rancher UI.
I thought they could not technically differ.
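(Illustrative only, not from the original comment: one way to watch for the version flip described above, assuming shell access with kubectl configured against the cluster.)

```sh
# Poll the kubelet version reported by each node every few seconds while
# the restore runs; .status.nodeInfo.kubeletVersion is the standard Node
# status field that `kubectl get nodes` summarizes in its VERSION column.
watch -n 5 "kubectl get nodes -o custom-columns=NAME:.metadata.name,VERSION:.status.nodeInfo.kubeletVersion"
```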

@dnoland1 dnoland1 changed the title Allow downgrade when restoring etcd snapshot Allow Kubernetes downgrade when restoring etcd snapshot Oct 31, 2019
@chrisbulgaria

Yes - this would be a great enhancement!

@cloudnautique
Contributor

Distilling this down a bit:

  1. A user needs to be able to optionally take a snapshot when initiating the upgrade. This snapshot would be tied to a specific Kubernetes version.
  2. The user should have the option to roll back the upgrade. This means restoring the cluster, including all Kubernetes components and the etcd database, back to the pre-upgrade configuration.

Since etcd snapshots will have the Kubernetes version tied to them going forward, users should be able to see which version of k8s a backup was taken at.

@soumyalj

soumyalj commented Mar 2, 2020

Tested with the master-head branch.
Verified that the restoreRkeConfig field is added during etcd restore. We can restore both the K8s version and the cluster config, or just the K8s version. Cluster restore for the below combination of tests was done with a local backup:

[Screenshot: local backup test combinations, 2020-03-02]

Regression tests were also performed.
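(A rough sketch of driving this restore through the Rancher v3 API rather than the UI. The restoreRkeConfig field name comes from the comment above; the action name, endpoint shape, token format, and accepted values are assumptions here and should be checked against the API browser for your Rancher version.)

```sh
# Hypothetical example: restore an etcd backup and roll back only the
# Kubernetes version (as opposed to the full cluster config as well).
RANCHER_URL=https://rancher.example.com      # assumed Rancher server URL
API_TOKEN=token-xxxxx:yyyyyyyy               # hypothetical API token
CLUSTER_ID=c-abcde                           # hypothetical cluster id
BACKUP_ID=c-abcde:etcd-backup-1              # hypothetical etcd backup id

curl -sk -u "$API_TOKEN" \
  -H 'Content-Type: application/json' \
  -d "{\"etcdBackupId\": \"$BACKUP_ID\", \"restoreRkeConfig\": \"kubernetesVersion\"}" \
  "$RANCHER_URL/v3/clusters/$CLUSTER_ID?action=restoreFromEtcdBackup"
```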

@izaac
Contributor

izaac commented Mar 2, 2020

I tested the feature in 2.4 master-head with S3 backup enabled, covering the same test combinations as commented by @soumyalj in #22232 (comment).

Including the same regression tests.

@izaac
Contributor

izaac commented Mar 4, 2020

Rancher version 2.4, commit id 78ee11a (master-head, 03/03/2020)

  • Validated the upgrade scenarios with S3/Minio enabled.
  • Created a cluster as a Standard User to validate P1 cases with Minio backup storage.

Found issue #25744 while creating a cluster as a Standard user; that will be tracked separately.

cc @soumyalj

@soumyalj

soumyalj commented Mar 4, 2020

Tested with 2.4 master-head (2a7415a2a190).
Validated upgrade scenarios with snapshot restore after upgrade from v2.3.x to master-head.

@soumyalj soumyalj closed this as completed Mar 4, 2020
@zube zube bot removed the [zube]: Done label Oct 13, 2020