
Upgrading Kubernetes #75

Open
guiocavalcanti opened this issue Sep 29, 2016 · 20 comments

@guiocavalcanti

Do you have plans on using k8s 1.4.0? If not, how can I upgrade my version?

@wellsie
Member

wellsie commented Sep 29, 2016

Yes. Will release update later today.


@owenmorgan

Will we be able to update an existing cluster?

@adambom

adambom commented Oct 1, 2016

@owenmorgan upgrading the k8s version requires deleting the etcd cluster, where all the kubernetes state is stored on ephemeral disk. I have a forked version of tack where etcd state is persisted on an EBS volume, and that works beautifully. Would anybody be interested in a PR to contribute that back to tack (@wellsie)? Let me know and I will clean up and submit.

In the meantime, you can use a simple workaround to recover from losing the etcd cluster. Before upgrading, you would run this code snippet.

This will allow you to recover all cluster state (including PV's). ELB's will be regenerated, so update any DNS records accordingly.
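A rough sketch of that kind of pre-upgrade backup (assuming kubectl access; the resource list and the k8s-backup directory name are illustrative, not the original snippet):

# Sketch only: dump API objects to YAML so they can be re-applied
# once the new etcd cluster is up.
mkdir -p k8s-backup
kubectl get pv -o yaml > k8s-backup/persistentvolumes.yaml
kubectl get namespaces -o yaml > k8s-backup/namespaces.yaml
for r in deployments services pvc configmaps secrets; do
  kubectl get "$r" --all-namespaces -o yaml > "k8s-backup/$r.yaml"
done
# After the new cluster is up: kubectl apply -f k8s-backup/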

@owenmorgan

Thanks @adambom. How are we looking on an update, @wellsie?

@adambom

adambom commented Oct 4, 2016

@owenmorgan looks like it was patched in 8f2a62e

@adambom

adambom commented Oct 4, 2016

Oh, one other thing you'll need to do when you upgrade is taint or manually update the S3 bucket, so that the files in manifests/etc.tar point to the version of k8s you want to use. Otherwise the update won't actually take.
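If you take the taint route, a rough sketch would be something like the following; the resource address is a guess, so check your Terraform state for the actual name of the manifests object:

# Hypothetical resource address; confirm it with: terraform state list
terraform taint aws_s3_bucket_object.etc
terraform plan
terraform apply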

@owenmorgan

Great, I'll give it a shot. Thanks @wellsie @adambom

@owenmorgan

Is the backup/restore still necessary, @adambom?

@wellsie
Member

wellsie commented Oct 4, 2016

I recommend upgrading the cluster manually. I will write up the procedure later this week; in the meantime here is the basic process:

update kubelet.service on worker nodes

  • ssh into each node and update KUBELET_VERSION in /etc/systemd/system/kubelet.service

make instances (new with #77) will dump the IPs of all nodes, masters (etcd, apiserver) and workers. Do make ssh-bastion and then from there ssh into each box one at a time.
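For the edit itself, something along these lines should work on each worker (a sketch; the exact KUBELET_VERSION value in the unit file may differ, and the target tag below is just an example):

# Sketch: bump the kubelet version in the unit file and restart the kubelet.
sudo sed -i 's/KUBELET_VERSION=v[^ "]*/KUBELET_VERSION=v1.4.0_coreos.0/' /etc/systemd/system/kubelet.service
sudo systemctl daemon-reload
sudo systemctl restart kubelet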

update kubelet.service on etcd/apiserver nodes

repeat the above procedure for the master (etcd,apiserver) nodes.

update version in kubernetes manifests on etcd/apiserver nodes

grep 1.4 /etc/kubernetes/manifests/*
/etc/kubernetes/manifests/kube-apiserver.yml:    image: quay.io/coreos/hyperkube:v1.4.0_coreos.0
/etc/kubernetes/manifests/kube-controller-manager.yml:    image: quay.io/coreos/hyperkube:v1.4.0_coreos.0
/etc/kubernetes/manifests/kube-proxy.yml:    image: quay.io/coreos/hyperkube:v1.4.0_coreos.0
/etc/kubernetes/manifests/kube-scheduler.yml:    image: quay.io/coreos/hyperkube:v1.4.0_coreos.0
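A quick way to bump those on each master (a sketch; replace the target tag with whatever release you are upgrading to). The kubelet watches this directory and restarts the static pods when the files change:

# Sketch: rewrite the hyperkube image tag in every static manifest.
sudo sed -i 's/hyperkube:v[^ ]*/hyperkube:v1.4.1_coreos.0/' /etc/kubernetes/manifests/*.yml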

I'm looking into ways to automate this. It hasn't been a priority since the procedure is fairly straightforward. Note that running pods should continue to run during this procedure.

@wellsie wellsie self-assigned this Oct 12, 2016
@nkhine
Contributor

nkhine commented Oct 21, 2016

adambom added a commit to adambom/tack that referenced this issue Oct 24, 2016
If you ever lose your etcd cluster for whatever reason, or if you should ever need to restart it, you should be able to recover your state. Mentioned in this issue: kz8s#75
@rimusz

rimusz commented Oct 26, 2016

@wellsie any update on the automated Kubernetes upgrade? It is fine to do those ^^^ commands manually if you have a small cluster, but with a big one it would be a headache :)

@rimusz

rimusz commented Oct 26, 2016

OK, I have checked: if you update /etc/systemd/system/kubelet.service with the newer k8s version, the change does not survive a reboot. :(

@yagonobre
Contributor

@rimusz, that is because tack uses user-data, which runs every time the machine powers up.
You can stop the instance, edit the version in the user-data, and then start the instance.

I replaced user-data with cloud-init in my environment; if everything works fine I will submit a PR.
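For reference, a sketch of that flow with the AWS CLI (the instance ID is a placeholder; repeat per node, one at a time):

aws ec2 stop-instances --instance-ids i-0123456789abcdef0
aws ec2 wait instance-stopped --instance-ids i-0123456789abcdef0
# user-data comes back base64-encoded; decode it to find the k8s version
aws ec2 describe-instance-attribute --instance-id i-0123456789abcdef0 \
  --attribute userData --query 'UserData.Value' --output text | base64 --decode
# Edit the version, re-upload it (via the console, or with
# aws ec2 modify-instance-attribute; check its docs for the expected encoding),
# then start the instance again:
aws ec2 start-instances --instance-ids i-0123456789abcdef0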

@yagonobre
Contributor

You can use this procedure:

Update worker nodes

  1. Create a new launch configuration: you can clone the existing LC and edit the Kubernetes version in the user-data (there are 2 occurrences).
  2. Terminate all instances and create new ones with the new LC (be sure that you have no persistent volumes, e.g. databases, and that your pods are replicated).
    1. Detach the instances one at a time from the ASG, check the box to create a replacement instance, confirm with kubectl get nodes that the new node is running, then terminate the node you detached from the ASG (see the CLI sketch after this list).
    2. Do this for all nodes.
  • Updating the user-data for each instance is an alternative.
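A sketch of the detach step with the AWS CLI (the ASG name and instance ID are placeholders):

# Detach one worker and let the ASG launch a replacement.
aws autoscaling detach-instances \
  --auto-scaling-group-name my-worker-asg \
  --instance-ids i-0123456789abcdef0 \
  --no-should-decrement-desired-capacity
# Wait until the replacement shows up in kubectl get nodes, then:
aws ec2 terminate-instances --instance-ids i-0123456789abcdef0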

Update master nodes

  1. Update the Kubernetes manifests in the S3 bucket
    1. Download the tar file

      aws s3 cp s3://[BUCKET-URL]/manifests/etc.tar .
      tar -xvf etc.tar
      
    2. Edit the k8s version in all files

      grep 1.4 *.yml
      kube-apiserver.yml:    image: quay.io/coreos/hyperkube:v1.4.0_coreos.0
      kube-controller-manager.yml:    image: quay.io/coreos/hyperkube:v1.4.0_coreos.0
      kube-proxy.yml:    image: quay.io/coreos/hyperkube:v1.4.0_coreos.0
      kube-scheduler.yml:    image: quay.io/coreos/hyperkube:v1.4.0_coreos.0
      
    3. Compress and upload the file back to S3

      tar -cvf etc.tar *.yml
      aws s3 cp etc.tar s3://[BUCKET-URL]/manifests/etc.tar
      
  2. Update user-data for each node
    1. You need to stop the instances one at a time to edit the k8s version in the user-data (be sure not to stop more than one instance at a time).
    2. Start the instance.
    3. Check the health of the etcd cluster with etcdctl cluster-health; if all nodes are healthy, move on to the next instance (see the quick check below).
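As a quick sanity check between instances (a sketch; run it from a master node or anywhere with access to the etcd endpoints):

etcdctl cluster-health
kubectl get nodes
kubectl get componentstatuses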

@wellsie, please validate this.

@rimusz

rimusz commented Nov 7, 2016

@yagonobre thanks for your solution. It looks good, but it involves way too much manual fiddling, especially with the user-data for each instance; far too much hassle for production clusters.
I found that using global fleet units for the k8s services is a much better way to do k8s upgrades.

@rokka-n

rokka-n commented Jan 14, 2017

Why is it not possible to replace an etcd node and let it re-sync with the cluster?

@yagonobre
Contributor

@rokka-n That is what I do.

@fearphage

Are you open to incorporating automated Kubernetes upgrades? If not, is the purpose of this project a one-time setup, after which you don't need the project anymore?

@wellsie
Member

wellsie commented Feb 3, 2017 via email

@fearphage

Yes open to automated upgrades

Excellent! However, without automated upgrades, is this intended to be a single-use project?
