-
Notifications
You must be signed in to change notification settings - Fork 685
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support smoother k8s nodes rotation when using local volumes #2806
Comments
I am not sure whether this is helpful at all because it's at such a high level, but I think the ECK operator could watch the ES data nodes and it's corresponding kubernetes nodes. In the moment where let's say Does that make sense or am I missing something? |
this simple script seems work, but not proven in prod. kubectl cordon k8s-node-abc
kubectl delete pvc -es-xxx --force --grace-period=0
kubectl drain k8s-node-abc --delete-local-data --ignore-daemonsets
kubectl uncordon k8s-node-abc |
Relates to #2448. |
We've run into this exact issue two times now. When we try to upgrade the k8s version in our nodepool, we lose all our data and the cluster goes into a completely broken state. I don't know how it works with other providers, but I can speak for GKE. We have a cluster with 3 nodes and an index with 2 shards and 1 replica per shard What I believe happens is the following:
The logs from GKE show that almost exactly one hour passes between each node teardown. |
Hello, Here's our approach to upgrade k8s on local storage node groups:
|
When using local volumes, it can be quite complicated to handle Kubernetes nodes upgrades.
One common way to upgrade a k8s node is to take it out of the cluster, and replace it with a fresh new one. In which case the local volume is lost, and the corresponding Elasticsearch Pod stays Pending forever.
When that happens, the only way out is to manually remove both Pod and PVC, so a new Pod gets created with a new volume.
In an ideal world to simplify this, we would like to:
Related discuss issue: https://discuss.elastic.co/t/does-eck-support-local-persistent-disks-and-is-it-a-good-idea/223515/3
The text was updated successfully, but these errors were encountered: