Unable to scale an RKE1 cluster from 1 to 3 etcd nodes or nodes with etcd role #43356
Validated a few use cases on 2.8-head:
After a lot of internal conversations on this issue, the TL;DR is that it was reproduced on multiple RKE1 versions with Rancher versions going back to at least 2.7.5 (and it likely applies to much earlier versions). As such, it's no longer considered a release blocker for 2.8.0, since we're past code freeze for that release. The engineering team is now focused on coming up with recommendations for both preventative and reactive workarounds for this issue and release-noting them. FYI @Jono-SUSE-Rancher
Issue Affected Versions Root cause v2:
v3:
Workaround
Cluster stuck in waiting state:
Note: etcd restore doesn't work as a workaround. Restarting Rancher is required to terminate the hung request, after which adding nodes one by one works.
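The "add nodes one by one" part of the workaround can be sketched roughly as below. This is only an illustration of the ordering, not Rancher or RKE code: `add_node` and `wait_until_active` are hypothetical callbacks that the caller would have to implement (for example, by registering a node and then polling the cluster state by whatever means is available).

```python
from typing import Callable, Iterable, List


def add_nodes_one_by_one(
    nodes: Iterable[str],
    add_node: Callable[[str], None],
    wait_until_active: Callable[[], bool],
) -> List[str]:
    """Add nodes sequentially, waiting for the cluster to settle after each.

    `add_node` and `wait_until_active` are hypothetical, caller-supplied
    callbacks; they are assumptions for this sketch, not Rancher/RKE APIs.
    Returns the nodes whose addition was followed by an active cluster.
    """
    confirmed: List[str] = []
    for node in nodes:
        add_node(node)
        if not wait_until_active():
            # Stop here rather than queueing more nodes behind a stuck cluster.
            break
        confirmed.append(node)
    return confirmed
```

The point of the design is simply that the next node is never submitted until the previous change has been confirmed, which mirrors the one-at-a-time scaling described in the note.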
Have a draft PR up, but I need to sync changes from the v1.5 branch to the v1.6 branch first before opening the final PR for the v1.6 branch, since this issue is for 2.9-Next. rancher/rke#3536
Rancher Server Setup
Information about the Cluster
- Node Setup: 1 node, all roles
User Information
Describe the bug
Cluster hangs when attempting to scale from 1 to 3 nodes.
To Reproduce
Result
Cluster hangs with nodes in the registering state (I left them in that state overnight).
Expected Result
Cluster scales properly with no issues.
Screenshots
Provisioning log for one of those downstream clusters:
Additional context
I was unable to reproduce this issue on a fresh Rancher install until I had created and deleted several clusters.