New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
upgrade_strategy.timeout on upgraded Rancher clusters sets to 0 instead of 120 #27333
Comments
Reproduced on a fresh install of 2.4.5-rc6
|
Same issue occurs on 2.4.5 after the first succesful upgrade of a custom cluster. Doing it again (or simply choosing "edit" and then "save") results in the same error: |
This is a frustrating bug. As a workaround edit the cluster configuration as YAML file. Then find the section "upgrade_strategy" and edit "node_drain_input" -> "timeout: 0" to "timeout: 120". |
We saw this on v2.4.5 when trying to update to k8s v1.18.5 |
The workaround works well, it's probably just because the old data contains the wrong value. |
What appears to be happening is the backend seems to be setting timeout to 0 when we save changes with these two settings:
I put in a stopgap from the frontend to resolve this but we should ultimately resolve this from the backend so the API users don't run into this. |
Verified on 2.4-head - commit id:
Expected:
|
Turns out that the min value that the backend accepts won't allow upgrades to complete. This switches the value to the default value to mitigate that issue. rancher/rancher#27333
Turns out that the min value that the backend accepts won't allow upgrades to complete. This switches the value to the default value to mitigate that issue. rancher/rancher#27333
Verified on 2.4-head - commit id: 3e543f7, ui tag:
On master-head commit id:
|
If the appliedSpec is present it will be validated along with the rest of the model. Unfortunately the backend is sometimes saving invalid models which causes this validation to fail. We shouldn't be modifying or sending this appliedSpec so I'm removing it. rancher/rancher#27333 (comment)
Unfortunately the backend is sometimes saving invalid models which causes the validation of appliedSpec to fail. To avoid this validation we're not ignoring the appliedSpec where this can go wrong. rancher/rancher#27333 (comment)
Another way to reproduce the issue on an upgraded setup:
On master-head - commit id:
Upgrade from 2.3.6 to 2.4-head commit id:
|
What kind of request is this (question/bug/enhancement/feature request):
bug
Steps to reproduce (least amount of steps as possible):
Result:
The UI will show the following error:
To fix, edit the cluster yaml and set the upgrade_strategy.timeout to something other than 0, like 120. The UI allows you to save then.
gzrancher/rancher#11317
The text was updated successfully, but these errors were encountered: