"Auto Replace" option support for RKE2 machine pools #4449

snasovich · 2021-10-26T02:00:07Z

Detailed Description
For RKE1-provisioned clusters, there is currently an option to specify the threshold on how many minutes a node can be unreachable in a node pool before it's automatically replaced:

For RKE2-provisioned clusters, we want the same functionality to be available, probably by adding a checkbox somewhere on the base pool details section highlighted below:

Context
This is needed for RKE2 provisioning parity with RKE1

Additional Details
Backend support is needed: rancher/rancher#35275

paynejacob · 2022-01-21T16:18:37Z

ref rancher/rancher#35275

paynejacob · 2022-01-21T16:24:49Z

See rancher/rancher#35916 (comment) for implementation

Support for self healing node pools was added with the pr above. The following fields were added to support this.

related: https://cluster-api.sigs.k8s.io/tasks/healthcheck.html

NodeStartupTimeout *metav1.Duration (Duration of time the node is given to initially become ready before it is replaced.)
UnhealthyNodeTimeout *metav1.Duration (How long a node can be unhealthy before it is replaced)
MaxUnhealthy *intstr.IntOrString (The maximum number of nodes that can be un healthy in a pool, if this is exceeded no nodes are replaced)
UnhealthyRange *string (A range of nodes that can be unhealthy and replaced see https://cluster-api.sigs.k8s.io/tasks/healthcheck.html#unhealthy-range for formatting)

gaktive · 2022-01-25T22:56:37Z

Thanks @paynejacob -- does this mean that UI can being work on this as QA tests the backend?

paynejacob · 2022-01-25T23:19:17Z

@gaktive yes qa can test backend and the ui work can start. Let me know if you have any questions about the api.

catherineluse · 2022-02-04T02:23:22Z

@paynejacob It looks like the UI just needs to include the UnhealthyNodeTimeout property when creating the node pool. That isn't case sensitive, right? I noticed that we don't capitalize the other properties that we send when creating the node pool. https://github.com/rancher/rancher/pull/35916/files#diff-e05181e81036cad4014bb4843ea91837fa7dd238dc3434bc4fcc598c56db6428R26

And just to confirm, should we still take the unit from the user in minutes, then convert it to seconds as the Ember UI did?

paynejacob · 2022-02-04T16:32:14Z

@catherineluse it looks like I accidently capitalized it, I will have a pr up soon to make it lowercase.

paynejacob · 2022-02-04T18:31:27Z

@catherineluse fixed

Auston-Ivison-Suse · 2022-02-22T21:31:00Z

Setup For Feature Testing
Rancher Version: v2.6-head(e55a04c)

**Steps for Reproduction: **

Start creating a node driver rke2 cluster
go to the advanced settings and you will see the option in the screenshot below:

Result
Was able to successfully view the noted amount post provisioning.

Sanity Checks

successfully provisioned cluster
successfully edited the yaml to change the values before provisioning.
successfully edited the values post provisioning via ui
successfully edited the values post provisioning via yaml

jtravee · 2022-03-16T22:25:04Z

Confirmed with @catherineluse and @gaktive to add release note label.

snasovich added status/waiting-backend area/rke2 labels Oct 26, 2021

snasovich added this to the v2.6.3 milestone Oct 26, 2021

nwmac modified the milestones: v2.6.3, v2.6.4 Nov 12, 2021

nwmac added the [zube]: To Triage label Nov 12, 2021

gaktive added the area/clusterprovisioningv2 label Dec 3, 2021

gaktive mentioned this issue Dec 3, 2021

EPIC: Cluster Provisioning v2 GA #3346

Closed

31 tasks

gaktive added the status/release-blocker label Dec 13, 2021

Sahota1225 mentioned this issue Jan 5, 2022

[Group] RKE2 Provisioning parity work for RKE2 Provisioning GA rancher/rancher#36044

Closed

24 tasks

gaktive added [zube]: Backend Blocked and removed [zube]: To Triage labels Jan 11, 2022

gaktive added [zube]: Backlog and removed [zube]: Backend Blocked status/waiting-backend labels Jan 25, 2022

Sahota1225 added the team/area2 Hostbusters label Jan 26, 2022

nwmac assigned catherineluse Feb 1, 2022

nwmac added [zube]: Next Up and removed [zube]: Backlog labels Feb 1, 2022

catherineluse added [zube]: Working and removed [zube]: Next Up labels Feb 1, 2022

catherineluse mentioned this issue Feb 4, 2022

Add auto-replace option for K3s/RKE2 machine pools #5045

Merged

catherineluse added [zube]: Review and removed [zube]: Working labels Feb 4, 2022

paynejacob mentioned this issue Feb 4, 2022

fixed unhealthyNodeTimeout naming rancher/rancher#36397

Merged

catherineluse added [zube]: Working and removed [zube]: Review labels Feb 4, 2022

catherineluse added [zube]: Review and removed [zube]: Working labels Feb 4, 2022

slickwarren assigned Auston-Ivison-Suse Feb 9, 2022

catherineluse added [zube]: To Test and removed [zube]: Review labels Feb 17, 2022

Auston-Ivison-Suse added [zube]: QA Working and removed [zube]: To Test labels Feb 18, 2022

Auston-Ivison-Suse closed this as completed Feb 22, 2022

Auston-Ivison-Suse added [zube]: Done and removed [zube]: QA Working labels Feb 22, 2022

jtravee added the release-note label Mar 16, 2022

zube bot removed the [zube]: Done label May 24, 2022

thsnielsen mentioned this issue Jul 6, 2022

auto replace unit is displayed in minutes instead of seconds when creating node driver rke2 cluster, advanced settings #6275

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

"Auto Replace" option support for RKE2 machine pools #4449

"Auto Replace" option support for RKE2 machine pools #4449

snasovich commented Oct 26, 2021

paynejacob commented Jan 21, 2022

paynejacob commented Jan 21, 2022

gaktive commented Jan 25, 2022

paynejacob commented Jan 25, 2022

catherineluse commented Feb 4, 2022 •

edited

Loading

paynejacob commented Feb 4, 2022

paynejacob commented Feb 4, 2022

Auston-Ivison-Suse commented Feb 22, 2022

jtravee commented Mar 16, 2022

"Auto Replace" option support for RKE2 machine pools #4449

"Auto Replace" option support for RKE2 machine pools #4449

Comments

snasovich commented Oct 26, 2021

paynejacob commented Jan 21, 2022

paynejacob commented Jan 21, 2022

gaktive commented Jan 25, 2022

paynejacob commented Jan 25, 2022

catherineluse commented Feb 4, 2022 • edited Loading

paynejacob commented Feb 4, 2022

paynejacob commented Feb 4, 2022

Auston-Ivison-Suse commented Feb 22, 2022

jtravee commented Mar 16, 2022

catherineluse commented Feb 4, 2022 •

edited

Loading