Consider all possible cluster states before passing them to StateChangeConf #1114

furkatgofurov7 · 2023-05-04T11:26:24Z

Issue:

Problem

When importing an EKS cluster with the Terraform provider, the Rancher cluster reconciliation loop can sometimes move the cluster from "pending" to "active" so quickly that the provider misses the transition. The provider then keeps waiting for the cluster to reach the "pending" state, even though the cluster passed that state long before.

Solution

Instead of predefining expectedState and overwriting it later if needed, define it as a slice literal that keeps the same initial state ("active"), append further states when needed (i.e. if cluster.Driver == clusterDriverImported || (cluster.Driver == clusterDriverEKSV2 && cluster.EKSConfig.Imported)), and pass the slice to the StateChangeConf struct. StateChangeConf already expects Target to be a []string, so behaviour is otherwise unchanged; all possible cluster states are simply considered before they are passed in.
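The change described above can be sketched roughly as follows. This is a minimal, self-contained illustration and not the provider's actual code: the `cluster` and `eksConfig` types, the constant values, and the `targetStates` helper are hypothetical stand-ins, and the appended state is assumed to be "pending" because that is the state named in the error message of the linked issue.

```go
package main

import (
	"fmt"
	"strings"
)

// Placeholder values; the real constants live in the provider and may differ.
const (
	clusterDriverImported = "imported"
	clusterDriverEKSV2    = "EKS"
)

// Hypothetical, trimmed-down stand-ins for the provider's cluster types.
type eksConfig struct {
	Imported bool
}

type cluster struct {
	Driver    string
	EKSConfig *eksConfig
}

// targetStates builds the slice that would be handed to the Target field
// of StateChangeConf. "active" is always accepted; "pending" (an assumption
// here) is appended for imported clusters instead of replacing "active".
func targetStates(c cluster) []string {
	expectedState := []string{"active"}
	if c.Driver == clusterDriverImported ||
		(c.Driver == clusterDriverEKSV2 && c.EKSConfig != nil && c.EKSConfig.Imported) {
		expectedState = append(expectedState, "pending")
	}
	return expectedState
}

func main() {
	imported := cluster{Driver: clusterDriverEKSV2, EKSConfig: &eksConfig{Imported: true}}
	fmt.Println(strings.Join(targetStates(imported), ","))
}
```

Because the slice always contains "active", a cluster that has already left "pending" by the time the provider first polls it would still match one of the target states.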


mjura

LGTM

a-blender

@furkatgofurov7 I see that you were unable to repro this bug after multiple attempts (#1003 (comment)), and the original author of the issue saw a 30 sec timeout in terraform apply but the cluster still became active in about a minute. This was with the latest terraform v3.0.0.

Is it still an issue for them?
What evidence is there that not predefining the expectedState in the provider fixes this particular issue? I'm fine to have it go in as a noted enhancement but I don't see how this explicitly resolves Intermittently imports of EKS clusters never finish #1003.

furkatgofurov7 · 2023-05-19T06:13:26Z

Is it still an issue for them?

Hey @a-blender, thanks for the review. As per #1003 (comment), it should still be an issue.

What evidence is there that not predefining the expectedState in the provider fixes this particular issue? I'm fine to have it go in as a noted enhancement but I don't see how this explicitly resolves Intermittently imports of EKS clusters never finish #1003.

Based on the logs from the linked issue description:

Error: [ERROR] waiting for cluster (c-xfbkg) to be created: timeout while waiting for state to become 'pending' (last state: 'active', timeout: 30m0s)

and this comment, considering all possible cluster states should not hurt, and if the provider fails to catch a Rancher cluster state it will still be able to move on.
Also, I forgot to mention that the code where expectedState is predefined appears only in this case. In all other cases, e.g. clusterUpdate, the target state is a slice with a bunch of different states, so this change also aligns with that.
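To illustrate the failure mode in the error above, here is a small simulation. It is only a sketch: `waitForState` is a hypothetical, simplified stand-in for a state-polling loop (no real timing, no Rancher API calls), and the observed state sequence is made up to represent a provider whose first poll happens after the cluster has already left "pending".

```go
package main

import "fmt"

// waitForState walks through a sequence of observed states and returns the
// first one that matches any target. If none matches, it reports the last
// state seen, similar in spirit to a timeout error.
func waitForState(observed, targets []string) (string, error) {
	last := ""
	for _, state := range observed {
		last = state
		for _, target := range targets {
			if state == target {
				return state, nil
			}
		}
	}
	return last, fmt.Errorf("timeout while waiting for state to become %v (last state: %q)", targets, last)
}

func main() {
	// The cluster went from "pending" to "active" before the first poll.
	observed := []string{"active", "active", "active"}

	// Single predefined target: the wait never succeeds.
	_, err := waitForState(observed, []string{"pending"})
	fmt.Println("single target error:", err)

	// All possible states considered: the wait succeeds on the first poll.
	state, err := waitForState(observed, []string{"active", "pending"})
	fmt.Println("slice of targets:", state, err)
}
```

With only "pending" as the target, every poll sees "active" and the wait can never complete; with both states in the slice, the same observations are accepted.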

Since reproduction was not possible, maybe we should take this in as an improvement to the provider, WDYT?

furkatgofurov7 · 2023-06-06T09:25:14Z

@a-blender can we merge this? Based on #1003 (comment), it helps to fix the issue in #1003.

Consider all possible cluster states before passing them to StateChan…

f23fa72

…geConf

This was referenced May 4, 2023

(SURE-5616) Intermittently imports of EKS clusters never finish/or finish with Error rancher/eks-operator#84

Closed

Intermittently imports of EKS clusters never finish #1003

Closed

furkatgofurov7 requested a review from a-blender May 4, 2023 14:26

richardcase approved these changes May 5, 2023


mjura approved these changes May 9, 2023


furkatgofurov7 self-assigned this May 16, 2023

a-blender suggested changes May 17, 2023


mbologna requested a review from a-blender May 23, 2023 10:25

furkatgofurov7 merged commit 20adef8 into rancher:master Jul 11, 2023
1 check passed

furkatgofurov7 deleted the fix-cluster-create-target-state-logic branch July 11, 2023 10:12

kkaempf added the area/terraform label Jul 18, 2023
