
[Bug] Saving unchanged cluster leads to update status and cluster changes #6881

Closed
rmweir opened this issue Sep 13, 2022 · 20 comments · Fixed by #7140 or #7218

Comments

@rmweir

rmweir commented Sep 13, 2022

Setup

  • Rancher version:
    2.6.6 and 2.6.7 backend pointing at latest UI for dev.

Describe the bug

Saving an unchanged cluster leads to an update status and cluster changes.

To Reproduce

  1. Create a cluster.
  2. Wait for the cluster to become active.
  3. Edit the cluster.
  4. Switch to YAML (I think this is actually optional).
  5. Hit save.

Result
Fields are added and the cluster goes into updating.

Expected Result

The cluster should remain unchanged and should not go into updating.

Additional context

The diff between cluster before save and after save:
<  generation: 27
---
>  generation: 28
25c25
<  resourceVersion: "114966"
---
>  resourceVersion: "116054"
28c28,29
<  agentImageOverride: ""
---
>  agentEnvVars: []
>  agentImageOverride: null
30,32c31,36
<  description: ""
<  desiredAgentImage: ""
<  desiredAuthImage: ""
---
>  clusterTemplateRevisionName: null
>  defaultClusterRoleForProjectMembers: null
>  defaultPodSecurityPolicyTemplateName: null
>  description: null
>  desiredAgentImage: null
>  desiredAuthImage: null
50c54,57
<    nodelocal: {}
---
>    linearAutoscalerParams: {}
>    nodelocal:
>     updateStrategy: {}
>    updateStrategy: {}
98c105,113
<  scheduledClusterScan: {}
---
>  scheduledClusterScan:
>   enabled: false
>   scanConfig:
>    cisScanConfig:
>     overrideBenchmarkVersion: rke-cis-1.5
>     profile: permissive
>   scheduleConfig:
>    cronSchedule: 0 0 * * *
>    retention: 24
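For illustration only (values assumed, not copied from the cluster above): the diff boils down to fields flipping between empty values and nulls or populated defaults. Those are different values as far as the apiserver is concerned, so the save is a real spec change and the generation bumps:

// Sketch: re-submitting "" as null, or {} as a populated default, is not a no-op.
const before = { desiredAgentImage: '', scheduledClusterScan: {} };
const after  = { desiredAgentImage: null, scheduledClusterScan: { enabled: false } };

console.log(JSON.stringify(before) === JSON.stringify(after)); // false -> generation bumps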
rmweir added this to the v2.6.9 milestone Sep 13, 2022
@cbron
Contributor

cbron commented Sep 13, 2022

Potentially a blocker, adding label to discuss.

@catherineluse
Contributor

Just reproduced it. Also noticed the same thing happens if I save the cluster from the edit config form without switching to YAML.

@catherineluse
Contributor

catherineluse commented Sep 14, 2022

@rmweir When I go to edit a cluster, click Edit as YAML, and click Show Diff, I see these changes to hostNamePrefix. This is for a 4-node cluster with two node pools:

[Screenshot: Show Diff output with the changes to hostNamePrefix]

And this is the diff for the initial save of a single-node cluster:

[Screenshot: diff for the initial save of a single-node cluster]

Should the UI wait to apply that change until after the user has changed something? Or is the problem that these values were not appropriately applied when the cluster was first provisioned?

After saving the cluster that way, if I save it again, it doesn't add more values or put the cluster in an updating state, because the values are only added if they don't already exist.

zube bot added the team/area2 Hostbusters label Sep 15, 2022
zube bot added the QA/XS label Sep 15, 2022
@catherineluse
Contributor

catherineluse commented Sep 16, 2022

Gathered a little more information:

// Node pools get a default hostnamePrefix if one isn't already set:
if ( !entry.pool.hostnamePrefix ) {
  entry.pool.hostnamePrefix = `${ prefix }-`;
}

// The edit component writes a default localClusterAuthEndpoint into the spec when it loads:
data() {
  if ( isEmpty(this.value?.spec?.localClusterAuthEndpoint) ) {
    set(this.value, 'spec.localClusterAuthEndpoint', {
      enabled: false,
      caCerts: '',
      fqdn:    '',
    });
  }
}

// Defaults applied for the drain options:
const DEFAULTS = {
  deleteEmptyDirData:              false, // Show; Kill pods using emptyDir volumes and lose the data
  disableEviction:                 false, // Hide; false = evict pods, true = delete pods
  enabled:                         false, // Show; true = Nodes must be drained before upgrade; false = YOLO
  force:                           false, // Show; true = Delete standalone pods, false = fail if there are any
  gracePeriod:                     -1, // Show; Pod shut down time, negative value uses pod default
  ignoreDaemonSets:                true, // Hide; true = work, false = never work because there's always daemonSets
  ignoreErrors:                    false, // Hide; profit?
  skipWaitForDeleteTimeoutSeconds: 0, // Hide; If the pod deletion time is older than this > 0, don't wait, for some reason
  timeout:                         120, // Show; Give up after this many seconds
};
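
A minimal sketch (simplified, with a hypothetical object rather than the actual component) of why defaulting-on-load like the data() hook above turns a no-op save into a spec change:

// The edit view mutates the spec it was handed, so even an untouched cluster
// differs from what the server has by the time Save is clicked.
const cluster = { spec: {} }; // as returned by the API

function onEditOpened(value) {
  // mirrors the data() hook quoted above
  if ( !value.spec.localClusterAuthEndpoint ) {
    value.spec.localClusterAuthEndpoint = { enabled: false, caCerts: '', fqdn: '' };
  }
}

onEditOpened(cluster);
console.log(JSON.stringify(cluster.spec)); // no longer "{}", so saving sends an update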

@gaktive
Member

gaktive commented Sep 20, 2022

Taking the blocker label off but leaving this as priority/0. This has been around for a few versions, so alas it's not a true regression, but it's still weird.

@rmweir
Author

rmweir commented Sep 22, 2022

@catherineluse after waiting a while the fields get overwritten and saving does the same thing again. I'm not sure; we would have to compare against the behavior before this started happening. I think the UI should not be changing anything about the cluster; if there is an empty/nil value, it should be left as-is.
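
One way to get that behavior would be to compare the edited spec against the original and skip the request when nothing changed; this is a sketch only, assuming lodash's isEqual and an illustrative save() method, not the dashboard's actual code:

import isEqual from 'lodash/isEqual';

// Sketch: leave empty/nil values alone and skip no-op saves entirely.
function saveCluster(original, edited) {
  if ( isEqual(original.spec, edited.spec) ) {
    return Promise.resolve(original); // nothing changed: don't send an update
  }

  return edited.save(); // assumed model method; name is illustrative
}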

@catherineluse
Contributor

@rmweir What's the Rancher version where you remember it working properly?

@rmweir
Author

rmweir commented Sep 22, 2022

@catherineluse I don't know which Rancher version worked properly. I just know the issue is present in 2.6.6 and later.

@catherineluse
Contributor

catherineluse commented Sep 23, 2022

Here's the behavior for editing an RKE2 cluster in 2.6.3:
[Screenshot: edit diff for an RKE2 cluster in 2.6.3]

and 2.6.0:
[Screenshot: edit diff in 2.6.0]

I couldn't reproduce it for RKE1.

Also, after saving the RKE2 cluster, it sometimes went into updating mode when saving from the form view, but not from the YAML view if the YAML diff showed no changes.

gaktive modified the milestones: v2.6.9, v2.7.0 Sep 27, 2022
@gaktive
Member

gaktive commented Oct 4, 2022

@catherineluse indicated that the backend team couldn't reproduce this consistently, though they do see some issue. They've pushed their corresponding issue to 2.7.1, so we'll do the same.

gaktive removed the priority/1 label Oct 4, 2022
@mantis-toboggan-md
Member

Sure, I can look, @thaneunsoo; did you see this on RKE1, RKE2, and K3s?

@thaneunsoo

@mantis-toboggan-md yes, for RKE1, RKE2, and K3s.

@mantis-toboggan-md
Member

mantis-toboggan-md commented Oct 14, 2022

Interesting: RKE1 provisioning is done through the old UI, so either the same problem exists in both UIs or there is a separate backend problem with editing without changing anything. I'm not seeing the behaviour you are on my own setup, but it is older; I'm making a fresh one and looking into this further now.

@mantis-toboggan-md
Member

@thaneunsoo I was able to reproduce this behavior with RKE2 and K3s; a fix is now merged: #7218

What I saw specifically was that an RKE2 or K3s cluster would go into an updating state after being saved from a different view than it had previously been saved in. For example: provision a cluster without hitting 'view as yaml', then go to 'edit config', click 'view as yaml', and save; the cluster updates despite you not changing anything. Go to 'edit config' again, click 'view as yaml', and save; the cluster does not change state. Then go to 'edit config' and save without clicking 'view as yaml'; the cluster goes into updating again... you get the idea.

I'm having a harder time reproducing this consistently with RKE1. Could we file a separate issue for that, @thaneunsoo?
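
For illustration (shapes assumed, not taken from the code): if the form view and the YAML view serialize defaults differently, alternating between them always produces a "change" even though the user edited nothing:

// Sketch: two serializations of the same untouched cluster.
const savedFromForm = { agentEnvVars: [], scheduledClusterScan: { enabled: false } };
const savedFromYaml = { scheduledClusterScan: {} };

// false, so the next save from the other view puts the cluster into Updating
console.log(JSON.stringify(savedFromForm) === JSON.stringify(savedFromYaml));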

@thaneunsoo

Test Environment:

Rancher version: v2.7-head 9f1e043
Rancher cluster type: HA
Docker version: 20.10

Downstream cluster type: RKE2/K3s


Testing:

  1. Provision RKE2/K3s cluster
  2. Wait for cluster to become active
    a. Click Edit Config > Save
    b. Click Edit Config > View as YAML > Save

Results
Clusters do not go into an Updating state. I did notice that the first time I performed either step 2a or 2b, the cluster would go into In Progress for a split second and then return to Active; if the user took their eyes off the screen for a second, they would miss it. This seems to be okay, since an edit is in progress. Closing this issue as fixed.

zube bot closed this as completed Oct 19, 2022
@snasovich
Contributor

@gaktive @nwmac, we will need this fixed in 2.6.10 as well. I tried to create a backport, but it looks like the backport/forwardport bot is not available in this repo.
Could you please create a backport to ensure this is fixed in 2.6.10?
cc: @sowmyav27
