STOR-1049: Add topology aware sc #121

gnufied · 2022-12-06T20:25:43Z

Fixes https://issues.redhat.com/browse/STOR-1049

gnufied · 2022-12-06T20:26:39Z

pkg/operator/storageclasscontroller/vmware.go

rvanderp3 · 2022-12-06T20:36:50Z

pkg/operator/storageclasscontroller/vmware.go

+		return nil, fmt.Errorf("failed to access datacenter %s: %s", dcName, err)
+	}
+
+	finder = find.NewFinder(vmClient.Client, false)


Do we need a new finder here?

openshift-ci · 2022-12-06T20:41:33Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: gnufied

Needs approval from an approver in each of these files:

~~OWNERS~~ [gnufied]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

gnufied · 2022-12-08T14:29:19Z

/retest

openshift-ci · 2022-12-08T16:19:24Z

@gnufied: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

| Test name | Commit | Details | Required | Rerun command |
| --- | --- | --- | --- | --- |
| ci/prow/e2e-vsphere-csi | `af44173` | link | true | `/test e2e-vsphere-csi` |
| ci/prow/e2e-vsphere-csi-migration | `af44173` | link | false | `/test e2e-vsphere-csi-migration` |

pkg/operator/storageclasscontroller/vmware.go

bertinatto · 2022-12-09T13:56:37Z

pkg/operator/storageclasscontroller/vmware.go

+		err = v.createOrUpdateTag(ctx, ds)
+		if err != nil {
+			return v.policyName, fmt.Errorf("error creating or updating tag %s: %v", v.tagName, err)
+		}


Instead of stopping the execution, should it try the next one?

I think it might be better to hard error out, rather than continuing with tagging whatever datastores we could access, because that could result in a storagepolicy which has unpredictable behaviour.

My point was: say you have 5 failureDomains/datastores, you successfully tag 3 of them, and the 4th fails. It sounds like it's better to either:

Try tagging all of them and return an aggregated error

Return early in case of error, but don't leave behind tagged datastores

Okay, I have aggregated the errors and returned them. I think that is a good idea when we have multiple datastores that we cannot access: rather than returning one error at a time, we return an error covering all inaccessible datastores at once.

However, I have pushed that change to #125, which also includes #121.

bertinatto

Just a comment related to error aggregation, otherwise LGTM (not tagging to give @jsafrane a chance to review).

jsafrane · 2022-12-13T13:24:35Z

pkg/operator/storageclasscontroller/vmware.go

+	vSphereInfraConfig := v.infra.Spec.PlatformSpec.VSphere
+	if vSphereInfraConfig != nil && len(vSphereInfraConfig.FailureDomains) > 1 {
+		return v.createZonalStoragePolicy(ctx)
+	}
+


createZonalStoragePolicy and the rest of createStoragePolicy below are almost the same; createZonalStoragePolicy only loops over and tags more datastores. Would it be possible to join them together somehow? For example, getZonalDatastores() + getDefaultDatastore(), and then a common tagging loop + createStorageProfile over the result.

Okay, I have unified that code, but I have pushed my new changes to https://github.com/openshift/vmware-vsphere-csi-driver-operator/pull/125/files#diff-cd720ec01eefb8566ee378bc53aef398da5550c725aeacd65aba7efb2b39e311R146

That branch already includes the commits from this branch, has more tests, and ensures tags are recreated if they are deleted.

jsafrane · 2022-12-13T13:26:57Z

pkg/operator/utils/topology.go

+	if vSpherePlatformConfig != nil {
+		failureDomains := vSpherePlatformConfig.FailureDomains
+		if len(failureDomains) > 0 {
+			return []string{defaultOpenshiftZoneCategory, defaultOpenshiftRegionCategory}


I'd prefer a warning when both infra and clusterCSIDriver specify topology. Maybe with a metric + alert, but that probably does not belong to this function.

ack. I can file this as a story attached to the same epic in jira.

https://issues.redhat.com/browse/STOR-1123

gnufied · 2022-12-13T16:18:33Z

I am going to close this in favour of #125

/close

openshift-ci · 2022-12-13T16:18:45Z

@gnufied: Closed this PR.


Bump openshift/api type

4059e91

jcpowermac reviewed Dec 6, 2022

pkg/operator/storageclasscontroller/vmware.go

rvanderp3 reviewed Dec 6, 2022

openshift-ci bot requested review from bertinatto and jsafrane December 6, 2022 20:41

openshift-ci bot added the approved label Dec 6, 2022

Add code for creating zonal policies and SCs

af44173

gnufied force-pushed the add-topology-aware-sc branch from 889a905 to af44173 Compare December 6, 2022 21:34

gnufied mentioned this pull request Dec 8, 2022

STOR-1121:Make policy creation idempotent #125

Merged

bertinatto reviewed Dec 9, 2022

jsafrane reviewed Dec 13, 2022

openshift-ci bot closed this Dec 13, 2022
