🌱 make: add non-kind shared and sharded e2e #2265

stevekuznetsov · 2022-10-26T16:20:37Z

test/e2e: make sure tests actually run in parallel

Signed-off-by: Steve Kuznetsov <skuznets@redhat.com>

test/e2e: be consistent about timeouts and poll intervals

Signed-off-by: Steve Kuznetsov <skuznets@redhat.com>

test/e2e/workspacetype: poll, don't blindly wait for 5s

Signed-off-by: Steve Kuznetsov <skuznets@redhat.com>

test/e2e/syncer: send one request, not many

You can't prove a negative, and eventually consistent systems don't like
to conform to your idea of "long enough" before synchronizing. We can't
assume that "nothing will happen" within 5s here. If we wanted to make
sure that the controller had seen & acknowledged the object, we should
have it post some observed generation.

Since these polls were not effective at best, and flaky at worst, we can
save 10s by just making one call.

Signed-off-by: Steve Kuznetsov <skuznets@redhat.com>

make: add non-kind variants of shared & sharded e2e

Signed-off-by: Steve Kuznetsov <skuznets@redhat.com>

make: be more intelligent with parallelism, count

First, we don't want to pass `-count 1` to `go test` unless someone
explicitly opts into this behavior, as this explicitly invalidates the
test cache. When re-running tests locally, being able to run the full
suite every time and use the cache is incredibly useful.

Second, we should only bother with limiting parallelism when we're
starting up a full `kcp` and `etcd` server per test case.

Signed-off-by: Steve Kuznetsov <skuznets@redhat.com>

/cc @jmprusi @davidfestal
/assign @ncdc @sttts
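
Editorial sketch of the rule in the last commit message: `-count` is passed only when someone opts in, and `-parallel` is passed only when a limit is wanted (one `kcp` and `etcd` per test case). The PR makes this change in the Makefile; the Go helper below is hypothetical and only illustrates the decision. The name `goTestArgs`, the package path, and the value `3` are assumptions. `-count` and `-parallel` are real `go test` flags.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// goTestArgs is a hypothetical helper, not code from this PR.
// count <= 0 means nobody opted in: -count is omitted, so the test cache stays usable.
// parallel <= 0 means no limit is wanted: -parallel is omitted and go test uses its default.
func goTestArgs(count, parallel int, pkgs ...string) []string {
	args := []string{"test"}
	if count > 0 {
		args = append(args, "-count", strconv.Itoa(count))
	}
	if parallel > 0 {
		args = append(args, "-parallel", strconv.Itoa(parallel))
	}
	return append(args, pkgs...)
}

func main() {
	// Shared server, no opt-in: cacheable run with default parallelism.
	fmt.Println("go", strings.Join(goTestArgs(0, 0, "./test/e2e/..."), " "))
	// One server per test case, caller opted into -count 1; the limit 3 is arbitrary.
	fmt.Println("go", strings.Join(goTestArgs(1, 3, "./test/e2e/..."), " "))
}
```

Leaving the flags out, rather than passing default values, is what keeps repeated local runs cacheable.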

nrb · 2022-10-26T16:48:50Z

/lgtm

nrb · 2022-10-26T16:58:41Z

Running `make test-e2e-shared-minimal` a second time on my machine (macOS 16.1 ARM) results in this error; deleting the whole `.kcp` directory fixes it.

KCP Error: cannot create the 'admin.kubeconfig` file with an empty token for the shard-admin user
error: kcp shard kcp terminated with exit code 1

stevekuznetsov · 2022-10-26T17:47:39Z

@nrb that's a pre-existing boo-boo for these targets

ncdc · 2022-10-26T18:00:01Z

test/e2e/syncer/syncer_test.go

-		}
-		require.NoError(t, err)
-		return false
-	}, 5*time.Second, time.Second, "upstream Deployment %s/%s got deleted or there was an error", upstreamNamespace.Name, upstreamDeployment.Name)


I think the intent here was to wait a ~short amount of time to make sure there weren't any unintentional/unexpected deletions that trickled in after a period of time.

stevekuznetsov · 2022-10-26

I wrote in the commit message why that is not a valid way to test this behavior.
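
Editorial sketch of the "observed generation" idea from the commit message: a test can only conclude "the object was left alone" once the controller has acknowledged the generation it is looking at. Before that, the absence of a change proves nothing. Everything here is hypothetical: the `deployment` type, its fields, and `judge` are illustrative names, and nothing in this PR says the syncer posts an observed generation today.

```go
package main

import "fmt"

// deployment is a hypothetical, stripped-down view of an upstream object.
type deployment struct {
	Generation         int64 // bumped on every spec change
	ObservedGeneration int64 // posted by the controller after it processes that generation
	Deleted            bool
}

type verdict string

const (
	undecided verdict = "undecided" // controller has not acknowledged the object yet
	survived  verdict = "survived"  // controller saw the object and left it in place
	deleted   verdict = "deleted"
)

// judge refuses to report "survived" until the controller has caught up,
// instead of assuming that some fixed wait was long enough.
func judge(d deployment) verdict {
	switch {
	case d.Deleted:
		return deleted
	case d.ObservedGeneration < d.Generation:
		return undecided
	default:
		return survived
	}
}

func main() {
	fmt.Println(judge(deployment{Generation: 2, ObservedGeneration: 1}))
	fmt.Println(judge(deployment{Generation: 2, ObservedGeneration: 2}))
	fmt.Println(judge(deployment{Generation: 2, ObservedGeneration: 2, Deleted: true}))
}
```

With a signal like this, a test polls until the verdict stops being `undecided`, so no guess about "long enough" is needed.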

ncdc · 2022-10-26T18:02:36Z

test/e2e/workspacetype/controller_test.go

@@ -287,10 +288,16 @@ func TestClusterWorkspaceTypes(t *testing.T) {
 				})

 				t.Logf("Expect workspace to be stuck in initializing phase")
-				time.Sleep(5 * time.Second)


This is another case where there's a desire to wait a ~short amount of time and make sure an event does not happen. Not sure how we want to approach those.

stevekuznetsov · 2022-10-26

It is not: we are waiting to see it in the initializing phase, no?

stevekuznetsov · 2022-10-26

My comment about proving negatives and assuming time-scales for eventually consistent systems applies to this case as well.
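
Editorial sketch of the pattern this thread is about: replacing a fixed `time.Sleep(5 * time.Second)` with a poll for the state the test expects to see. It uses only the standard library. `pollUntil` and the simulated `phase` function are hypothetical names, not the helpers used in `controller_test.go`.

```go
package main

import (
	"fmt"
	"time"
)

// pollUntil calls condition every interval until it returns true or timeout
// elapses. It reports whether the condition was met.
func pollUntil(condition func() bool, timeout, interval time.Duration) bool {
	deadline := time.Now().Add(timeout)
	for {
		if condition() {
			return true
		}
		if !time.Now().Before(deadline) {
			return false
		}
		time.Sleep(interval)
	}
}

func main() {
	// Hypothetical workspace whose phase is set asynchronously after 50ms.
	start := time.Now()
	phase := func() string {
		if time.Since(start) < 50*time.Millisecond {
			return ""
		}
		return "Initializing"
	}

	// Return as soon as the expected phase shows up; fail only after the timeout.
	ok := pollUntil(func() bool { return phase() == "Initializing" },
		5*time.Second, 10*time.Millisecond)
	fmt.Println("reached Initializing:", ok)
}
```

The poll finishes as soon as the expected phase is observed, so the common case is faster than a fixed sleep, and a slow system gets the full timeout before the test fails.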

ncdc · 2022-10-26T18:42:37Z

/lgtm
/approve

openshift-ci · 2022-10-26T18:42:45Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ncdc

Needs approval from an approver in each of these files:

~~OWNERS~~ [ncdc]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

stevekuznetsov added 5 commits October 26, 2022 10:16

test/e2e: make sure tests actually run in parallel

ef2d401

Signed-off-by: Steve Kuznetsov <skuznets@redhat.com>

test/e2e: be consistent about timeouts and poll intervals

7311c78

Signed-off-by: Steve Kuznetsov <skuznets@redhat.com>

test/e2e/workspacetype: poll, don't blindly wait for 5s

207db52

Signed-off-by: Steve Kuznetsov <skuznets@redhat.com>

make: add non-kind variants of shared & sharded e2e

96608ac

Signed-off-by: Steve Kuznetsov <skuznets@redhat.com>

openshift-ci bot assigned ncdc and sttts Oct 26, 2022

openshift-ci bot requested review from davidfestal and jmprusi October 26, 2022 16:20

stevekuznetsov force-pushed the skuznets/non-kind-e2e branch from 437b38d to d512fe7 Compare October 26, 2022 16:46

openshift-ci bot assigned nrb Oct 26, 2022

openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Oct 26, 2022

stevekuznetsov force-pushed the skuznets/non-kind-e2e branch from d512fe7 to 8e53e07 Compare October 26, 2022 17:48

openshift-ci bot removed the lgtm Indicates that a PR is ready to be merged. label Oct 26, 2022

ncdc reviewed Oct 26, 2022

stevekuznetsov mentioned this pull request Oct 26, 2022

🌱 test/e2e: add the concept of suites, allow selecting #2266

Merged

openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Oct 26, 2022

openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Oct 26, 2022

openshift-merge-robot merged commit 264a6fe into kcp-dev:main Oct 26, 2022
