Expose failures to clean up after tests #342

squaremo · 2022-10-18T10:47:06Z

When stack objects aren't deleted after tests, it can mean

resources created by the stack aren't tidied up;
the finalizer logic isn't well tested
the stack might interfere in annoying ways with other tests.

Previously, all stacks in the namespace were deleted as part of a top-level AfterEach; but because this tried to delete all stacks and wait until they'd been finalised, any rogue test case that didn't clean up properly and couldn't be finalised would cause some other test to time out.

The first commit here uses a similar idea, but fails the test suite if there are stacks left over at the very end. This highlighted a few problems, which I've also fixed here:

a couple of tests were deleting their backend directory before deleting the stack using it, and could fail to be finalized as a result;
lots of stacks had random names that didn't help with tracking down the test case
some stacks didn't even get deleted after their test case

Deleting a stack but not waiting for it to be finalized, within a test, could mean other tests were "contaminated" (usually only meaning they might take a bit longer, and get cross-talk in the logs, but still). I've made all test cases use a helper which waits for the finalizers to have run.

When stack objects aren't deleted after tests, it can mean - resources created by the stack aren't tidied up; - the finalizer logic isn't well tested - the stack might interfere in annoying ways with other tests. Previously, all stacks in the namespace were deleted as part of a top-level `AfterEach`; but because this tried to delete all stacks and wait until they'd been finalised, any rogue test case that didn't clean up properly and couldn't be finalised would cause some _other_ test to time out. Instead, this commit uses a similar idea, but fails the test suite if there are stacks left over at the very end. This highlighted a couple of problems, which I've also fixed here: - one test was deleting its working directory before deleting the stack using it, and would fail to be finalized; - lots of stacks had random names that didn't help with tracking down the test case Finally, the helper `deleteAndWaitForFinalization` can be used to ensure a test case doesn't exit until it has truly cleaned up. It's worth using this everywhere -- next, I'm going to rewrite tests to use it, and the other helpers. Signed-off-by: Michael Bridgen <mbridgen@pulumi.com>

Signed-off-by: Michael Bridgen <mbridgen@pulumi.com>

It's a little bit faster to run a YAML program than to download dependencies and run a NodeJS program, so this saves a bit of time and reduces the possibility of timeouts. Signed-off-by: Michael Bridgen <mbridgen@pulumi.com>

Signed-off-by: Michael Bridgen <mbridgen@pulumi.com>

If the controller repeatedly retries a Stack, it will oscillate between having the "Reconciling" condition because it has been requeued, and having the same condition because it's being processed by the controller again. This makes seeing a specific reason for retrying tricky -- but, if you've observed that the stack has been processed and failed, then _either_ reason is good enough to show that it's being retried, so that race condition can be avoided. Signed-off-by: Michael Bridgen <mbridgen@pulumi.com>

squaremo added 5 commits October 18, 2022 11:36

Use deletion helper throughout

a5aa926

Signed-off-by: Michael Bridgen <mbridgen@pulumi.com>

Replace testdata/empty-stack with YAML program

fdb8b8a

It's a little bit faster to run a YAML program than to download dependencies and run a NodeJS program, so this saves a bit of time and reduces the possibility of timeouts. Signed-off-by: Michael Bridgen <mbridgen@pulumi.com>

Raise the slow test threshold

a48d230

Signed-off-by: Michael Bridgen <mbridgen@pulumi.com>

squaremo force-pushed the do-finalisers-work-properly branch from 791706f to bcd57fd Compare October 18, 2022 16:46

squaremo marked this pull request as ready for review October 18, 2022 16:57

lblackstone approved these changes Oct 18, 2022

View reviewed changes

lblackstone added the impact/no-changelog-required This issue doesn't require a CHANGELOG update label Oct 18, 2022

lblackstone merged commit 3d0782b into master Oct 18, 2022

lblackstone deleted the do-finalisers-work-properly branch October 18, 2022 17:20

lblackstone mentioned this pull request Oct 18, 2022

Update operator image to v1.10.0-rc.1 #341

Merged

squaremo mentioned this pull request Oct 21, 2022

Make mutual exclusion of sources obvious #340

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Expose failures to clean up after tests #342

Expose failures to clean up after tests #342

squaremo commented Oct 18, 2022 •

edited

Loading

Expose failures to clean up after tests #342

Expose failures to clean up after tests #342

Conversation

squaremo commented Oct 18, 2022 • edited Loading

squaremo commented Oct 18, 2022 •

edited

Loading