
Revisiting finalizers #64

Closed
ckreibich opened this issue Oct 19, 2021 · 6 comments · Fixed by #65

@ckreibich
Member

I'd still like to have a way to run finalizers reliably after any test, failing or succeeding. My use case would not be to check the environment for unexpected properties, but to shut down a setup created by the initializer.

We previously discussed this in #57 and pulled the plug at the time. I just looked at this for a bit and here's my take:

  • I think it would be acceptable to change the semantics of Finalizer (and PartFinalizer!) to behave in this way, instead of adding a new option. I've looked over our Zeek projects, Corelight, and the Zeek package source and I've only found actual finalizer use in three public packages, all containing the same single script, checking the status of the environment after a test. Running this after a failing test wouldn't change the overall result.

  • We can introduce a deprecation period to make this palatable for current finalizer users: for now, we simply flag the fact that finalizer semantics will change in an upcoming version when the test parser comes upon finalizers. (This output won't affect test outcomes via baselines etc, since it's not part of the command execution.)

  • I think it is fine to keep the semantics of the current finalizers, i.e., the fact that they can fail the test if they want to. It seems reasonable functionality to me, and I don't have to use it if I just want to run some activity. I just need to ensure I return success in any case.

  • We could start tracking the failure state of the current test via an environment variable, alongside the other TEST_* ones. TEST_FAILED could become 1, say, whenever the failures counter becomes non-zero. That way, you can build finalizers that are sensitive to the outcome of the previous test sequence.
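An outcome-sensitive finalizer built on that idea might look like this minimal sketch (TEST_FAILED is the variable proposed above, not something btest provides yet, and the shutdown helper is hypothetical):

```shell
#!/bin/sh
# Sketch of an outcome-sensitive finalizer. TEST_FAILED is the proposed
# variable (alongside the other TEST_* ones), assumed to be set to 1 once
# the test's failure counter becomes non-zero.
if [ "${TEST_FAILED:-0}" -ne 0 ]; then
    echo "test failed; capturing extra diagnostics" >&2
    # ... gather debugging output here ...
fi

# Clean up whatever the initializer set up, regardless of the outcome.
# (shutdown_test_setup is a hypothetical helper.)
# shutdown_test_setup

exit 0   # always return success so the finalizer itself never fails the test
```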

As an aside, I think we should release 1.0.0 in the not-too-distant future and adopt semantic versioning. That could then be the time when we switch the finalizers to the new behavior.

@awelzel @deejgregor @bbannier @rsmmr (whoa :-) ... thoughts?

@deejgregor
Contributor

(finally chiming-in on this / #57 for the first time)

I would very much like to have a way to always run a finalizer/teardown/etc. script on every test. I don't have a strong opinion on whether this should be a tweak to the existing finalizer functionality or a new script (e.g., Teardown) introduced to provide it. For brevity, I'll simply say "finalizer" for the rest of this comment.

Our tests often have significant impacts on the state of the system, including stopping/starting daemons, and it can take some time (sometimes 30 seconds or more) for things to settle after a test completes. In particular, I would like the ability to run a finalizer script in btest that always runs, no matter what the outcome of the test itself is. For me, that script would wait for the system to stabilize and, if that doesn't happen within a timeout, output some debugging information and fail the test. My test suite currently consists of 659 btest scripts, a significant fraction of which impact the system in this way, and I'd like to avoid adding boilerplate to each and every test (in fact, it was @awelzel seeing that significant amount of boilerplate that led him to propose #57 in the first place).

My key requirements:

  1. Run the finalizer script at the conclusion of every test, regardless of whether the test succeeded or failed. I wouldn't want it to run if a test is skipped, which I would assume is how we already treat the initializer and the existing finalizer today.
  2. Let the finalizer fail the test. I want to assert that the system is stable in my finalizer script, and if I'm not able to do that, I want the test to fail to signal that the test left the system broken.
  3. Include the finalizer's stderr output in the diagnostic output, just as is done for stderr from TEST-EXEC scripts.
  4. Pass the test outcome to the finalizer (as @ckreibich suggested above with TEST_FAILED). I could see us eventually extending our finalizer script to capture a standard set of debugging information whenever a test fails. One possibility, if we end up doing a lot with OpenTelemetry, might be to set a TRACEPARENT environment variable to a unique value for each test and if there is a failure, have the finalizer script grab everything related to that trace ID and stick it in a file in the test's working directory. If you're curious about that, Environment Variables as Carrier for Inter-Process Propagation to transport context open-telemetry/opentelemetry-specification#740 has some comments about how this is being used in some CI and test systems.
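Requirements 1 and 2 together could look roughly like the following shell sketch of such an always-run teardown; `system_is_stable` is a placeholder for a site-specific check, and the 30-second budget is illustrative:

```shell
#!/bin/sh
# Sketch of an always-run teardown (requirements 1 and 2 above).
# system_is_stable is a placeholder; replace it with a real check, e.g.
# verifying that all daemons touched by the test are back up.
system_is_stable() {
    true
}

deadline=$(( $(date +%s) + 30 ))   # give the system up to 30 seconds to settle
until system_is_stable; do
    if [ "$(date +%s)" -ge "$deadline" ]; then
        echo "system did not stabilize within 30s; dumping state" >&2
        # ... collect debugging information into the test's directory ...
        exit 1    # non-zero exit makes the teardown fail the test
    fi
    sleep 1
done
exit 0
```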

@rsmmr
Member

rsmmr commented Oct 20, 2021

This sounds good to me, incl. @deejgregor's requirements, except that I suggest we use a new keyword instead of repurposing Finalizer. We can still deprecate that one and remove it later (or better: produce an error if it's still used), but I wouldn't change its semantics under the hood. If anybody else out there happens to be using it, a silent change would be difficult to deal with, because anybody running that testsuite would need the updated tests.

@deejgregor
Contributor

deejgregor commented Oct 20, 2021

Teardown as suggested in #57 might be a good way to go, then. And that matches similar functionality in JUnit where the per-test methods are setUp() and tearDown() -- https://junit.org/junit4/faq.html#organize_3 .
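For reference, the existing Finalizer is configured in btest.cfg, so a Teardown option would presumably slot in next to it. A sketch, with illustrative paths and assuming the option name that was proposed (nothing here is final until it lands):

```ini
# btest.cfg sketch -- assumes the new Teardown option sits alongside Finalizer
[btest]
TestDirs    = tests
TmpDir      = %(testbase)s/.tmp
Teardown    = %(testbase)s/scripts/teardown.sh
```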

@awelzel
Contributor

awelzel commented Oct 20, 2021

> We could start tracking the failure state of the current test via an environment variable, alongside the other TEST_* ones. TEST_FAILED could become 1, say, whenever the failures counter becomes non-zero. That way, you can build finalizers that are sensitive to the outcome of the previous test sequence.

This is a great suggestion. #57 tried that as a command-line parameter, but TEST_FAILED=0|1 in the environment is much cleaner.

@ckreibich
Member Author

Great stuff, looks like we have a way forward then. Teardown is fine with me too ... it will land the feature faster than waiting out a deprecation schedule. :-) I'll put something together.

ckreibich added a commit that referenced this issue Nov 1, 2021
* topic/christian/gh-64:
  Documentation for PartInitializer, Teardown, and PartTeardown.
  Add testcases for new functionality
  Add PartInitializer option
  Support for global and test-part teardowns
  Remove Python 2.7 from manifest
@ckreibich
Member Author

FYI, I just released this as 0.71, so one has a minimum version to require when using the teardown. It'll be good to exercise it now to see whether it works for everyone as planned.
