Terminate proccess's when experiencing a fatal error in ductape runner #323

imcdo · 2022-06-08T18:04:02Z

closes #322
Simply terminates process when an exception is caught in the run cycle

CLAassistant · 2022-06-08T18:04:08Z

All committers have signed the CLA.

imcdo · 2022-06-08T18:34:27Z

tested by running ducktape --max-parallel 10000 --repeat 6 --test-runner-timeout 1 systests/cluster/test_runner_operations.py against a vagrant cluster on top of pytests.

stan-is-hate

Thanks Ian!

tests/runner/check_runner.py

stan-is-hate · 2022-06-08T22:32:30Z

systests/cluster/test_runner_operations.py

+        self.service = SimpleEchoService(self.test_context)
+
+    @cluster(num_nodes=1)
+    def timeout_test(self):


I would probably want to test with multiple tests in flight - some scheduled in parallel, some still yet to schedule (maybe just run for all systests?)

well i run in parallel for the unit test as well, but yes ran in systest with some tests yet to schedule etc.

see #323 (comment)

in #323 you're saying that you've tested with test_runner_operations, which is a single test method - maybe worth testing with simply systests folder?

yeah for sure ill give it a run, ran it with repeat to simulate a bunch of test being run but yeah lets get the complete coverage.

stan-is-hate

Approved, thanks Ian! Just please run it with all of ducktape systests, maybe also play with test sizes to put the fatal test in the middle of the run or parallel with other tests etc

stan-is-hate · 2022-06-09T00:49:26Z

Also, this PR cannot be built, please don't merge without fixing that 😄

stan-is-hate · 2022-06-09T00:57:36Z

ducktape/tests/runner.py

@@ -210,6 +210,9 @@ def run_all_tests(self):
                        self._log(logging.ERROR, err_str)

                        # All processes are on the same machine, so treat communication failure as a fatal error
+                        for proc in self._client_procs.values():
+                            proc.terminate()


Also compare this to https://github.com/confluentinc/ducktape/blob/master/ducktape/tests/runner.py#L124 which uses os.kill vs terminate() - what's the difference and pros/cons?

would probably be good to have a unified cleanup_child_processes method or smth

if you read the docs for terminate:

Terminate the process. On Unix this is done using the SIGTERM signal; on Windows TerminateProcess()

which seems to be more platform agnostic

What about modifying that other line too then? Can be a separate PR though.

Yeah might be best to touch it in another pr with more testing.

confluentinc#323) * update test runner * update docstring * readd newline * add a simple test to run against * fix formating (cherry picked from commit a214102)

confluentinc#323) * update test runner * update docstring * readd newline * add a simple test to run against * fix formating

update test runner

ebb81b9

imcdo requested a review from a team June 8, 2022 18:04

imcdo added 3 commits June 8, 2022 11:05

update docstring

b36ca0c

readd newline

8ec5b9d

add a simple test to run against

2bf0a9b

stan-is-hate approved these changes Jun 8, 2022

View reviewed changes

stan-is-hate approved these changes Jun 9, 2022

View reviewed changes

stan-is-hate reviewed Jun 9, 2022

View reviewed changes

fix formating

eb82607

stan-is-hate approved these changes Jun 16, 2022

View reviewed changes

imcdo merged commit a214102 into confluentinc:0.7.x Jun 16, 2022

andrewhsu mentioned this pull request Mar 22, 2023

backport: Terminate proccess's when experiencing a fatal error in ductape runner redpanda-data/ducktape#25

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Terminate proccess's when experiencing a fatal error in ductape runner #323

Terminate proccess's when experiencing a fatal error in ductape runner #323

imcdo commented Jun 8, 2022 •

edited

CLAassistant commented Jun 8, 2022 •

edited

imcdo commented Jun 8, 2022

stan-is-hate left a comment

stan-is-hate Jun 8, 2022

imcdo Jun 8, 2022 •

edited

imcdo Jun 8, 2022

stan-is-hate Jun 9, 2022

imcdo Jun 9, 2022

stan-is-hate left a comment

stan-is-hate commented Jun 9, 2022

stan-is-hate Jun 9, 2022

stan-is-hate Jun 9, 2022

imcdo Jun 9, 2022

stan-is-hate Jun 16, 2022

imcdo Jun 16, 2022 •

edited

Terminate proccess's when experiencing a fatal error in ductape runner #323

Terminate proccess's when experiencing a fatal error in ductape runner #323

Conversation

imcdo commented Jun 8, 2022 • edited

CLAassistant commented Jun 8, 2022 • edited

imcdo commented Jun 8, 2022

stan-is-hate left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

imcdo Jun 8, 2022 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

stan-is-hate left a comment

Choose a reason for hiding this comment

stan-is-hate commented Jun 9, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

imcdo Jun 16, 2022 • edited

Choose a reason for hiding this comment

imcdo commented Jun 8, 2022 •

edited

CLAassistant commented Jun 8, 2022 •

edited

imcdo Jun 8, 2022 •

edited

imcdo Jun 16, 2022 •

edited