[DM-33438] Clean up run_demo.sh #31

Merged: 6 commits merged into main from tickets/DM-33438 on Jan 25, 2023

Conversation

@cbanek (Contributor) commented on Jan 24, 2023

No description provided.

@cbanek (Contributor, Author) commented on Jan 24, 2023

This script works if I take out the multiprocessing, but with the multiprocessing I get the following error from the Python tests:

___________________________________________________________________ PiplinesCheckTestCase.testExecutionButler (chain='demo_collection_exe') ___________________________________________________________________

self = <test_butler.PiplinesCheckTestCase testMethod=testExecutionButler>

def testExecutionButler(self):
    """Check outputs match in both runs."""

    for chain in (EXE_CHAIN, QBB_CHAIN):
        with self.subTest(chain=chain):
            # Check that we have identical datasets in both collections
            # except for the dataset.id
            main_datasets = self._get_datasets_from_chain(MAIN_CHAIN)
            datasets = self._get_datasets_from_chain(chain)
            self.assertGreater(len(datasets), 0)
            self.assertEqual(len(main_datasets), len(datasets))

E AssertionError: 21 != 13

tests/test_butler.py:101: AssertionError
___________________________________________________________________ PiplinesCheckTestCase.testExecutionButler (chain='demo_collection_qbb') ___________________________________________________________________

self = <test_butler.PiplinesCheckTestCase testMethod=testExecutionButler>

def testExecutionButler(self):
    """Check outputs match in both runs."""

    for chain in (EXE_CHAIN, QBB_CHAIN):
        with self.subTest(chain=chain):
            # Check that we have identical datasets in both collections
            # except for the dataset.id
            main_datasets = self._get_datasets_from_chain(MAIN_CHAIN)
            datasets = self._get_datasets_from_chain(chain)
            self.assertGreater(len(datasets), 0)
            self.assertEqual(len(main_datasets), len(datasets))

E AssertionError: 21 != 13

tests/test_butler.py:101: AssertionError

bin/run_demo.sh: outdated review thread (resolved)
@timj (Member) commented on Jan 24, 2023

That test is saying that the number of datasets created in the EXECUTION BUTLER bit of code (the first 3 pipetask calls using $NODE) is equal to the number of datasets created by the QUANTUM BACKED BUTLER bit of code (the second set of 3 pipetask calls using $NODE).
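(For illustration, a minimal Python sketch of the kind of registry query the test's _get_datasets_from_chain helper presumably makes. The repo path and collection names are placeholders, and the queryDatasets call is an assumption based on the lsst.daf.butler registry API, not the actual helper in tests/test_butler.py.)

from lsst.daf.butler import Butler

def get_datasets_from_chain(repo: str, chain: str) -> list:
    # Hypothetical stand-in for the test helper: collect every dataset ref
    # reachable through the given chained collection.
    butler = Butler(repo)
    # `...` asks the registry for all dataset types; `collections` limits
    # the search to the named chain.
    return list(butler.registry.queryDatasets(..., collections=chain))

# The failing assertion compares the main chain against each derived chain,
# here with placeholder repo/collection names:
#   main = get_datasets_from_chain("DATA_REPO", "demo_collection")
#   exe = get_datasets_from_chain("DATA_REPO", "demo_collection_exe")
#   assert len(main) == len(exe)  # fails above with 21 != 13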

@timj (Member) left a review comment:

This looks good. There is a small possibility that the two parallel processes hitting sqlite would fall over each other, but given that the second run uses the graph and not sqlite at all, and we only have two processes, I think this will almost certainly be fine. cc @andy-slac
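(To make the "two processes" concrete, a minimal Python sketch of launching two command sequences in parallel and waiting for both, analogous to backgrounding the two pipetask sequences in run_demo.sh. The commands below are placeholder echoes, not the real pipetask invocations.)

import subprocess

# Placeholders for the two pipetask sequences run_demo.sh runs concurrently
# (execution butler vs. quantum-backed butler); the real script calls pipetask.
commands = [
    ["bash", "-c", "echo 'execution-butler sequence (3 pipetask calls)'"],
    ["bash", "-c", "echo 'quantum-backed-butler sequence (3 pipetask calls)'"],
]

# Start both jobs without blocking, then wait for each to finish,
# mirroring the shell pattern `cmd1 & cmd2 & wait`.
procs = [subprocess.Popen(cmd) for cmd in commands]
for proc in procs:
    proc.wait()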

bin/run_demo.sh: four more outdated review threads (resolved)
@cbanek merged commit 6f36787 into main on Jan 25, 2023
@cbanek deleted the tickets/DM-33438 branch on January 25, 2023 at 19:31