DM-23174: Consolidate daf_butler test code #229

kfindeisen · 2020-02-04T21:51:55Z

This PR moves the test utilities previously located in the tests/ directory to a new lsst.daf.butler.tests package (which is not included in lsst.daf.butler). It also adds some extra utilities designed for test code in other packages, such as functions for creating and configuring mock repositories (see lsst/pipe_base#114 for an example of where this would be useful).

timj

Thanks for this clean up. I have a few questions before I do a final sign off. In particular I'd like @TallJimbo to comment on a couple of the test routines.

COPYRIGHT

python/lsst/daf/butler/tests/_examplePythonTypes.py

python/lsst/daf/butler/tests/_testRepo.py

python/lsst/daf/butler/tests/_examplePythonTypes.py

timj · 2020-02-04T22:50:55Z

tests/test_testRepo.py

+    @classmethod
+    def setUpClass(cls):
+        # Repository should be re-created for each test case, but
+        # this has a prohibitive run-time cost at present


We need to work this out. tests/test_butler.py creates huge numbers of temp directories with their own butlers and this test class should not be any different. For example:

python tests/test_butler.py
.........................................................x......
----------------------------------------------------------------------
Ran 64 tests in 15.860s

and most of those 64 tests are calling makeRepo in their own temp directory.

This test on this branch is really fast for me:

$ time python tests/test_testRepo.py
........
----------------------------------------------------------------------
Ran 8 tests in 0.386s
OK
real 0m1.711s
user 0m1.070s
sys 0m0.604s

Maybe we should use https://pypi.org/project/pytest-profiling/ and compare timings.
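A rough way to compare timings without extra dependencies is sketched below. It uses the standard library's cProfile rather than pytest-profiling, and `ExampleTestCase` and `profile_test_case` are hypothetical names standing in for the real test classes in tests/test_butler.py and tests/test_testRepo.py.

```python
import cProfile
import io
import pstats
import tempfile
import unittest


class ExampleTestCase(unittest.TestCase):
    """Hypothetical stand-in for a test case that builds a repo per test."""

    def setUp(self):
        # A temporary directory stands in for the per-test repository.
        self.temp = tempfile.TemporaryDirectory()
        self.addCleanup(self.temp.cleanup)

    def test_root_exists(self):
        self.assertTrue(self.temp.name)


def profile_test_case(test_case_class, top=10):
    """Run one TestCase class under cProfile.

    Returns the unittest result and a text report of the ``top`` entries
    sorted by cumulative time.
    """
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(test_case_class)
    runner = unittest.TextTestRunner(stream=io.StringIO(), verbosity=0)
    profiler = cProfile.Profile()
    profiler.enable()
    result = runner.run(suite)
    profiler.disable()
    report = io.StringIO()
    stats = pstats.Stats(profiler, stream=report)
    stats.sort_stats("cumulative").print_stats(top)
    return result, report.getvalue()


if __name__ == "__main__":
    _, text = profile_test_case(ExampleTestCase)
    print(text)
```

Running the same helper on two machines and comparing the top entries of the reports would show whether the extra time is spent in repository creation or elsewhere.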

I propose splitting that into another ticket, possibly a high-priority one for February. While I am worried about what's going on, the pipeline test framework is something we need ASAP, and this ticket has already dragged on for two weeks for various reasons (e.g., the Butler API changing due to concurrent development). There is no guarantee that we can track down the cause of the slowdown (which, as far as we know, affects only my computer) in any particular amount of time.

Ok, create a new ticket. It is concerning that you see such a problem. I guess it's a good thing that you have demonstrated that all tests can run with a shared butler and a different collection for each test.

Done as DM-23357.
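The pattern discussed in this thread (one shared repository, a separate collection per test) might look roughly like the following. This is a sketch using only unittest: a temporary directory stands in for the repository, and the `collection` attribute and its naming scheme are assumptions, not the daf_butler API.

```python
import shutil
import tempfile
import unittest


class SharedRepoTestCase(unittest.TestCase):
    """Hypothetical test case sharing one expensive fixture across tests."""

    # Records (root, collection) pairs so the isolation can be inspected.
    seen = []

    @classmethod
    def setUpClass(cls):
        # Expensive setup runs once per class; the directory is a stand-in
        # for a repository that is costly to create.
        cls.root = tempfile.mkdtemp()

    @classmethod
    def tearDownClass(cls):
        shutil.rmtree(cls.root, ignore_errors=True)

    def setUp(self):
        # Cheap per-test setup: a collection name unique to this test.
        self.collection = f"test-{self.id()}"

    def test_first(self):
        self.seen.append((self.root, self.collection))

    def test_second(self):
        self.seen.append((self.root, self.collection))


if __name__ == "__main__":
    unittest.main()
```

Each test sees the same `root` but its own `collection`, so the tests stay isolated from one another without paying the repository creation cost more than once.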

timj · 2020-02-04T22:59:08Z

tests/test_testRepo.py

+        # outfile has the most obvious effects of any Butler.makeRepo keyword
+        with tempfile.TemporaryDirectory() as temp:
+            path = os.path.join(temp, 'oddConfig.py')
+            makeTestRepo(self.root, {}, outfile=path)


I need to think about why this works, given that self.root won't include a butler.yaml file: the Butler created inside makeTestRepo should fail because it won't know about the config written to path. At the very least, I think the returned Butler will have a different config from the one in path.
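The expectation in this comment can be illustrated with a toy model. Everything below (`toy_make_repo`, `toy_open_repo`, the config contents) is hypothetical and only mimics the idea of a config file written outside the repository root; it does not reproduce what Butler.makeRepo or makeTestRepo actually do.

```python
import os
import tempfile

CONFIG_NAME = "butler.yaml"


def toy_make_repo(root, outfile=None):
    """Write a toy config to ``outfile`` if given, else into ``root``."""
    os.makedirs(root, exist_ok=True)
    target = outfile if outfile is not None else os.path.join(root, CONFIG_NAME)
    with open(target, "w") as stream:
        stream.write(f"root: {root}\n")
    return target


def toy_open_repo(root):
    """Load the toy config from ``root``; fail if it is not there."""
    path = os.path.join(root, CONFIG_NAME)
    if not os.path.exists(path):
        raise FileNotFoundError(f"no {CONFIG_NAME} in {root}")
    with open(path) as stream:
        return stream.read()


def outfile_hides_config():
    """Return True if a repo made with ``outfile`` cannot be opened by root."""
    with tempfile.TemporaryDirectory() as root, \
            tempfile.TemporaryDirectory() as temp:
        toy_make_repo(root, outfile=os.path.join(temp, "oddConfig.py"))
        try:
            toy_open_repo(root)
        except FileNotFoundError:
            return True
        return False
```

In this toy model, opening the repository by root fails once the config has been redirected, which is the behavior the comment expected to see. That the real test passes suggests the Butler inside makeTestRepo is built from something other than a lookup in self.root.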

kfindeisen added 2 commits February 4, 2020 11:26

Standardize copyright dates.

5e07761

Developer guide says to use spans for consecutive years.

Fix documentation build warning.

823c75b

kfindeisen requested a review from timj February 4, 2020 21:51

timj requested changes Feb 5, 2020

kfindeisen added 7 commits February 5, 2020 11:19

Add UW copyright.

0693788

Create lsst.daf.butler.tests package.

0e8e396

Move datasetsHelper to daf.butler.tests.

524e5ed

Move dummyRegistry to daf.butler.tests.

c28c42a

Add missing __all__ to dummyRegistry.

0329612

Move examplePythonTypes to daf.butler.tests.

4ceda1d

Add missing __all__ to examplePythonTypes.

5c8ce09

TallJimbo approved these changes Feb 5, 2020

timj approved these changes Feb 5, 2020

kfindeisen added 5 commits February 5, 2020 14:05

Transfer prototype Butler code from lsst.verify.

4c8c2f9

The code has been cleaned up, and tests added.

Make makeTestButler infer relationships.

c912dbc

It's hard to explicitly provide correct keys without understanding how the Butler dimensions system works in detail. Moving key constraints to automated (if simple-minded) code greatly reduces the burden on callers.

Add expandUniqueId test utility to recover auto-generated data IDs.

0ee8f1f

Split repository/collection creation.

4fc4edf

Each test should have its own collection for isolation, but creating a completely new repository each time is impractical.

Add generic support for MetricsExample.

b4cb1d7

This change lets tests avoid using Numpy arrays, whose nonstandard __eq__ behavior makes them poor test objects.
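The problem with NumPy arrays as test objects is that `==` on arrays returns an elementwise result, and converting a multi-element result to a single bool raises ValueError. The sketch below mimics that behavior with a small stand-in class so that it runs without NumPy. `ElementwiseArray`, `SimpleMetrics`, and `compares_cleanly` are hypothetical names for illustration and are not the actual MetricsExample definition.

```python
class ElementwiseArray:
    """Stand-in that mimics NumPy's elementwise ``==`` (not NumPy itself)."""

    def __init__(self, values):
        self.values = list(values)

    def __eq__(self, other):
        # Like NumPy, return one result per element instead of a single bool.
        return ElementwiseArray(
            a == b for a, b in zip(self.values, other.values)
        )

    def __bool__(self):
        # Like NumPy for multi-element arrays, refuse to collapse to one bool.
        raise ValueError("truth value of a multi-element result is ambiguous")


class SimpleMetrics:
    """Hypothetical plain test object with ordinary value equality."""

    def __init__(self, summary, data):
        self.summary = dict(summary)
        self.data = list(data)

    def __eq__(self, other):
        if not isinstance(other, SimpleMetrics):
            return NotImplemented
        return (self.summary, self.data) == (other.summary, other.data)


def compares_cleanly(first, second):
    """Return True if ``first == second`` can be used as a plain boolean."""
    try:
        bool(first == second)
    except ValueError:
        return False
    return True
```

Assertions such as unittest's assertEqual rely on the truth value of `first == second`, so objects like `ElementwiseArray` need special handling in every test, while objects like `SimpleMetrics` can be compared directly.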

kfindeisen force-pushed the tickets/DM-23174 branch from f7ebf5a to 48bfafd Compare February 6, 2020 00:58

kfindeisen added 3 commits February 6, 2020 12:26

Add support for ChainedDatastore to registerMetricsExample.

c37dd25

This approach is not object-oriented, but making the Datastore interface support possible child datastores is a nontrivial fix.

Let makeTestRepo take same arguments as Butler.makeRepo.

defc6c8

Make makeTestRepo use an in-memory repository by default.

37c383a

kfindeisen force-pushed the tickets/DM-23174 branch from 48bfafd to 37c383a Compare February 6, 2020 18:32

kfindeisen merged commit 37c383a into master Feb 6, 2020

kfindeisen deleted the tickets/DM-23174 branch February 6, 2020 22:04

kfindeisen mentioned this pull request Feb 7, 2020

DM-22599: Develop PipelineTask unit test framework lsst/pipe_base#116

Merged
