DM-25000: Make registry query methods work with composition again. #289

TallJimbo · 2020-05-21T16:59:20Z

No description provided.

We stopped using these code paths outside of tests on DM-24288, and removing them is a huge simplification for Registry. In detail:

- DatasetRef.components has been removed, along with the flatten and allRefs methods. This means DatasetRefs now only have two states (resolved and unresolved), instead of it being unclear whether an empty dict means there are no components or that we don't know about the components. DatasetRef.makeComponentRef has been added to make it easy to make a ref for a component dataset from the parent ref.
- All "recursive" kwargs have been removed, along with some occasionally-complex logic needed to implement them.
- The dataset_composition table has been removed.

It is worth noting that I found a couple of spots where we were still calling insertDatasets with recursive=True (ingest and import from yaml). So I think we were still generating unnecessary database rows sometimes, including in contexts where a lot of datasets were involved. I don't think those rows were ever used after insertion, which is why this didn't break anything.
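For readers unfamiliar with the two-state model described in this commit message, here is a minimal, self-contained sketch. It does not use the real daf_butler classes: `SketchRef`, its fields, and `make_component_ref` are hypothetical stand-ins, and the choice to have the component ref carry the parent's ID is an assumption of the sketch rather than something stated in this PR.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class SketchRef:
    """Hypothetical stand-in for DatasetRef; not the daf_butler class."""

    dataset_type: str
    data_id: Tuple[Tuple[str, object], ...]  # (key, value) pairs
    id: Optional[int] = None  # None means "unresolved"

    @property
    def resolved(self) -> bool:
        # Only two states exist: an ID is present or it is not.
        return self.id is not None

    def unresolved(self) -> "SketchRef":
        # Same dataset type and data ID, with the ID dropped.
        return SketchRef(self.dataset_type, self.data_id)

    def make_component_ref(self, name: str) -> "SketchRef":
        # Assumption: the component reuses the parent's data ID and ID,
        # and its type name is "<parent>.<component>".
        return SketchRef(f"{self.dataset_type}.{name}", self.data_id, self.id)


parent = SketchRef("permabias", (("instrument", "Cam1"), ("detector", 1)), id=42)
child = parent.make_component_ref("wcs")
```

Because there is no components dict to populate, nothing in this sketch can be in an ambiguous "components unknown" state.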

We now just implement __repr__ and let __str__ delegate to that, and aim to make it informative and non-confusing, because we've never been able to make the string eval'able as in the ideal case. Note that the biggest problem with the old repr (which is what is used by __str__ in built-in containers of data IDs, regardless of what we'd prefer) is that it printed the full DimensionGraph, which often has implied dimensions and hence more "keys" than the mapping interface actually provided access to. That made it impossible to infer what the dict-like form actually was, and it's the dict-like form that people actually want to know.
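The container behaviour mentioned above is plain Python: `str()` of a list calls `repr()` on each element, and a class that defines only `__repr__` gets a `__str__` that falls back to it. The sketch below illustrates the idea with a hypothetical `SketchDataId` class; the class, its constructor arguments, and the exact output format are assumptions for illustration, not the daf_butler implementation.

```python
class SketchDataId:
    """Hypothetical data ID: a mapping over required keys only."""

    def __init__(self, required, implied=()):
        self._values = dict(required)
        # Implied dimensions are tracked but are not keys of the mapping.
        self._implied = tuple(implied)

    def keys(self):
        return self._values.keys()

    def __getitem__(self, key):
        return self._values[key]

    def __repr__(self):
        # Show only the dict-like form, i.e. what keys() exposes.
        items = ", ".join(f"{k}: {v!r}" for k, v in self._values.items())
        return "{" + items + "}"

    # No __str__ is defined: object.__str__ delegates to __repr__.


data_id = SketchDataId({"instrument": "Cam1", "detector": 1},
                       implied=["physical_filter"])
text_single = str(data_id)      # "{instrument: 'Cam1', detector: 1}"
text_in_list = str([data_id])   # "[{instrument: 'Cam1', detector: 1}]"
```

The implied dimension never shows up in the output, so the printed form matches what the mapping interface actually provides.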

We now have an option to control whether components are included in the results. We expect users to consider them noise if the parent is also included in the results, so the default is now to not include them unless the parent is not in the results.
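The default described above can be sketched as a small filter over dataset type names. This is only an illustration: the function name `select_dataset_types`, the tri-state `components` argument, and the use of a dot to separate parent and component names are assumptions of the sketch, not the Registry API.

```python
from typing import Iterable, List, Optional


def select_dataset_types(matches: Iterable[str],
                         components: Optional[bool] = None) -> List[str]:
    """Filter matched dataset type names (hypothetical helper).

    components=True  -> keep every matched component.
    components=False -> drop every component.
    components=None  -> keep a component only if its parent did not match.
    """
    names = list(matches)
    parents = {name for name in names if "." not in name}
    result = []
    for name in names:
        parent, _, component = name.partition(".")
        if not component:
            result.append(name)  # parent dataset types are always kept
        elif components is True:
            result.append(name)
        elif components is None and parent not in parents:
            result.append(name)
    return result


matched = ["permabias", "permabias.wcs", "flat.image"]
default_result = select_dataset_types(matched)
all_result = select_dataset_types(matched, components=True)
none_result = select_dataset_types(matched, components=False)
```

With the default, `permabias.wcs` is dropped because `permabias` is already in the results, while `flat.image` is kept because `flat` is not.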

timj

Looks ok.

timj · 2020-05-21T19:43:52Z

python/lsst/daf/butler/registry/tests/_registry.py

+        childType = registry.getDatasetType("permabias.wcs")
+        parentRefResolved = registry.findDataset(parentType, collections=collection,
+                                                 instrument="Cam1", detector=1)
+        self.assertIsNotNone(parentRefResolved)


Maybe we could be more positive in this test and use:

self.assertIsInstance(parentRefResolved, DatasetRef)

?
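To illustrate the difference between the two assertions, here is a small standalone sketch. `SketchRef` and `find_dataset` are hypothetical stand-ins for DatasetRef and registry.findDataset; only the `unittest` calls are real.

```python
import unittest


class SketchRef:
    """Hypothetical stand-in for DatasetRef."""


def find_dataset(found: bool):
    # Stand-in lookup: returns a ref when found, None otherwise.
    return SketchRef() if found else None


class FindDatasetChecks(unittest.TestCase):
    def test_found(self):
        ref = find_dataset(found=True)
        self.assertIsNotNone(ref)
        self.assertIsInstance(ref, SketchRef)

    def test_stricter_check(self):
        wrong = "not a ref"
        # The weaker check accepts any non-None value...
        self.assertIsNotNone(wrong)
        # ...while assertIsInstance rejects wrong types and None alike.
        with self.assertRaises(AssertionError):
            self.assertIsInstance(wrong, SketchRef)
        with self.assertRaises(AssertionError):
            self.assertIsInstance(find_dataset(found=False), SketchRef)


def run_checks() -> unittest.TestResult:
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(FindDatasetChecks)
    result = unittest.TestResult()
    suite.run(result)
    return result
```

Since `None` is never an instance of the expected class, `assertIsInstance` covers the not-None check and additionally pins down the return type.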

timj · 2020-05-21T19:45:49Z

python/lsst/daf/butler/registry/tests/_registry.py

+        parentRefUnresolved = parentRefResolved.unresolved()
+        # Search for a single dataset with findDataset.
+        childRef1 = registry.findDataset("permabias.wcs", collections=collection,
+                                         dataId=parentRefUnresolved.dataId)


Isn't the dataId of parentRefUnresolved the same as the dataId for parentRefResolved?

TallJimbo · 2020-05-22

Good catch; the unresolved variable's existence is a relic of a previous incarnation of the test, and I've removed it.

TallJimbo added 3 commits May 21, 2020 12:59

TallJimbo force-pushed the tickets/DM-25000 branch from cee2c98 to f7999e9 Compare May 21, 2020 16:59

timj approved these changes May 21, 2020

Handle dataset components in queryDimensions and queryDatasets.

11c8da9

TallJimbo force-pushed the tickets/DM-25000 branch from 97367b7 to 11c8da9 Compare May 22, 2020 16:09

TallJimbo merged commit f58ce13 into master May 22, 2020

TallJimbo deleted the tickets/DM-25000 branch May 22, 2020 16:11
