DM-37786: allow PipelineTasks to control default dataset-query-constraint behavior. #304

TallJimbo · 2023-02-14T18:09:30Z

Checklist

ran Jenkins
added a release note for user-visible changes to doc/changes

codecov · 2023-02-14T18:12:47Z

Codecov Report

Base: 80.56% // Head: 80.61% // Increases project coverage by +0.04% 🎉

Coverage data is based on head (0107320) compared to base (2a0a5ff).
Patch coverage: 90.00% of modified lines in pull request are covered.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #304      +/-   ##
==========================================
+ Coverage   80.56%   80.61%   +0.04%     
==========================================
  Files          57       57              
  Lines        6304     6319      +15     
  Branches     1174     1280     +106     
==========================================
+ Hits         5079     5094      +15     
- Misses        981      982       +1     
+ Partials      244      243       -1

Impacted Files	Coverage Δ
python/lsst/pipe/base/graphBuilder.py	`65.99% <77.77%> (-0.14%)`	⬇️
python/lsst/pipe/base/connectionTypes.py	`84.84% <100.00%> (+0.23%)`	⬆️
python/lsst/pipe/base/pipeline.py	`63.63% <100.00%> (+0.82%)`	⬆️
python/lsst/pipe/base/graph/graph.py	`82.15% <0.00%> (+0.52%)`	⬆️

timj · 2023-02-14T18:21:06Z

@TallJimbo would you mind adding Python 3.11 to the build matrix whilst you are doing this PR? It should work fine; ctrl_mpexec builds okay.

natelust

While I am slightly worried that the multiple, not-quite-overlapping ways and places in which dataset constraints are used will confuse future maintainers, I think this is a pragmatic approach that avoids a larger refactoring.

TallJimbo · 2023-02-14T19:21:12Z

I had the same misgivings and sense that it was the least-bad option, and I'll add some code comments to that effect.

I wouldn't call this a hack, but it's being added here now to work around long-standing limitations involving spatial overlaps in QG generation, and I still want to fix those limitations directly. That makes me feel a *tiny* bit better about having absolutely no idea how I'd start to write a test for this: the circumstances under which the problem appears in the real world require a lot of data (multiple tracts) and a pretty complicated graph and task, and we have none of that in ci_* packages, let alone here. So I'll be relying on one-off at-scale tests for now.
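
As a rough illustration of the idea in the PR title, here is a minimal, self-contained sketch of letting each input connection declare whether it should constrain the initial query by default. It does not use the real `lsst.pipe.base` API: `ToyInput`, `defer_constraint`, `initial_constraints`, and the dataset type names are all hypothetical and exist only for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ToyInput:
    """Hypothetical stand-in for an input connection (not the real API)."""

    dataset_type: str
    # Assumption: a per-connection flag that a task author sets to opt this
    # input out of constraining the initial query by default.
    defer_constraint: bool = False


def initial_constraints(inputs, user_override=None):
    """Return the dataset type names used to constrain the initial query.

    An explicit user choice wins; otherwise each connection's own
    default decides.
    """
    if user_override is not None:
        return sorted(user_override)
    return sorted(c.dataset_type for c in inputs if not c.defer_constraint)


# Toy dataset type names, invented for this example.
inputs = [
    ToyInput("toy_images"),
    ToyInput("toy_overlap_catalog", defer_constraint=True),
]
task_default = initial_constraints(inputs)
overridden = initial_constraints(inputs, user_override={"toy_overlap_catalog"})
```

In this toy model the task's defaults exclude the deferred input, while an explicit override replaces the defaults entirely. Whether the real implementation resolves precedence this way is an assumption here.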

Even though these are debug-level, all of the other debug-level logging in GraphBuilder is much sparser.

I've long wondered why users occasionally reported the previous "this shouldn't be possible" error message, and finally figured it out: initInputs don't get included in the initial data ID query, so follow-up lookups on those can fail even if repo state hasn't changed under us.
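
The failure mode described above can be sketched with a toy two-step lookup. This is only an illustration under simplifying assumptions, not the GraphBuilder code: `toy_generate`, the set-based "repo", and the dataset names are hypothetical.

```python
def toy_generate(existing, regular_inputs, init_inputs):
    """Toy two-step lookup; every name here is hypothetical."""
    # Step 1: the "initial query" only checks regular inputs.
    missing_regular = [d for d in regular_inputs if d not in existing]
    if missing_regular:
        return "empty"  # the initial query matched nothing
    # Step 2: follow-up lookup for init-inputs, which step 1 never checked.
    missing_init = [d for d in init_inputs if d not in existing]
    if missing_init:
        # This is not evidence that the repo changed between the two steps;
        # step 1 simply never looked at these datasets.
        raise LookupError(f"init-input dataset(s) not found: {missing_init}")
    return "ok"


existing = {"toy_raw"}  # the repo never contained "toy_schema"
try:
    toy_generate(existing, regular_inputs=["toy_raw"], init_inputs=["toy_schema"])
except LookupError as err:
    message = str(err)
```

Here the initial step succeeds and the follow-up fails although the toy repo is unchanged throughout, which is why a message implying "this shouldn't be possible" would be misleading.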

natelust approved these changes Feb 14, 2023

TallJimbo added 6 commits February 14, 2023 15:16

Drop noisy logs in GraphBuilder.

0bb22d5

Even though these are debug-level, all of the other debug-level logging in GraphBuilder is much sparser.

Improve debug logging in GraphBuilder.

284536a

Add changelog entries.

cb16111

Add Python 3.11 to build matrix.

0107320

TallJimbo force-pushed the tickets/DM-37786 branch from f970cba to 0107320 February 14, 2023 20:16

TallJimbo merged commit e56e3bd into main Feb 15, 2023

TallJimbo deleted the tickets/DM-37786 branch February 15, 2023 03:57
