Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DM-31031: Dedicate the step2 subset to per-visit tasks #369

Merged
merged 1 commit into from
Jul 8, 2021
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Jump to
Jump to file
Failed to load files.
Diff view
Diff view
43 changes: 17 additions & 26 deletions pipelines/DRP.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -137,19 +137,27 @@ subsets:
- calibrate
- writeSourceTable
- transformSourceTable
description: >
Per-detector tasks that can be run together to start the DRP pipeline.

These should never be run with 'tract' or 'patch' as part of the data ID
expression if any later steps will also be run, because downstream steps
require full visits and 'tract' and 'patch' constraints will always
select partial visits that overlap that region.
step2:
subset:
- consolidateSourceTable
- consolidateVisitSummary
- skyCorr
- fgcmBuildStarsTable
- fgcmFitCycle
- fgcmOutputProducts
description: |
Tasks that can be run together to start the DRP pipeline.
description: >
Per-visit tasks that can be run together, but only after the 'step1'.

These should never be run with 'tract' or 'patch' as part of the data ID
expression if any later steps will also be run, because downstream steps
require full visits and 'tract' and 'patch' constraints will always
select partial visits that overlap that region.
expression. skyCorr and FGCM require full visits and 'tract' and 'patch'
constraints will always select partial visits that overlap that region.

This includes FGCM because it's configured here to run in "global" mode,
which means one should not use 'tract' expression to constrain it, and if
Expand All @@ -159,28 +167,11 @@ subsets:

This subset is considered a workaround for missing middleware and task
functionality. It may be removed in the future.
step2:
step3:
subset:
- jointcal
- makeWarp
- assembleCoadd
description: >
Tasks that can be run together, but only after the 'step1'.

These should be run with explicit 'tract' constraints essentially all the
time, because otherwise quanta will be created for jobs with only partial
visit coverage.

This subset cannot be run at the same time as the tasks in the 'step3'
at present because assembleCoadd has no good way to communicate to
downstream tasks that there is no coadd for a particular patch+band
combination because there no input data. Creating a new QuantumGraph
after all possible coadds have been built works around this.

This subset is considered a workaround for missing middleware and task
functionality. It may be removed in the future.
step3:
subset:
- detection
- mergeDetections
- deblend
Expand All @@ -196,9 +187,9 @@ subsets:
Tasks that can be run together, but only after the 'step1' and 'step2'
subsets.

Adding a 'tract' constraint to the data ID expression when running this
subset should have no effect on the result, if 'step2' was run with a
tract constraint.
These should be run with explicit 'tract' constraints essentially all the
time, because otherwise quanta will be created for jobs with only partial
visit coverage.

It is expected that many forcedPhotCcd quanta will "normally" fail when
running this subset, but this isn't a problem right now because there
Expand Down