DM-26140: Centralize Gen 3 pipeline configuration info for ap_verify datasets #139

kfindeisen · 2021-09-08T20:06:55Z

This PR adds instrument specializations of the ApVerify and ApVerifyWithFakes pipelines, and moves many of the previous pipeline overrides (in particular, the assumption of deep coadds) into individual datasets. It also modifies the behavior of the ap_verify.py --pipeline argument to more naturally support dataset-specific pipelines.
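For readers unfamiliar with the new `--pipeline` behavior, the fallback can be sketched roughly as below. This is only an illustration, not the ap_verify code: the function name `resolve_pipeline`, the constant `DEFAULT_PIPELINE_NAME`, and the assumption that a dataset keeps its pipelines under `<dataset_root>/pipelines/` are all hypothetical.

```python
from pathlib import Path
from typing import Optional

# Hypothetical names throughout; this is not the ap_verify implementation.
DEFAULT_PIPELINE_NAME = "ApVerify.yaml"


def resolve_pipeline(cli_pipeline: Optional[str], dataset_root: str) -> Path:
    """Pick the pipeline file to run.

    An explicit --pipeline value wins; otherwise fall back to the
    pipeline shipped with the dataset (assumed here to live under
    <dataset_root>/pipelines/).
    """
    if cli_pipeline is not None:
        return Path(cli_pipeline)
    return Path(dataset_root) / "pipelines" / DEFAULT_PIPELINE_NAME
```

With this shape, omitting `--pipeline` selects the dataset's own pipeline, while passing it still overrides the default.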

mrawls

As best I can tell these pipelines work how I'd expect, but I'm a bit lost in some of the details of the instrument-specific fakes pipelines. Please clarify, and make everything two-spaced indents while you're at it (sorry).

mrawls · 2021-09-29T00:34:36Z

pipelines/ApVerifyWithFakes.yaml

+            - isr
+            - characterizeImage
+            - calibrate
+            - visitFakes


I do think this needs to be here, but it is odd to me that it's not in the prepareFakes subset. Maybe a comment explaining?

kfindeisen · 2021-09-29

Just to make sure, is "this" just the visitFakes line? That probably is in the wrong place, because I misunderstood what that task is for.

natelust · 2021-10-05

I don't really know what each of these tasks is used for, but note that a label can be included in more than one subset; subsets do not have to be mutually exclusive. In DRP there is processCcd, but also Single frame, which includes even more. Subsets are just helpful organizers that let people run or refer to collections of tasks.

kfindeisen · 2021-10-06

I actually did mean to create a partition, because that's the organization that makes the most sense to me.

And on third(?) thought, I think visitFakes belongs in apPipeWithFakes (because it runs characterizeImage and calibrate on the data) but not in prepareFakes (because it requires partially processed raws whereas the other tasks can be done without them). Not ready to push yet, but here's what I have so far:

    apPipe:
      subset:
        - isr
        - characterizeImage
        - calibrate
        - imageDifferenceNoFakes
      description: >
        The AP pipeline without fakes.
        Only includes processing through image differencing.
    prepareFakes:
      subset:
        - createFakes
        - coaddFakes
      description: >
        Creation of fake sources.
    apPipeWithFakes:
      subset:
        - visitFakes  # characterizeImage and calibrate with fakes
        - imageDifference
        - transformDiaSrcCat
        - diaPipe
        - matchFakes
      description: >
        The AP pipeline with fakes.
        Requires apPipe and prepareFakes subsets.
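Since subsets are allowed to overlap but the intent here is a partition, a quick check like the following could confirm that no label appears in two subsets. It is a minimal sketch: the labels are copied from the draft above, while the helper `find_overlaps` is a hypothetical name and not part of any pipeline tooling.

```python
from itertools import combinations

# Labels copied from the draft subsets in this comment.
subsets = {
    "apPipe": {"isr", "characterizeImage", "calibrate", "imageDifferenceNoFakes"},
    "prepareFakes": {"createFakes", "coaddFakes"},
    "apPipeWithFakes": {
        "visitFakes",
        "imageDifference",
        "transformDiaSrcCat",
        "diaPipe",
        "matchFakes",
    },
}


def find_overlaps(subsets):
    """Return {(name_a, name_b): shared_labels} for each pair of subsets sharing a label."""
    overlaps = {}
    ordered = sorted(subsets.items(), key=lambda item: item[0])
    for (name_a, labels_a), (name_b, labels_b) in combinations(ordered, 2):
        shared = set(labels_a) & set(labels_b)
        if shared:
            overlaps[(name_a, name_b)] = shared
    return overlaps
```

An empty result means the subsets are pairwise disjoint; listing visitFakes under prepareFakes as well would be reported as an overlap between apPipeWithFakes and prepareFakes.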

pipelines/ApVerifyWithFakes.yaml

pipelines/DarkEnergyCamera/ApVerifyWithFakes.yaml

pipelines/HyperSuprimeCam/ApVerifyWithFakes.yaml

kfindeisen requested review from mrawls and natelust September 8, 2021 20:06

mrawls reviewed Sep 29, 2021

kfindeisen added 10 commits October 5, 2021 19:04

Add subsets to fakes pipeline.

9aa6a52

Add instrument specializations for ApVerify.yaml.

d4539c4

Support pipelines in dataset format.

4ac259b

Like configs, pipelines are copied to the workspace directory for ease of access and to allow reconstruction of exactly what was run.

Use dataset-specific pipeline by default.

cf93cdd

Remove hardcoded per-dataset config overrides from Gen 3.

3950683

Add instrument specializations for ApVerifyWithFakes.yaml.

801f904

Remove ApVerify pipelines using deep coadds by default.

2460118

The pipelines now inherit the task defaults (where technically possible), with deep coadds being requested only by the ap_verify datasets that use them.

Hard-code support for DECam crosstalk.

82cde94

This solution avoids the need to alter the ap_verify CLI or the ap_verify dataset framework, making it relatively easy to undo the workaround once DM-31492 is resolved.

Standardize pipelines on 2-space indent.

a2a6135

Use parameters to describe fakes-based pipeline datasets.

59d1a0e

kfindeisen force-pushed the tickets/DM-26140 branch from 7481bba to 59d1a0e Compare October 7, 2021 20:59

kfindeisen merged commit 2358f0d into master Oct 8, 2021

kfindeisen deleted the tickets/DM-26140 branch October 8, 2021 21:20

kfindeisen commented Sep 8, 2021

mrawls left a comment

mrawls Sep 29, 2021

kfindeisen Sep 29, 2021

natelust Oct 5, 2021

kfindeisen Oct 6, 2021
