
DM-11390: Plug prototype pipeline script into verify_ap framework #6

Merged
merged 11 commits into from Sep 22, 2017

Conversation

@mrawls (Collaborator) commented Sep 21, 2017

No description provided.

- Ensure each of the main functions returns just one metadata PropertySet
- Rename "ref_cats" to "refcats" throughout
- Return a dict rather than a bunch of lists from parsePipelineArgs
- Consistently refer to "dataset_root" instead of the ambiguous "dataset"
@kfindeisen (Member) left a comment:

My biggest concern is the number of ignored exceptions; please think about whether pushing forward really is appropriate in all cases. Otherwise a bunch of style comments.


# IN PROGRESS: figure out which of these ALL-CAPS VARIABLES are already known by
# ap_verify and which need to be explicitly provided in a function here in ap_pipe
kfindeisen (Member):

Not sure what "IN PROGRESS" means; I assume it's not a scheme to loophole your way past https://developer.lsst.io/coding/python_style_guide.html#to-do-comments-should-include-a-jira-issue-key?

kfindeisen (Member):

Also, please replace "ALL-CAPS VARIABLES" with "constants".

mrawls (Collaborator, Author):

Whoops, this comment should have been removed once this situation was solved.

Parameters
----------
dataset_root : `str`
The top-level directory containing all pieces of an ap_verify-style dataset.
kfindeisen (Member):

Copy-paste error.

types = ('*.fits', '*.fz')
datafiles = []
allcalibdatafiles = []
kfindeisen (Member):

Try to use readable_variable_names instead of unreadablevariablenames, here and elsewhere.
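A minimal sketch of the suggestion. The helper name `find_data_files` and its signature are hypothetical, not from the PR; only the patterns and the renamed list mirror the snippet above.

```python
import os
from glob import glob


def find_data_files(data_location, patterns=('*.fits', '*.fz')):
    """Collect FITS files under data_location (hypothetical helper).

    Uses readable snake_case names, per the review comment.
    """
    all_calib_data_files = []  # instead of `allcalibdatafiles`
    for pattern in patterns:
        all_calib_data_files.extend(glob(os.path.join(data_location, pattern)))
    return sorted(all_calib_data_files)
```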


def get_defectfiles(defect_location, DEFECT_TARBALL=DEFECT_TARBALL):
kfindeisen (Member):

Function parameters should be lowercase.

# Retrieve defect filenames from tarball
defectloc = os.path.join(args.dataset, DEFECT_DIR)
defect_tarfile_path = glob(os.path.join(defectloc, DEFECT_TARBALL))[0]
defect_tarfile_path = glob(os.path.join(defect_location, DEFECT_TARBALL))[0]
kfindeisen (Member):

What's the point of glob(...)[0]? Do you expect wildcards in the raw path?

mrawls (Collaborator, Author):

This is an idiosyncrasy of tarballs - using glob lists all the files in the tarball, and the zeroth entry is the name of the tarball itself.

kfindeisen (Member):

I meant why can't you just have os.path.join(defect_location, DEFECT_TARBALL), without the call to glob?

mrawls (Collaborator, Author):

Hey what do you know, that works! 🥇
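The resolved version might look like the following sketch. `defect_location` and the tarball constant come from the snippets above; the default filename and the `tarfile` listing of the function body are assumptions about the rest of the code, not the actual implementation.

```python
import os.path
import tarfile


def get_defectfiles(defect_location, defect_tarball='defects.tar.gz'):
    """Return the filenames inside the defect tarball (illustrative sketch)."""
    # The tarball path is fully determined, so no glob() is needed.
    defect_tarfile_path = os.path.join(defect_location, defect_tarball)
    with tarfile.open(defect_tarfile_path) as tarball:
        # Keep only file entries, skipping directory members.
        return [m.name for m in tarball.getmembers() if m.isfile()]
```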

config = ProcessCcdConfig()
config.load(OBS_DECAM_DIR + '/config/processCcd.py')
config.load(OBS_DECAM_DIR + '/config/processCcdCpIsr.py')
kfindeisen (Member):

Prefer os.path.join.
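Applying the suggestion to the snippet above. The config filenames come from the quoted code; the repository root path is purely illustrative.

```python
import os

# Illustrative stand-in for the obs_decam package location.
OBS_DECAM_DIR = '/opt/lsst/obs_decam'

# os.path.join avoids manual separator handling and stray slashes.
process_ccd_config = os.path.join(OBS_DECAM_DIR, 'config', 'processCcd.py')
cp_isr_config = os.path.join(OBS_DECAM_DIR, 'config', 'processCcdCpIsr.py')
```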

@@ -527,27 +604,73 @@ def doDiffIm(processed_repo, sciencevisit, ccdnum, templatevisit, diffim_repo):
config.doDecorrelation = True

TODO: use coadds as templates by default, not another visit (DM-11422).
Some of the comments in this function are placeholders for DM-11422 work.
kfindeisen (Member):

This line should probably not be in the documentation, since it's a statement about the source code rather than the behavior.

and catalogs of detected sources (diaSrc, diffexp, and metadata files)
'''
if os.path.exists(os.path.join(diffim_repo, 'deepDiff', 'v' + sciencevisit)):
print('DiffIm has already been run for visit {0}, skipping...'.format(sciencevisit))
lsst.log.configure()
kfindeisen (Member):

Redundant initialization

if 'visit' not in dataId_dict.keys():
raise RuntimeError('The dataId string is missing \'visit\'')
else: # save the visit number from the dataId
visit = dataId_dict['visit']
kfindeisen (Member):

Duplicate code with doProcessCcd, consider factoring.

mrawls (Collaborator, Author):

Point taken, but the duplication is minimal and dataId handling is less than ideal all around. Leaving it as-is for now.
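The suggested factoring could look like this. The helper name `parse_data_id` is hypothetical, and the `key=value` space-separated format is inferred from the snippets in this PR.

```python
def parse_data_id(data_id):
    """Split a dataId string like 'visit=410985 ccdnum=25' into a dict.

    Hypothetical shared helper for doProcessCcd and doDiffIm.
    """
    data_id_dict = dict(item.split('=', 1) for item in data_id.split())
    if 'visit' not in data_id_dict:
        raise RuntimeError("The dataId string is missing 'visit'")
    return data_id_dict
```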

config.detection.thresholdValue = 5.0
config.doDecorrelation = True
args = [processed_repo, '--id', 'visit=' + sciencevisit, 'ccdnum=' + ccdnum,
'--templateId', 'visit=' + templatevisit, '--output', diffim_repo]
dataId = dataId.split(' ')
kfindeisen (Member):

Why are you splitting on three characters above but only on spaces here?

mrawls (Collaborator, Author):

Unfortunately, dataId handling has to be different here because difference imaging uses parseAndRun while processing uses run. In the future, both Tasks may use run.

- The ref_cats directory in the ingested image repo must be 'ref_cats',
  not 'refcats' as in the ap_verify_hits2015 dataset and /datasets
- Changes to the obs_decam processCcd config include a hard-wired
  pan-starrs refcat for photometry and astrometry. Therefore:
  - Use run instead of parseAndRun, which requires a Butler but
    allows various configs to be set or overwritten sequentially
  - Pass dataId string as a dict when appropriate
@mrawls mrawls merged commit 3881aef into master Sep 22, 2017
@kfindeisen kfindeisen deleted the tickets/DM-11390 branch April 13, 2022 21:54