DM-10729: support jointcal/meas_mosaic outputs and add a new CmdLineTask driver #49

TallJimbo · 2017-07-24T17:57:29Z

This PR adds support for using jointcal/meas_mosaic outputs to the lower-level metric generation code (runOneFilter and below). I wasn't able to use the existing higher-level drivers for the tests I wanted to do, so I haven't tried to update those.

Instead, I've added a new CmdLineTask driver that delegates most of its work to runOneFilter, which makes it much easier to control the configuration and inputs from the command line. It's not really a complete CmdLineTask, because it doesn't use the Butler for its outputs (all of the code it calls just writes JSON directly). But IMO it's already a more convenient driver for manual execution of jobs, and I think it could be a useful starting point for making more of this code use the Task (and ultimately SuperTask) framework.

I've also found a number of small things that appear to be minor bugs (unused arguments, mostly).

wmwv

Could you add a test to ensure that the correct version of photometry (jointcal vs meas_mosaic/single_frame) is being read? The current code looks clear enough, but it would be really good to have a test to make sure that any future refactoring doesn't get this wrong.
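A minimal sketch of the kind of test being requested, assuming a stub Butler that records which dataset was asked for. Everything here is hypothetical: `RecordingButler`, `load_photometry`, and the dataset names `photoCalib` and `calexp_md` are placeholders, not the actual validate_drp code or the dataset types this PR reads.

```python
class RecordingButler:
    """Stub standing in for a Butler; it only records requested dataset names."""

    def __init__(self):
        self.requested = []

    def get(self, dataset, data_id):
        self.requested.append(dataset)
        return object()  # the payload is irrelevant to this check


def load_photometry(butler, data_id, use_joint_cal):
    """Hypothetical reader: pick the calibration dataset from the flag."""
    # Placeholder dataset names, assumed for illustration only.
    dataset = "photoCalib" if use_joint_cal else "calexp_md"
    return butler.get(dataset, data_id)


def datasets_read(use_joint_cal):
    """Run the reader against a fresh stub and report what it asked for."""
    butler = RecordingButler()
    load_photometry(butler, {"visit": 1, "ccd": 2}, use_joint_cal)
    return butler.requested
```

Asserting on the recorded dataset names, rather than on the returned values, keeps such a test independent of the data and sensitive only to which calibration path was taken.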

wmwv · 2017-07-25T20:38:37Z

python/lsst/validate/drp/matchreduce.py

+                # performance; support for other cameras is DM-6927.
+                oldSrc = butler.get('src', vId, flags=SOURCE_IO_NO_FOOTPRINTS)
+            except:
+                oldSrc = butler.get('src', vId)


Oh, right. That's why I didn't implement SOURCE_IO_NO_FOOTPRINTS in DM-5819.

What is the performance gain in using SOURCE_IO_NO_FOOTPRINTS? In memory footprint? In I/O time?

Big improvement in I/O time, mostly. It's a big relative change in memory consumption too, but I don't notice that as much.
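The fallback in the diff above uses a bare `except:`, which also swallows unrelated failures. A sketch of the same pattern with a narrowed exception follows. The thread does not say which exception the real Butler raises when a camera rejects `flags`, so `FlagsNotSupported`, `StubButler`, and `NO_FOOTPRINTS` are stand-ins invented for this example.

```python
class FlagsNotSupported(Exception):
    """Stand-in for whatever error is raised when `flags` is rejected."""


NO_FOOTPRINTS = 1  # placeholder for SOURCE_IO_NO_FOOTPRINTS


class StubButler:
    """Hypothetical Butler that may or may not accept the `flags` argument."""

    def __init__(self, supports_flags):
        self.supports_flags = supports_flags
        self.calls = []

    def get(self, dataset, data_id, **kwargs):
        self.calls.append((dataset, dict(kwargs)))
        if "flags" in kwargs and not self.supports_flags:
            raise FlagsNotSupported(dataset)
        # Footprints are loaded only when no flags were passed.
        return {"dataset": dataset, "footprints": "flags" not in kwargs}


def read_sources(butler, data_id):
    """Try the fast footprint-free read first, then fall back to a full read."""
    try:
        return butler.get("src", data_id, flags=NO_FOOTPRINTS)
    except FlagsNotSupported:
        # Only the expected failure triggers the slower full read.
        return butler.get("src", data_id)
```

With the narrowed `except`, a genuine error such as a missing data ID would propagate instead of silently triggering a second read.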

wmwv · 2017-07-25T20:47:02Z

python/lsst/validate/drp/matchreduce.py

+                calib = afwImage.Calib(calexpMetadata)
+
+            # We don't want to put this above the first "if useJointCal block"
+            # because we need to use it to quickly catch data IDs with no


"because we need to use the first butler.get above to quickly catch data IDs with no"

Resolve ambiguous "it".

wmwv

Looks good. Definitely worth merging. Two main questions in addition to the small comments:

Do you imagine eventually wanting to support several different calibration schemes? If so, would it be worth generalizing from useJointCal=True to calMethod='jointcal', calMethod='meas_mosaic', calMethod='cool_thing_dominique_figures_out_in_2020'? This would preserve the ability to read historical datasets by allowing more flexibility in the config instead of in the code.
If you can make it not require --output when it doesn't even use it, that would be a nice usability improvement, particularly to the degree that matchedVisitMetrics.py could serve as a model for making validateDrp.py a Task.
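One way the calMethod idea could look, sketched in plain Python rather than with the LSST config classes. The registry, the loader functions, and their return values are all hypothetical; only the method names `jointcal`, `meas_mosaic`, and `single_frame` come from this thread.

```python
# Hypothetical registry mapping a calMethod string to a loader function.
CALIBRATION_LOADERS = {}


def register_cal_method(name):
    """Decorator that registers a loader under the given calMethod name."""
    def decorator(func):
        CALIBRATION_LOADERS[name] = func
        return func
    return decorator


@register_cal_method("single_frame")
def load_single_frame(data_id):
    # Placeholder: a real loader would read the per-visit calibration.
    return {"method": "single_frame", "data_id": data_id}


@register_cal_method("jointcal")
def load_jointcal(data_id):
    # Placeholder: a real loader would read jointcal outputs.
    return {"method": "jointcal", "data_id": data_id}


@register_cal_method("meas_mosaic")
def load_meas_mosaic(data_id):
    # Placeholder: a real loader would read meas_mosaic outputs.
    return {"method": "meas_mosaic", "data_id": data_id}


def load_calibration(cal_method, data_id):
    """Dispatch on the configured method, failing loudly on unknown names."""
    try:
        loader = CALIBRATION_LOADERS[cal_method]
    except KeyError:
        known = ", ".join(sorted(CALIBRATION_LOADERS))
        raise ValueError(
            f"Unknown calMethod {cal_method!r}; expected one of: {known}"
        ) from None
    return loader(data_id)
```

Under this layout, supporting a new scheme means registering one more loader, and an old dataset stays readable by setting the matching string in the config.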

wmwv · 2017-08-02T16:54:30Z

python/lsst/validate/drp/matchedVisitMetricsTask.py

+    (config.outputPrefix).  Because the CmdLineTask machinery always creates an
+    output Butler repository, however, it is necessary to run this task with
+    both an output directory and an output prefix, with the former essentially
+    unused.


Having to specify an unused --output seemed annoying.
Can you override this requirement with a default set in the default config for MatchedVisitMetricsTask?

As per comment in JIRA issue, no - this is unfortunately baked into pipe_base, and is not controllable via config.

laurenam · 2017-08-02T20:05:03Z

python/lsst/validate/drp/matchedVisitMetricsTask.py

+
+    MatchedVisitMetricsTask is very much an incomplete CmdLineTask - it uses
+    the usual metchanisms to define its inputs and read them using a Butler,
+    but writes outputs manually to files with a configuration-defined prefix


Typo: metchanisms -> mechanisms

wmwv reviewed Jul 26, 2017

TallJimbo force-pushed the tickets/DM-10729 branch from 3c29ed3 to ca6100a July 31, 2017 16:21

wmwv approved these changes Aug 2, 2017

laurenam reviewed Aug 2, 2017

Use magErr instead of magerr in schemas

8a11ce4

This matches schema naming conventions and usage in PhotoCalib.

TallJimbo force-pushed the tickets/DM-10729 branch from ca6100a to cf02a2a August 11, 2017 14:28

TallJimbo added 8 commits August 11, 2017 13:16

Add support for calibrating with jointcal/meas_mosaic outputs

405aed4

Remove duplicate statement

ca56e79

Remove irrelevant arguments and actually use makeJson.

024744b

Allow butler to be passed in place of repo URL

a9213bd

Add CmdLineTask driver.

2e490c5

Remove unnecessary use of immediate kwarg to Butler.get.

386b65d

Ignore Footprints when reading SourceCatalogs

2d54e97

This should yield a significant speedup in I/O.

Ensure slot aliases are transferred to new Schema

9afb4a5

TallJimbo force-pushed the tickets/DM-10729 branch from cf02a2a to 9afb4a5 August 11, 2017 17:17

TallJimbo merged commit 4354f3d into master Aug 11, 2017

ktlim deleted the tickets/DM-10729 branch August 25, 2018 06:50
