DM-39842: Configurable exception handling in CalibrateImageTask #835

Merged
merged 2 commits from tickets/DM-39842 into main on Mar 22, 2024

Conversation

parejkoj (Contributor):

No description provided.

applied_photo_calib=None,
astrometry_matches=None,
photometry_matches=None,
)
Reviewer (Member):

I'm torn as to whether it's better to spell out the possible fields here with None values, vs. letting run be the sole source of truth on those. I think the QuantumContext changes in pipe_base should allow either to work.

parejkoj (author):

To simplify the call to annotate below (so it's not a mess of getattr), I need to at least specify the items that are passed to annotate. I don't like specifying a result list in both places (runQuantum and the top of run); I've changed it so run just makes an empty Struct (this caught a bug!), which is a bit better.
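
As a hedged sketch of what that arrangement looks like (the field names come from the snippet above; the class skeleton and call details are illustrative assumptions, not the verbatim pipe_tasks code):

import lsst.pipe.base as pipeBase


class CalibrateImageTask(pipeBase.PipelineTask):
    def runQuantum(self, butlerQC, inputRefs, outputRefs):
        inputs = butlerQC.get(inputRefs)
        # Pre-declare only the fields that the exception handler will
        # pass to annotate, so that handler needs no getattr probing;
        # run() itself now starts from an empty Struct.
        result = pipeBase.Struct(
            applied_photo_calib=None,
            astrometry_matches=None,
            photometry_matches=None,
        )
        result = self.run(result=result, **inputs)
        butlerQC.put(result, outputRefs)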

Reviewer (Contributor):

This feels like it could be fragile if we ever change this task to output more things in the result Struct. Is it not possible to construct a list of arguments to feed the error annotation by iterating over all the existing things in the Struct?

parejkoj (author):

Not without using a whole bunch of getattr and tests of whether those things have a metadata property. In this design, the PipelineTask author has to specifically choose which output datasets could get annotated on failure and pass them to annotate individually. It might be nice to just pass the result struct and let annotate figure it out, but that's probably more complicated than it's worth (not all datasets support metadata, e.g. AstropyArrow).
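
For illustration, a sketch of the introspective alternative being weighed here; the helper and the metadata filter are hypothetical, though Struct.getDict() is a real accessor:

def annotatable_outputs(result):
    # Hypothetical helper: pick out the Struct members that could carry
    # failure metadata; datasets without a metadata attribute (e.g.
    # Astropy tables) are skipped.
    return [
        item for item in result.getDict().values()
        if item is not None and hasattr(item, "metadata")
    ]

# which might then be used as (annotate signature assumed):
#     error = pipeBase.AnnotatedPartialOutputsError.annotate(
#         e, *annotatable_outputs(result), log=self.log)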


butlerQC.put(outputs, outputRefs)
# This should not happen with a properly configured execution context.
assert not inputs, "runQuantum got more inputs than expected"
Reviewer (Contributor):

If/when this falls over, will the error message specify that the runQuantum being talked about is the one from calibrateImage?

parejkoj (author):

Yes, the assert will give where it happened. This is a "this should absolutely never happen" error, hence the bare assert.

butlerQC.put(result, outputRefs)
raise error from e

butlerQC.put(result, outputRefs)
Reviewer (Contributor):

Should this be in a finally block?

parejkoj (author), Mar 21, 2024:

No. We only butler.put if the task succeeded or if an AlgorithmError was caught; all other errors can't be annotated, so writing partial outputs is unhelpful (and potentially harmful, as downstream tasks won't know that they're partial).
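
Put differently, the control flow around the two quoted put calls is roughly this (a sketch; the annotate arguments are assumed):

try:
    result = self.run(result=result, **inputs)
except pipeBase.AlgorithmError as e:
    error = pipeBase.AnnotatedPartialOutputsError.annotate(
        e, result.exposure, log=self.log
    )
    butlerQC.put(result, outputRefs)  # annotated partial outputs
    raise error from e                # the quantum still fails
# Deliberately no finally block: any other exception has already
# propagated past this point, so nothing is ever written un-annotated.
butlerQC.put(result, outputRefs)      # success path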


self._summarize(exposure, stars, background)
self._summarize(result.exposure, result.stars_footprints, result.background)
Reviewer (Contributor):

Why is this using result.stars_footprints and not result.stars ?

parejkoj (author):

ComputeExposureSummaryStatsTask takes a SourceCatalog. result.stars is just for the output. Eventually we'll start converting downstream tasks (like ipdiffim, maybe?) to take AstropyArrow input catalogs, but that's a long ways off.
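
For context, a hedged illustration of the two forms; SourceCatalog.asAstropy() is a real afw conversion, but whether the task builds stars exactly this way is an assumption:

# stars_footprints is an lsst.afw.table.SourceCatalog, which current
# subtasks such as ComputeExposureSummaryStatsTask consume directly:
self._summarize(result.exposure, result.stars_footprints, result.background)

# stars carries the same measurements as an Astropy table, used only
# for the output dataset:
result.stars = result.stars_footprints.asAstropy()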

Reviewer (Contributor):

Ah, so stars_footprints and stars have the same information but in different formats? Cute, OK, carry on.

parejkoj (author):

See the Connections here: https://github.com/lsst/pipe_tasks/blob/main/python/lsst/pipe/tasks/calibrateImage.py#L87

Eventually, the goal will be for stars_footprints to only contain the footprints and none of the catalog information, but that's a long ways off.

@@ -448,7 +471,7 @@ def run(self, *, exposures, id_generator=None):
result : `lsst.pipe.base.Struct`
Results as a struct with attributes:
Reviewer (Contributor):

Is the results Struct guaranteed to have all of the documented attributes, now?

parejkoj (author):

I'm not sure what you mean?

If the task completes, it returns result. If an exception is raised, the input parameter has (hopefully!) been modified, but nothing is returned.

parejkoj (author):

Oh, but apparently I never added docs for the result parameter: I'll do that now.

parejkoj (author):

Here's what I added:

        result : `lsst.pipe.base.Struct`, optional
            Result struct that is modified to allow saving of partial outputs
            for some failure conditions. If the task completes successfully,
            this is also returned.
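
Combined with the signature in the hunk above, the resulting contract is roughly this (a sketch, not the verbatim task code):

def run(self, *, exposures, result=None, id_generator=None):
    if result is None:
        result = pipeBase.Struct()
    # Each stage attaches its outputs to result as it completes, so if a
    # later stage raises, the caller's Struct already holds whatever
    # partial outputs were produced; the return is reached only on
    # success.
    ...
    return result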

tests/test_calibrateImage.py: outdated review thread, resolved.
parejkoj force-pushed the tickets/DM-39842 branch 2 times, most recently from ed3e79f to 3314006 (March 21, 2024 21:41).
Commits:
1. Use the new Task exceptions and a pre-defined results struct to manage partial outputs for failures of CalibrateImage subtasks. Add tests of runQuantum exception handling.
2. The input is now multiple `exposures`, so we can use that to simplify the code by renaming output_exposure -> exposure. This didn't actually affect the tests, but it's better to have two distinct exposures in the test butler.
parejkoj merged commit 88ce0b6 into main on Mar 22, 2024 (2 checks passed).
parejkoj deleted the tickets/DM-39842 branch on March 22, 2024 08:11.