DM-43516: Pass ObservationalIdentifiers to SingleCellCoadd #47

arunkannawadi · 2024-03-29T05:02:50Z

No description provided.

arunkannawadi · 2024-03-29T05:05:17Z

python/lsst/drp/tasks/assemble_cell_coadd.py

+                    for ccd_row in warp.getInfo().getCoaddInputs().ccds:
+                        if ccd_row.contains(cell_centers_sky[cellInfo.index]):
+                            break
+


I'm curious to know which of the two patterns is preferable. The former allows for certain assumptions, which had been useful in identifying existing bugs (that are getting fixed in DM-43515). The latter seems slightly more efficient in that the loop can break immediately after the corresponding detector has been found.

Given that there are very few detectors that could potentially overlap a warp, I think the first one's efficiency would be fine.

Or, rather, the big inefficiency here is that we're doing per-cell AST WCS calls, and we know from experience with our current coadds that that can be very slow. Ideally we would:

Make a Numpy array of cell centers in sky coordinates outside the loop over warps.

For each detector in each warp's coaddInputs.ccds, transform the sky cell centers to pixel coordinates (but outside of a loop over cells; this needs to be a vectorized skyToPixel call).

Compare those detector-coordinate positions to the detector bounding boxes.

To guard against WCSs that extrapolate really badly, make sure the cell-center positions on detectors that match round-trip back to the sky cell-center.

ExposureCatalog.subsetContaining and ExposureRecord.contains were written before we switched to AST and they're just fundamentally the wrong order of operations for dealing with its horrific non-vectorized performance.

That will probably be a fair bit uglier than what you have, but I think it's still the right thing to do.
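The steps above could be sketched roughly as follows. This is only an illustration with NumPy and stand-in callables, not the code that ended up in the task: `match_cells_to_detector`, `sky_to_pixel`, `pixel_to_sky`, the flat `bbox` tuple and the toy linear "WCS" are all hypothetical names and assumptions made for the example, and the round-trip check ignores RA wrap-around.

```python
import numpy as np


def match_cells_to_detector(cell_sky, sky_to_pixel, pixel_to_sky, bbox, tol=1e-6):
    """Return a boolean mask of cells whose centers land on one detector.

    cell_sky: (N, 2) array of cell-center sky coordinates, built once
        outside the loop over warps.
    sky_to_pixel, pixel_to_sky: vectorized callables mapping (N, 2) -> (N, 2);
        hypothetical stand-ins for array-based WCS transforms.
    bbox: (x_min, y_min, x_max, y_max) of the detector in pixel coordinates.
    tol: maximum allowed round-trip error, in the units of cell_sky.
    """
    pix = sky_to_pixel(cell_sky)  # one vectorized call for all cells
    x_min, y_min, x_max, y_max = bbox
    inside = (
        (pix[:, 0] >= x_min) & (pix[:, 0] <= x_max)
        & (pix[:, 1] >= y_min) & (pix[:, 1] <= y_max)
    )
    # Round-trip only the candidates to catch badly extrapolating WCSs.
    matched = np.zeros(len(cell_sky), dtype=bool)
    if inside.any():
        back = pixel_to_sky(pix[inside])
        err = np.abs(back - cell_sky[inside]).max(axis=1)
        matched[inside] = err <= tol
    return matched


# Toy linear transform standing in for a real WCS (assumption for the demo).
scale = 0.01
origin = np.array([10.0, -5.0])
to_pix = lambda sky: (sky - origin) / scale
to_sky = lambda pix: pix * scale + origin

cells = np.array([[10.5, -4.5], [12.0, -4.5], [10.1, -4.9]])
mask = match_cells_to_detector(cells, to_pix, to_sky, bbox=(0, 0, 100, 100))
```

In a real task the function would be called once per detector per warp, so the number of transform calls scales with the number of detectors rather than with the number of cells.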

> Make a Numpy array of cell centers in sky coordinates outside the loop over warps.

OK, we do this now but as a dict instead of an array.

> Compare those detector-coordinate positions to the detector bounding boxes.

We're going to have to do comparisons with polygons, since detectors aren't within rectangles anymore. I'll have to check that those comparisons are efficient as well.

But I'm going to create another ticket to improve the efficiency, since we are still very much limited by PSF evaluation per-cell.

> since detectors aren't within rectangles anymore.

That's why I suggested going all the way back to detector coordinates. But doing this in polygons on the sky or in coadd coordinates may well be more efficient if it avoids the reverse-transform to check for bad extrapolation.
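For the polygon variant, a vectorized containment test might look like the sketch below. It assumes a convex polygon with counter-clockwise vertices in a flat coordinate system (for example coadd pixels), and it is not the afw polygon API; `points_in_convex_polygon` is a hypothetical helper written for this illustration.

```python
import numpy as np


def points_in_convex_polygon(points, vertices):
    """Test many points against one convex polygon in a single pass.

    points: (N, 2) array of positions.
    vertices: (M, 2) array of polygon corners in counter-clockwise order.
    Returns an (N,) boolean array; points on an edge count as inside.
    """
    points = np.asarray(points, dtype=float)
    vertices = np.asarray(vertices, dtype=float)
    edges = np.roll(vertices, -1, axis=0) - vertices   # (M, 2) edge vectors
    rel = points[:, None, :] - vertices[None, :, :]    # (N, M, 2) offsets
    # z-component of edge x offset; non-negative means "left of the edge".
    cross = edges[None, :, 0] * rel[:, :, 1] - edges[None, :, 1] * rel[:, :, 0]
    return np.all(cross >= 0, axis=1)


# A rotated quadrilateral, as a warped detector footprint might appear.
footprint = [(0, 0), (4, 1), (5, 5), (1, 4)]
centers = [(2.0, 2.0), (4.5, 1.0), (6.0, 6.0)]
contained = points_in_convex_polygon(centers, footprint)
```

The second point lies inside the footprint's bounding box but outside the polygon itself, which is the case a rectangle-only comparison would get wrong.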

Ah, I misunderstood the coordinate system in your comment at first.

TallJimbo · 2024-03-29T16:55:40Z

python/lsst/drp/tasks/assemble_cell_coadd.py

+                    for ccd_row in warp.getInfo().getCoaddInputs().ccds:
+                        if ccd_row.contains(cell_centers_sky[cellInfo.index]):
+                            break
+


Given that there are very few detectors that could potentially overlap a warp, I think the first one's efficiency would be fine.

Or, rather, the big inefficiency here is that we're doing per-cell AST WCS calls, and we know from experience with our current coadds that that can be very slow. Ideally we would:

Make a Numpy array of cell centers in sky coordinates outside the loop over warps.

For each detector in each warp's coaddInputs.ccds, transform the sky cell centers to pixel coordinates (but outside of a loop over cells; this needs to be a vectorized skyToPixel call).

Compare those detector-coordinate positions to the detector bounding boxes.

To guard against WCSs that extrapolate really badly, make sure the cell-center positions on detectors that match round-trip back to the sky cell-center.

ExposureCatalog.subsetContaining and ExposureRecord.contains were written before we switched to AST and they're just fundamentally the wrong order of operations for dealing with its horrific non-vectorized performance.

That will probably be a fair bit uglier than what you have, but I think it's still the right thing to do.

TallJimbo · 2024-03-29T17:02:24Z

python/lsst/drp/tasks/assemble_cell_coadd.py

@@ -359,7 +403,7 @@ def run(self, inputWarps, skyInfo, **kwargs):
                outer=image_planes,
                psf=cell_coadd_psf.computeKernelImage(cell_coadd_psf.getAveragePosition()),
                inner_bbox=cellInfo.inner_bbox,
-                inputs=None,  # TODO
+                inputs=frozenset(observation_identifiers_gc[cellInfo.index]),


If we use a frozenset or set here we need to make sure we sort on save to avoid making the persisted order dependent on salted hashing. Might be better to remove duplicates then sort it and make a tuple right here.

Also, we really don't want to be using getAveragePosition two lines above. That is the average position of all the stars that went into the PSF model, and it's not necessarily going to be anywhere near this cell.

The PSF coaddition, including dropping the use of getAveragePosition, is coming next in DM-43515.

Do we care about the ordering here when persisting? I can see it mattering if we try to check whether two files are identical without even opening them, but is that something we expect to support?

I'm not sure we can make it all the way to full support, but I think it's good practice to minimize unnecessary differences.

Alright, in the other PR, I added a __lt__ operator for ObservationIdentifiers and sort the inputs in the __init__ method of SingleCellCoadd.

ObservationIdentifiers are sorted by their packed value.
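A minimal sketch of that idea, using simplified stand-ins rather than the real classes: `ObsId` and `Cell` below are hypothetical, and the way `packed` is derived from visit and detector is an assumption made only so the example has concrete numbers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ObsId:
    """Hypothetical, simplified stand-in for an observation identifier."""

    visit: int
    detector: int
    packed: int  # single integer encoding of (visit, detector); assumed here

    def __lt__(self, other):
        # Order by the packed integer so sorted() gives a stable sequence.
        if not isinstance(other, ObsId):
            return NotImplemented
        return self.packed < other.packed


class Cell:
    """Hypothetical container that normalizes its inputs on construction."""

    def __init__(self, inputs):
        # Drop duplicates, then sort, so the stored (and later persisted)
        # order does not depend on set iteration order.
        self.inputs = tuple(sorted(set(inputs)))


cell = Cell(
    [
        ObsId(visit=2, detector=1, packed=201),
        ObsId(visit=1, detector=5, packed=105),
        ObsId(visit=2, detector=1, packed=201),  # duplicate
    ]
)
```

Storing a sorted tuple means two runs over the same inputs produce the same sequence regardless of the order in which the warps were visited.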

arunkannawadi force-pushed the tickets/DM-43516 branch from 4884921 to 7402797 Compare March 29, 2024 05:18

arunkannawadi added 10 commits March 29, 2024 11:10

Fix typos in ConvertMultipleCellCoaddToExposure

5fcd024

Remove redundant runQuantum method

68732f9

Add a boolean option to skip ScaleZeroPoint

53a7f93

Make sub tasks only if needed

b9d2a05

Keep track of input observations

db9e791

Skip coadding cells with no input

e1a8ea6

Stop passing coaddName to InMemoryDatasetHandle

5db1692

since all kwargs get treated as part of the dataId.

Add missing parameters for makeDataRefList

36980ca

Make a proper dataId for inMemoryDatasetHandle

4f8a23a

Add a unit test covering visit_count

c1f3b87

arunkannawadi force-pushed the tickets/DM-43516 branch from 7408787 to c1f3b87 Compare March 29, 2024 15:11

arunkannawadi requested a review from TallJimbo March 29, 2024 15:13

arunkannawadi commented Mar 29, 2024

TallJimbo approved these changes Mar 29, 2024

arunkannawadi mentioned this pull request Mar 29, 2024

DM-43516: Pass ObservationIdentifiers constructed from warps to SingleCellCoadd objects lsst/cell_coadds#31

Merged

Check that inputs get sorted

07e4229

arunkannawadi merged commit 563af15 into main Apr 1, 2024
3 checks passed

arunkannawadi deleted the tickets/DM-43516 branch April 1, 2024 20:16
