DM-40400: Create total AP pipeline timing metric #216

kfindeisen · 2024-03-07T22:58:54Z

This PR adds the timing metric introduced on lsst/ap_pipe#168 to the standard runtime metrics for ApVerify. Like all timing metrics, it is not run on ApVerifyWithFakes.

Do unit tests pass (scons and/or stack-os-matrix)?
Did you run ap_verify.py on at least one of the standard datasets?
For changes to metrics, the print_metricvalues script from lsst.verify will be useful.
Is the Sphinx documentation up-to-date?

This metric calculates the total wall-clock time for the pipeline, assuming ISR is run first and DiaPipelineTask is run last. Since metric tasks may run at any point, some variance is expected.

parejkoj

One suggestion, otherwise this makes the use of the new metric quite clear.

parejkoj · 2024-03-08T21:18:16Z

pipelines/_ingredients/MetricsRuntime.yaml

+      connections.package: ap_pipe
+      connections.metric: ApPipelineTime
+      connections.labelStart: isr
+      connections.labelEnd: diaPipe


Maybe worth a comment here that diaPipe is what sends the alerts, and thus is the endpoint of our runtime requirement? That might not be obvious to someone reading, especially if our pipeline grows some post-diaPipe afterburners.

We may also implement some things in diaPipe after packageAlerts (writing detectorVisitProcessingSummary comes to mind) so it's always going to be an upper-bound on the runtime.

I don't think alerts are particularly relevant here. It's simply the last task in the "bare" AP pipeline (though quite a few metrics depend on diaPipe outputs).

To clarify, this metric is pretty much irrelevant for "our runtime requirement", because the batch processing environment is completely different from the prompt one. Based on my experiments with the three ap_verify datasets, a good chunk of the time measured by this metric is time spent running other data IDs' tasks.

Register ApPipelineTime metric with AP pipeline.

900b65d

This metric calculates the total wall-clock time for the pipeline, assuming ISR is run first and DiaPipelineTask is run last. Since metric tasks may run at any point, some variance is expected.

kfindeisen requested a review from parejkoj March 7, 2024 22:58

parejkoj approved these changes Mar 8, 2024

View reviewed changes

kfindeisen merged commit d40764c into main Mar 9, 2024
2 checks passed

kfindeisen deleted the tickets/DM-40400 branch March 9, 2024 01:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DM-40400: Create total AP pipeline timing metric #216

DM-40400: Create total AP pipeline timing metric #216

kfindeisen commented Mar 7, 2024

parejkoj left a comment

parejkoj Mar 8, 2024

kfindeisen Mar 8, 2024

kfindeisen Mar 8, 2024 •

edited

DM-40400: Create total AP pipeline timing metric #216

DM-40400: Create total AP pipeline timing metric #216

Conversation

kfindeisen commented Mar 7, 2024

parejkoj left a comment

Choose a reason for hiding this comment

parejkoj Mar 8, 2024

Choose a reason for hiding this comment

kfindeisen Mar 8, 2024

Choose a reason for hiding this comment

kfindeisen Mar 8, 2024 • edited

Choose a reason for hiding this comment

kfindeisen Mar 8, 2024 •

edited