DM-41543: Adds Reporting Exit Code Information to bps report #157

villarrealas · 2023-11-29T19:19:07Z

Preliminary draft version to run some things past Michelle, since it is my first significant contribution to the BPS code base.

Checklist

ran Jenkins
added a release note for user-visible changes to doc/changes

codecov · 2023-11-29T19:21:35Z

Codecov Report

Attention: 17 lines in your changes are missing coverage. Please review.

Comparison is base (4842456) 78.70% compared to head (717c01e) 78.60%.

❗ Current head 717c01e differs from pull request most recent head 85cbcea. Consider uploading reports for the commit 85cbcea to get more accurate results

Files	Patch %	Lines
python/lsst/ctrl/bps/bps_reports.py	68.75%	8 Missing and 2 partials ⚠️
python/lsst/ctrl/bps/report.py	25.00%	6 Missing ⚠️
python/lsst/ctrl/bps/drivers.py	50.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #157      +/-   ##
==========================================
- Coverage   78.70%   78.60%   -0.10%     
==========================================
  Files          40       40              
  Lines        3090     3151      +61     
  Branches      519      530      +11     
==========================================
+ Hits         2432     2477      +45     
- Misses        568      582      +14     
- Partials       90       92       +2


mxk62 · 2023-12-07T15:36:58Z

python/lsst/ctrl/bps/wms_service.py

-    """Job counts per label and per state.
+    """Job counts per label and per state."""
+
+    exit_code_summary: list[set] = None


I think this type description may be incorrect. Unless I got it wrong, it will be a list of lists (see lines 189, 192, 213, and 241 in ctrl_bps_panda/python/ctrl/bps/panda/panda_service.py). Also, it would be nice to have a type specification for the elements of these lists as well (list[list[int]]?).

However, to be perfectly honest, I'd prefer exit_code_summary to be a dictionary mapping job labels to lists/sets of their exit codes (similarly to job_summary, which maps job labels to job states and their counts). With a dictionary, client code doesn't need to worry whether the ordering of exit_code_summary corresponds to the ordering of job labels somewhere else.
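To make the suggestion concrete, here is a minimal sketch of the dictionary-based layout. The class name `RunReportSketch`, the job labels, and the exit codes are made up for illustration; this is not the actual class in wms_service.py, only a stand-in with the two attributes discussed above.

```python
from dataclasses import dataclass, field


@dataclass
class RunReportSketch:
    """Hypothetical stand-in for the real run report class."""

    # Job label -> {job state -> count}; states simplified to plain strings.
    job_summary: dict[str, dict[str, int]] = field(default_factory=dict)

    # Job label -> non-zero exit codes seen for jobs with that label.
    exit_code_summary: dict[str, list[int]] = field(default_factory=dict)


# Illustrative data only; labels and codes are invented.
report = RunReportSketch(
    job_summary={"isr": {"FAILED": 2, "SUCCEEDED": 8}, "calibrate": {"SUCCEEDED": 10}},
    exit_code_summary={"isr": [1, 137], "calibrate": []},
)

# Lookups are keyed by label, so no ordering has to be kept in sync
# with a separate list of labels.
isr_codes = report.exit_code_summary.get("isr", [])
```

With this shape, the report code can iterate over `exit_code_summary.items()` directly instead of zipping it against labels recovered from elsewhere.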

mxk62 · 2023-12-07T15:40:13Z

python/lsst/ctrl/bps/bps_reports.py

+    def __str__(self):
+        alignments = ["<"] + [">"] * (len(self._table.colnames) - 1)
+        lines = list(self._table.pformat_all(align=alignments))
+        # lines.insert(3, lines[1])


Please remove the line you commented out if you don't need it.

mxk62 · 2023-12-07T15:43:23Z

python/lsst/ctrl/bps/bps_reports.py

+        labels = []
+        if run_report.run_summary:
+            for part in run_report.run_summary.split(";"):
+                label, count = part.split(":")


Variable count doesn't seem to be used for anything. Please replace it with _. Before making this change, though, please see the comment in ctrl/bps/wms_service.py regarding the type of exit_code_summary. If you make the changes I'm suggesting there, this comment will likely be rendered moot.
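For reference, a small sketch of the loop with the unused variable replaced. It assumes, based only on the diff above, that run_summary is a string of `label:count` pairs separated by semicolons; the helper name `labels_from_run_summary` and the sample string are hypothetical.

```python
def labels_from_run_summary(run_summary: str) -> list[str]:
    """Extract job labels from a 'label:count;label:count' string.

    The string format is an assumption inferred from the diff under review.
    """
    labels = []
    if run_summary:
        for part in run_summary.split(";"):
            # The count is not needed here, so it is bound to "_".
            label, _ = part.split(":")
            labels.append(label)
    return labels


labels = labels_from_run_summary("isr:10;calibrate:8")
```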

mxk62 · 2023-12-07T18:05:16Z

python/lsst/ctrl/bps/wms_service.py

+
+    exit_code_summary: list[set] = None
+    """Summary of exit codes provided by the WMS if
+    available.


Can we use something a little bit more descriptive here, e.g. "Non-zero exit codes per job label if available."?

Also, if you don't use a dictionary for exit_code_summary as I suggested earlier, please include information about the expected ordering of the list.

mxk62 · 2023-12-07T18:22:43Z

doc/changes/DM-41543.feature.rst

@@ -0,0 +1 @@
+Introduced the `--return-exit-codes` flag to bps report, which provides a summary of exit code counts and exit codes for non-payload errors. This currently only works for PanDA.


I'm not really sure if we can use Markdown markup in these RST documents so just to be safe I would use double backquotes around --return-exit-codes.

mxk62 · 2023-12-08T22:42:11Z

python/lsst/ctrl/bps/report.py

+                ]
+                run_exits_report = ExitCodesReport(fields)
+                run_exits_report.add(run, use_global_id=is_global)
+                print(run_exits_report)


Can we add print("\n") before this line so the job report and the exit code report are separated by a blank line if the latter is displayed?

mxk62 · 2023-12-09T00:01:59Z

python/lsst/ctrl/bps/bps_reports.py

+        exit_code_summary = run_report.exit_code_summary
+        for label, exit_codes in zip(labels, exit_code_summary):
+            if exit_codes != [None]:
+                pipe_error_count = sum([code for code in exit_codes if code == 1])


Maybe it would be worthwhile adding a comment that we're taking advantage of the fact that payloads always return 1 on failure here (right?).
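Something along these lines is what I have in mind; treat it as a sketch only. The function name `summarize_exit_codes` and the sample codes are hypothetical, and it assumes payload failures always exit with code 1. Note that summing the filtered codes, as the diff does, equals the number of failures only because the value being summed is 1; counting explicitly states the intent.

```python
def summarize_exit_codes(exit_codes):
    """Split exit codes into a payload error count and other non-zero codes.

    Assumption: payload failures always return exit code 1, so any other
    non-zero code is treated as a non-payload error.
    """
    # Count the payload failures explicitly rather than summing the codes.
    payload_error_count = sum(1 for code in exit_codes if code == 1)

    # Unique non-payload codes, ignoring successes (0) and missing values.
    other_codes = sorted({code for code in exit_codes if code not in (0, 1, None)})
    return payload_error_count, other_codes


result = summarize_exit_codes([1, 1, 137, 1, 2])
```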

timj mentioned this pull request Nov 29, 2023

DM-41543: Add Hooks for PanDA Exit Code Information lsst/ctrl_bps_panda#61

Merged

2 tasks

villarrealas force-pushed the tickets/DM-41543 branch 3 times, most recently from 57e7d28 to 717c01e Compare December 6, 2023 20:04

mxk62 approved these changes Dec 9, 2023


added exit code summary functionality to bps report

85cbcea

villarrealas force-pushed the tickets/DM-41543 branch from a9cad42 to 85cbcea Compare December 11, 2023 23:04

villarrealas merged commit 938ed54 into main Dec 11, 2023
11 checks passed

villarrealas deleted the tickets/DM-41543 branch December 11, 2023 23:53
