DM-40715: move resource-usage summary tasks from analysis_drp #143

TallJimbo · 2023-09-09T20:11:14Z

No description provided.

natelust · 2023-10-16T16:08:28Z

python/lsst/analysis/tools/tasks/gatherResourceUsage.py

+        base_dataset_type_filter = re.compile(r"\w+_metadata")
+        input_dataset_types: Any
+        if not dataset_type_names:
+            input_dataset_types = base_dataset_type_filter


I would more the re compile into this block

natelust · 2023-10-18T14:31:48Z

python/lsst/analysis/tools/tasks/gatherResourceUsage.py

+        # Start by querying for metadata datasets, since we'll need to know
+        # which dataset types exist in the input collections in order to
+        # build the pipeline.
+        base_dataset_type_filter = re.compile(r"\w+_metadata")


For organizational / readability reasons, consider moving much of this logic out of init into a method/function that is called by init.

I've moved the logic inside the single biggest loop into a separate method; everything else was too intertwined for that to work, as I didn't want to end up with methods that expected the class to be in different stages of constructed-ness or use a lot of output parameters.

natelust · 2023-10-20T13:57:42Z

python/lsst/analysis/tools/tasks/gatherResourceUsage.py

+            default="",
+            help="Data ID expression used when querying for input metadata datasets.",
+        )
+        parser.add_argument(


Its not entirely clear to me what happens when both output and output-run are specified

The same as usual with pipetask run: the given output-run name is used instead of creating one by appending a timestamp.

natelust · 2023-10-20T13:58:34Z

python/lsst/analysis/tools/tasks/gatherResourceUsage.py

+            type=str,
+            action="extend",
+            help=(
+                "Glob-style patterns for input metadata dataset types.  If a pattern matches a "


I think the wording of the second sentence is unclear, or convoluted.

I've just removed the second sentence as I don't think it adds anything, and I'm not sure what it was supposed to mean either.

Logic has not changed, but some type annotations have been adjusted since analysis_tools actually checks those.

Inheriting from the new QuantumGraphBuilder base class is a significant simplification, and it avoids a dependency on soon-to-be deprecated classes like TaskDef and PipelineDatasetTypes.

TallJimbo force-pushed the tickets/DM-40715 branch 2 times, most recently from bf4eacd to 66eec3a Compare September 9, 2023 20:20

TallJimbo marked this pull request as ready for review September 9, 2023 20:24

TallJimbo force-pushed the tickets/DM-40715 branch 3 times, most recently from a21c9e3 to 6a253d3 Compare September 12, 2023 15:31

natelust approved these changes Oct 20, 2023

View reviewed changes

TallJimbo force-pushed the tickets/DM-40715 branch 3 times, most recently from abe5729 to 0de5e69 Compare October 23, 2023 20:58

TallJimbo added 6 commits November 3, 2023 14:02

Move gather-resource-usage module from analysis_drp.

4d98758

Logic has not changed, but some type annotations have been adjusted since analysis_tools actually checks those.

Fix some preexisting documentation build problems.

8335c01

Add tasks subpackage doc to reference documentation.

fb95acb

Rewrite QG generation for resource usage tasks.

ba6ac32

Inheriting from the new QuantumGraphBuilder base class is a significant simplification, and it avoids a dependency on soon-to-be deprecated classes like TaskDef and PipelineDatasetTypes.

Add resource-usage QG generation script to docs.

ac4cdcf

Address review comments.

1425926

TallJimbo force-pushed the tickets/DM-40715 branch from 0de5e69 to 1425926 Compare November 3, 2023 18:03

TallJimbo merged commit 877779b into main Nov 3, 2023
8 checks passed

TallJimbo deleted the tickets/DM-40715 branch November 3, 2023 22:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DM-40715: move resource-usage summary tasks from analysis_drp #143

DM-40715: move resource-usage summary tasks from analysis_drp #143

TallJimbo commented Sep 9, 2023

natelust Oct 16, 2023

natelust Oct 18, 2023

TallJimbo Oct 23, 2023

natelust Oct 20, 2023

TallJimbo Oct 23, 2023

natelust Oct 20, 2023

TallJimbo Oct 23, 2023

DM-40715: move resource-usage summary tasks from analysis_drp #143

DM-40715: move resource-usage summary tasks from analysis_drp #143

Conversation

TallJimbo commented Sep 9, 2023

natelust Oct 16, 2023

Choose a reason for hiding this comment

natelust Oct 18, 2023

Choose a reason for hiding this comment

TallJimbo Oct 23, 2023

Choose a reason for hiding this comment

natelust Oct 20, 2023

Choose a reason for hiding this comment

TallJimbo Oct 23, 2023

Choose a reason for hiding this comment

natelust Oct 20, 2023

Choose a reason for hiding this comment

TallJimbo Oct 23, 2023

Choose a reason for hiding this comment