Implemented a single qgraph file per workflow #30

SergeyPod · 2021-05-26T20:41:10Z

No description provided.

MichelleGower

Shouldn't the example yaml be updated as well (or another example written if PanDA plugin supports both small job files as well as the new single run file)?

Some questions or recommendations for minor changes. As well as a couple comments that the code will break with future changes, but we agreed it was ok for this ticket.

MichelleGower · 2021-05-27T16:56:04Z

python/lsst/ctrl/bps/wms/panda/edgenode/sw_runner

@@ -2,6 +2,7 @@
 # This is a starter script needed to initialize basic Rubin software environment inside the container and execute
 # the actual command line after decoding from hexed strings
 cd /tmp;
+ls /;


Just double checking that this was intended to be committed (as opposed to accidental committing of a temporary debugging line).

Right, this line is removed.

MichelleGower · 2021-05-27T16:58:57Z

python/lsst/ctrl/bps/wms/panda/idds_tasks.py

@@ -1,6 +1,7 @@
 """

 """
+import ntpath


Why ntpath instead of os.path or pathlib?

MichelleGower · 2021-05-27T17:05:49Z

python/lsst/ctrl/bps/wms/panda/idds_tasks.py

@@ -128,17 +130,20 @@ def add_dependencies(self, tasks, tasks_dependency_map):
        """
        for task in tasks:
            jobs = tasks_dependency_map[task.step]
-            dependencies = []
+            task_dependencies = []


Why not just use task.dependencies?

MichelleGower · 2021-05-27T17:09:56Z

python/lsst/ctrl/bps/wms/panda/idds_tasks.py

+            dependency_map.setdefault(self.get_pseudo_input_file_name(edge[1]), []).\
+                append(self.get_pseudo_input_file_name(edge[0]))
+            self.jobs_steps[self.get_pseudo_input_file_name(edge[1])] = \
+                self.bps_workflow.nodes.get(edge[1]).get('job').label


I didn't quite follow this code, but wondering if this could be self.bps_workflow.get_job(edge[1]).label

Thanks for the tip. Fixed.

MichelleGower · 2021-05-27T17:10:22Z

python/lsst/ctrl/bps/wms/panda/idds_tasks.py

-            dependency_map.setdefault(node, [])
+            dependency_map.setdefault(self.get_pseudo_input_file_name(node), [])
+            self.jobs_steps[self.get_pseudo_input_file_name(node)] = \
+                self.bps_workflow.nodes.get(node).get('job').label


Same question about get_job.

MichelleGower · 2021-05-27T17:14:36Z

python/lsst/ctrl/bps/wms/panda/idds_tasks.py

+        qgraph_node_ids = self.bps_workflow.nodes.get(jobname).get("job").qgraph_node_ids
+        if qgraph_node_ids:
+            pseudo_input_file_name = self.qgraph_file+"+"+jobname + "+" + qgraph_node_ids[0].buildId + \
+                "+" + str(qgraph_node_ids[0].number)


We agreed this was fine for this ticket, but just pointing it out for future. By limiting the node_ids to a single value, this code works today but definitely will not work as soon as a single GenericWorkflow job executes more than one Quantum.

MichelleGower · 2021-05-27T17:29:59Z

python/lsst/ctrl/bps/wms/panda/panda_service.py

@@ -196,8 +201,7 @@ def from_generic_workflow(cls, config, generic_workflow, out_prefix, service_cla
        idds_workflow.generated_tasks = workflow_generator.define_tasks()
        cloud_prefix = config['bucket'] + '/' + \
            config['payload_folder'] + '/' + config['workflowName'] + '/'
-        for task in idds_workflow.generated_tasks:
-            cls.copy_pickles_into_cloud(task.local_pfns, cloud_prefix)
+        cls.copy_pickles_into_cloud([config['bps_defined']['run_qgraph_file']], cloud_prefix)


This is very fragile code as opposed to checking the inputs for the jobs. I expect that this section will be revisited in the upcoming execution butler work. We should talk then to figure out if there are GenericWorkflow issues that need to be fixed so that this can be written more generically.

MichelleGower · 2021-05-27T17:37:49Z

python/lsst/ctrl/bps/wms/panda/idds_tasks.py

@@ -225,3 +234,12 @@ def get_input_file(self, job_name):
        quantum graph file name
        """
        return next(iter(self.bps_workflow.nodes.get(job_name).get("inputs")))
+
+    def get_pseudo_input_file_name(self, jobname):


"get" sounds like a lookup. Perhaps change to something like "create"

Also, method needs docstring.

Docstring added, function name updated.

MichelleGower · 2021-05-27T17:41:20Z

python/lsst/ctrl/bps/wms/panda/edgenode/cmd_line_decoder.py

@@ -7,5 +7,9 @@
 import sys
 import binascii
 cmdline = str(binascii.unhexlify(sys.argv[1]).decode())
-cmdline = cmdline.replace("${IN/L}", sys.argv[2])
+dataparams = sys.argv[2].split(":")


Do dataparams come from the pseudo input filename? If yes, I didn't follow why ":" here, but "+" in get_pseudo_input_file_name(). Could use a couple short comments to make code easier to understand.

Implemented a single qgraph file per workflow

510a50e

SergeyPod requested a review from MichelleGower May 26, 2021 20:41

SergeyPod added 2 commits May 26, 2021 16:56

Code polish to pass PEP-8 check

202b4cc

Code polish to pass PEP-8 check

e007f7c

MichelleGower approved these changes May 27, 2021

View reviewed changes

Implemented items discussed in the review

8306c6c

SergeyPod merged commit e167350 into master May 28, 2021

SergeyPod deleted the tickets/DM-30350 branch May 28, 2021 19:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implemented a single qgraph file per workflow #30

Implemented a single qgraph file per workflow #30

SergeyPod commented May 26, 2021

MichelleGower left a comment

MichelleGower May 27, 2021

SergeyPod May 28, 2021

MichelleGower May 27, 2021

SergeyPod May 28, 2021

MichelleGower May 27, 2021

SergeyPod May 28, 2021

MichelleGower May 27, 2021

SergeyPod May 28, 2021

MichelleGower May 27, 2021

SergeyPod May 28, 2021

MichelleGower May 27, 2021

MichelleGower May 27, 2021

MichelleGower May 27, 2021

MichelleGower May 27, 2021

SergeyPod May 28, 2021

MichelleGower May 27, 2021

SergeyPod May 28, 2021

Implemented a single qgraph file per workflow #30

Implemented a single qgraph file per workflow #30

Conversation

SergeyPod commented May 26, 2021

MichelleGower left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment