
Pass combined artifacts from nested workflows into downstream nodes #12223

Merged
merged 9 commits into from
Jun 23, 2022

Conversation

AlanCoding (Member)

SUMMARY

Connect #4481

The issue is a reasonable request, and I am trying not to overthink the problem.

As of putting up this PR, I am doing this without a schema change, which trades some extra processing cost for lower memory use. Because of that, I've tried to write the get_combined_artifacts method relatively efficiently, although you can still construct cases where it will slog.

ISSUE TYPE
  • Feature Pull Request
COMPONENT NAME
  • API
AWX VERSION
21.0.0
ADDITIONAL INFORMATION

I've written two test cases:

  • as the issue describes, a workflow with (sliced job) --> (receiving job), verifying that the receiving job gets the artifacts
  • a workflow inside a workflow with a normal job, and the receiving job downstream in the outer workflow, verifying that the receiving job gets the artifacts

Both test cases fail on devel, where the receiving job does not get the artifacts.

@AlanCoding AlanCoding marked this pull request as ready for review May 12, 2022 15:10
@AlanCoding (Member Author)

Test results look good, ready for review.

@AlanCoding (Member Author)

Having thought about it, I realize I do need to put in some recursion protection.
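The recursion protection mentioned above can be sketched like this (a hypothetical standalone version, using plain dicts instead of the real UnifiedJob models): thread a set of already-visited ids through the recursive calls, so a revisited node stops the walk instead of recursing forever.

```python
# Hypothetical sketch of recursion protection with a visited set, mirroring
# the parents_set idea in the diff below. Nodes are plain dicts here, not
# the real AWX models.
def get_effective_artifacts(node, parents_set=None):
    if parents_set is None:
        parents_set = set()
    if node["id"] in parents_set:
        return {}  # already visited; stop here to avoid infinite recursion
    new_parents_set = parents_set | {node["id"]}
    artifacts = dict(node.get("artifacts", {}))
    for child in node.get("children", []):
        # Later children overwrite earlier keys via dict.update()
        artifacts.update(get_effective_artifacts(child, parents_set=new_parents_set))
    return artifacts
```

Even on a structure with a cycle, the visited set guarantees each node contributes its artifacts at most once.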

@@ -57,6 +57,20 @@
logger = logging.getLogger('awx.main.models.workflow')


def get_artifacts_from_unified_job(job):
Contributor

Instead of this standalone function, why not just have different implementations of .get_combined_artifacts() on the different UnifiedJob subtypes?

Member Author

Took me a while to process, but I wound up going with exactly this suggestion. That allowed getting rid of this method altogether.

job_queryset = (
UnifiedJob.objects.filter(unified_job_node__workflow_job=self)
.defer('job_args', 'job_cwd', 'start_args', 'result_traceback')
.order_by('status', 'finished', 'id')
Contributor

I'm a bit iffy on ordering by status. This will wind up being alphabetical, which feels like it might have unintended consequences.

Member Author

haha, I was kind of fishing for a comment like this. The status (s)uccessful sorts after (f)ailed and (e)rror, so the successful jobs' artifacts take precedence.

In practice, I'm iffy on whether error or failed jobs should contribute artifacts in the first place. I'm still unsure what the best thing to do here is.
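The precedence trick can be illustrated with made-up job data: alphabetically, 'error' < 'failed' < 'successful', so successful jobs are processed last and their artifacts win each dict.update() collision.

```python
# Illustration (with invented job data, not the real queryset) of why
# ordering by status makes successful jobs' artifacts take precedence:
# they sort last alphabetically, so their update() happens last.
jobs = [
    {"status": "successful", "artifacts": {"key": "from-successful"}},
    {"status": "error", "artifacts": {"key": "from-error"}},
    {"status": "failed", "artifacts": {"key": "from-failed"}},
]
combined = {}
for job in sorted(jobs, key=lambda j: j["status"]):
    combined.update(job["artifacts"])
# combined["key"] is "from-successful"
```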

Contributor

.filter(status='successful') instead, then?

This brings to mind a thing that I was enthusiastic about when I noticed it early on in the Django upgrades, but that got buried under the weight of dealing with the difficult bits: Django 3 supports actual enumerated choices for choice fields. https://docs.djangoproject.com/en/3.2/ref/models/fields/#enumeration-types

It'd be nice to have these, and avoid having to hard-code choice strings. Not urgent, just kind of pleasant.
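The enumeration idea can be approximated with the stdlib alone; Django's models.TextChoices is essentially a str-based enum like this sketch (member names and values here are assumptions, not AWX's actual status list):

```python
from enum import Enum

class JobStatus(str, Enum):
    """Stdlib approximation of Django's models.TextChoices: members compare
    equal to their string values, so a filter on JobStatus.SUCCESSFUL would
    still match rows stored as the plain string 'successful'."""
    ERROR = "error"
    FAILED = "failed"
    SUCCESSFUL = "successful"

# No more hard-coded 'successful' strings scattered through the code:
assert JobStatus.SUCCESSFUL == "successful"
```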

)
if parents_set is None:
parents_set = set()
new_parents_set = parents_set | set([self.id])
Contributor

I'd like to start getting away from Python 2-isms like set([...]).
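For reference, the modern literal the reviewer prefers builds the same set without the intermediate list:

```python
# set([...]) is the older Python 2-era call form; the set literal {...}
# (Python 2.7+/3) produces an equal set without building a throwaway list.
workflow_job_id = 42  # placeholder value for illustration
old_style = set([workflow_job_id])
new_style = {workflow_job_id}
assert old_style == new_style
```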

Contributor

Is it even possible to have a cycle in the jobs? The job templates, yes, I could see that potentially happening, but even if the same JT is used in multiple places inside a nested workflow won't it just spawn distinct new jobs (thus having new pks)?

Member Author

No, as structured I don't think it would have any cycles. The map of workflow jobs would always be a tree, since a workflow job node can't (in practice) link to the same workflow job as another job.

I kept coming back to questioning whether we should track and skip duplicate workflow job templates, which is a much stronger criterion. A workflow can use the same WFJT inside of it multiple times, or it can be nested deeper. Skipping duplicates would cut down on queries, but I'm not sure about it, because multiple runs of the same WFJT could give different artifacts that users might actually want combined.

@@ -682,6 +681,28 @@ def get_ancestor_workflows(self):
wj = wj.get_workflow_job()
return ancestors

def get_effective_artifacts(self, parents_set=None):
Contributor

Can we just make this signature (self, **kwargs) as well, and get parents_set out of the kwargs dict? It will make things just a bit nicer if we wind up adding more things later on.
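The suggested signature might look like this sketch (a minimal stand-in class, not the merged code):

```python
class WorkflowJob:
    """Minimal stand-in class; only the signature pattern matters here."""

    def get_effective_artifacts(self, **kwargs):
        # Pull known options out of kwargs; future options can be added
        # without touching every subclass override's signature.
        parents_set = kwargs.get("parents_set") or set()
        return parents_set

wj = WorkflowJob()
```

Callers that pass nothing get the empty-set default, while `parents_set` still flows through by keyword.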

@@ -318,8 +318,7 @@ def get_job_kwargs(self):
for parent_node in self.get_parent_nodes():
is_root_node = False
aa_dict.update(parent_node.ancestor_artifacts)
if parent_node.job and hasattr(parent_node.job, 'artifacts'):
aa_dict.update(parent_node.job.artifacts)
aa_dict.update(parent_node.job.get_effective_artifacts(parents_set=set([self.workflow_job_id])))
Contributor

👀

I need to think through this. This is the part that I worry is going to cause a bunch of repeated queries.

Contributor

Never mind, I think this isn't a problem.

I still dislike set([...]), though.
