
fix: make sure taskresult completed when mark node succeed when it has outputs #12537

Merged: 28 commits into argoproj:main, Jan 28, 2024

Conversation

@shuangkun (Member) commented Jan 17, 2024

When my cluster is running a large number of workflows, I see errors like this:

="Mark error node" error="failed to evaluate expression: cannot fetch steps-init-artifact from <nil> (1:6)\n | steps['init-artifact'].outputs.parameters['workflow_artifact_key']\n | .....^" namespace=argo nodeName="workflow-bhr9k[3].energy(0:0)[1].energy-steps(0:0)[3].comp-binding-energy-steps(0:0)[15]" workflow=workflow-bhr9k

When the number of workflows is not large, there is no such error.

My workflow has many templates like this, where each step refers to the outputs of the previous step. For example, hello2a refers to hello1 via the parameter steps['hello1'].outputs.parameters['workflow_artifact_key']:

apiVersion: argoproj.io/v1alpha1
kind: Workflow
metadata:
  generateName: steps-
spec:
  entrypoint: hello-hello-hello
  arguments:
    parameters:
    - name: message1
      value: hello world
    - name: message2
      value: foobar
  # This spec contains two templates: hello-hello-hello and whalesay
  templates:
  - name: hello-hello-hello
    # Instead of just running a container
    # This template has a sequence of steps
    steps:
    - - name: hello1            # hello1 is run before the following steps
        continueOn: {}
        template: whalesay
        arguments:
          parameters:
          - name: message
            value: "hello1"
          - name: workflow_artifact_key
            value: "{{ workflow.parameters.message2}}"
    - - name: hello2a           # double dash => run after previous step
        template: whalesay
        arguments:
          parameters:
          - name: message
            value: "{{=steps['hello1'].outputs.parameters['workflow_artifact_key']}}"

  # This is the same template as from the previous example
  - name: whalesay
    metadata:
      annotations:
        k8s.aliyun.com/eci-spot-strategy: "SpotAsPriceGo"
    inputs:
      parameters:
      - name: message
    outputs:
      parameters:
      - name: workflow_artifact_key
        value: '{{workflow.name}}'
    script:
      image: python:alpine3.6
      command: [python]
      source: |
        import random
        i = random.randint(1, 100)
        print(i)

When I searched the logs, I found that the previous step's (hello1's) "node changed" to Succeeded came earlier than its "task-result changed". This is what causes hello2a's expression evaluation error. So I want to make sure the taskresult is completed before marking a node succeeded when it has outputs.
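To illustrate the idea, here is a minimal, self-contained sketch of the gating logic in Go (toy types and names only; the real controller works on wfv1.NodeStatus, WorkflowTaskResults, and woc.operate(), which are not reproduced here): a node whose template declares outputs is kept Running until its task result is marked complete, and is re-assessed on the next pass.

package main

import "fmt"

// Toy stand-ins for the controller's node phases.
type nodePhase string

const (
	nodeRunning   nodePhase = "Running"
	nodeSucceeded nodePhase = "Succeeded"
)

// workflowStatus mimics the completion map the controller keeps:
// node ID -> whether the node's WorkflowTaskResult has been reconciled.
type workflowStatus struct {
	taskResultsCompletionStatus map[string]bool
}

func (s *workflowStatus) isTaskResultIncomplete(nodeID string) bool {
	completed, found := s.taskResultsCompletionStatus[nodeID]
	return found && !completed
}

// assessNodePhase applies the gate this PR adds: a pod that succeeded is
// not marked Succeeded while its template has outputs whose task result
// is still incomplete; the node stays Running and will be re-assessed.
func assessNodePhase(hasOutputs bool, status *workflowStatus, nodeID string, podPhase nodePhase) nodePhase {
	if podPhase == nodeSucceeded && hasOutputs && status.isTaskResultIncomplete(nodeID) {
		fmt.Printf("nodeID=%s: task result not yet completed, keeping node running\n", nodeID)
		return nodeRunning
	}
	return podPhase
}

func main() {
	status := &workflowStatus{taskResultsCompletionStatus: map[string]bool{"hello1": false}}
	fmt.Println(assessNodePhase(true, status, "hello1", nodeSucceeded)) // Running
	status.taskResultsCompletionStatus["hello1"] = true
	fmt.Println(assessNodePhase(true, status, "hello1", nodeSucceeded)) // Succeeded
}

With a gate like this, hello2a's steps['hello1'] expression is only evaluated after hello1's task result has been reconciled, so the <nil> lookup in the log above can no longer occur.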

@shuangkun shuangkun marked this pull request as draft January 17, 2024 15:55
@shuangkun shuangkun marked this pull request as ready for review January 17, 2024 16:59
@agilgur5 agilgur5 added the area/controller label Jan 17, 2024
@shuangkun shuangkun marked this pull request as draft January 18, 2024 00:46
@shuangkun shuangkun marked this pull request as ready for review January 21, 2024 07:39
@shuangkun shuangkun changed the title to "fix: make sure taskresult completed when mark node succeed when it has outputs" Jan 21, 2024
@juliev0 juliev0 self-assigned this Jan 21, 2024
@juliev0 (Contributor) commented Jan 22, 2024

So, it sounds like previously the execution of a Workflow was allowed to continue even if the previous Step's Outputs weren't reconciled? Are you essentially preventing the next Step from running yet in that case?

@shuangkun (Member, Author):
> So, it sounds like previously the execution of a Workflow was allowed to continue even if the previous Step's Outputs weren't reconciled? Are you essentially preventing the next Step from running yet in that case?

Yes, I want to prevent the next step from running.

@shuangkun (Member, Author):
Yes, I think this was the case before. Under normal circumstances the outputs are processed before the pod status, because the taskresult resource is indeed created earlier. But at large scale, under high load, the two events can arrive at the API server in a different order.

@juliev0 (Contributor) commented Jan 23, 2024

Is it possible to see if this worked on some older versions of code? I'm curious if something broke this. It seems like core functionality.

@juliev0 (Contributor) commented Jan 23, 2024

> Yes, I think this was the case before. Under normal circumstances the outputs are processed before the pod status, because the taskresult resource is indeed created earlier. But at large scale, under high load, the two events can arrive at the API server in a different order.

I see. So, maybe this is a good enough answer to my request that you test on an older version - perhaps this case is just an unusual one? I am kind of curious if other people have logged similar bugs.

@shuangkun (Member, Author):
> Is it possible to see if this worked on some older versions of code? I'm curious if something broke this. It seems like core functionality.

I think this is related to the introduction of the taskresult resource in 3.4. It may be hard to support on older versions, because they lack a record of whether the taskresult was processed. (Originally I needed to add this record, but found that it was already included in the latest version.)

@shuangkun (Member, Author):
> I see. So, maybe this is a good enough answer to my request that you test on an older version - perhaps this case is just an unusual one? I am kind of curious if other people have logged similar bugs.

Yes, I tested it on version 3.4.12 for a few weeks and it looks good. There is no more "failed to evaluate expression" error like before.

@juliev0 (Contributor) commented Jan 23, 2024

@Garett-MacGowan do you want to look at this too?

@Garett-MacGowan (Contributor):
> @Garett-MacGowan do you want to look at this too?

I just skimmed it. I can take a proper look after I 😴. In general, if we're proceeding to the next steps before outputs are reconciled, it seems important that we add the wait behavior. As you said, it seems like core functionality, so I'm surprised it's not already accounted for.

I'm wondering if this can be tested.

}
// Check whether the node has output and whether its taskresult is in an incompleted state.
if tmpl.HasOutputs() && woc.wf.Status.IsTaskResultInCompleted(node.ID) && woc.wf.Status.IsTaskResultInCompleted(pod.Name) {
	woc.log.WithFields(log.Fields{"nodeID": newState.ID}).WithError(err).Error("Taskresult of the node not yet completed")
Contributor:

Same comment along the lines of what @juliev0 was saying: I don't think this is an error. We just need to flag needReconcileTaskResult. Maybe just log it normally if you want the log?

Contributor:

right, probably a Debug line

Contributor:

By the way, do we need to call it for both woc.wf.Status.IsTaskResultInCompleted(node.ID) && woc.wf.Status.IsTaskResultInCompleted(pod.Name)?

Contributor:

Oh yeah, I was thinking this but had to step away and forgot to ask. I think it should just be tmpl.HasOutputs() && woc.wf.Status.IsTaskResultInCompleted(node.ID)

Contributor:

Maybe confusion from the comment here

Contributor:

seeing that comment about the comment made me think of it :)

@shuangkun (Member, Author) commented Jan 24, 2024:

> By the way, do we need to call it for both woc.wf.Status.IsTaskResultInCompleted(node.ID) && woc.wf.Status.IsTaskResultInCompleted(pod.Name)?

Yes, I thought about this at first, but there is a problem: when the outputs come via pod annotations versus via a taskresult, the key values are different. Maybe we can unify them to the pod name or the node ID. I think node ID is better; what do you think?

Maybe I can add a func named pod.GetNodeId(). For reference, the current legacy pod-patch path keys the result by pod name:

	if x, ok := pod.Annotations[common.AnnotationKeyReportOutputsCompleted]; ok {
		woc.log.Warn("workflow uses legacy/insecure pod patch, see https://argo-workflows.readthedocs.io/en/latest/workflow-rbac/")
		resultName := pod.GetName()
		if x == "true" {
			woc.wf.Status.MarkTaskResultComplete(resultName)
		} else {
			woc.wf.Status.MarkTaskResultIncomplete(resultName)
		}
	}
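A sketch of what the suggested pod.GetNodeId() helper could look like (hypothetical: this PR does not add it, and the workflows.argoproj.io/node-id annotation key and the pod-name fallback are assumptions about how workflow pods are labeled, not code from this PR):

package main

import (
	"fmt"

	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// getNodeID resolves a pod back to its workflow node ID so task results
// (keyed by node ID) and legacy pod annotations (keyed by pod name) can
// be unified under one key. Hypothetical sketch only.
func getNodeID(pod *corev1.Pod) string {
	if id, ok := pod.Annotations["workflows.argoproj.io/node-id"]; ok {
		return id
	}
	// Fallback for pods without the annotation (an assumption).
	return pod.Name
}

func main() {
	pod := &corev1.Pod{ObjectMeta: metav1.ObjectMeta{
		Name:        "workflow-bhr9k-whalesay-1234567890",
		Annotations: map[string]string{"workflows.argoproj.io/node-id": "workflow-bhr9k-1234567890"},
	}}
	fmt.Println(getNodeID(pod)) // workflow-bhr9k-1234567890
}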

Contributor:

Yes, just unify to Node ID.

@juliev0 juliev0 added the prioritized-review label Jan 23, 2024
shuangkun and others added 6 commits January 28, 2024
	woc.log.WithField("workflow", woc.wf.ObjectMeta.Name).Info("pod reconciliation didn't complete, will retry")
	woc.requeue()
	return
}
Contributor:

Sorry, I just realized that we probably need to move the if err != nil clause above the if !podReconciliationCompleted {, since we can return err, false
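A simplified, self-contained sketch of why that ordering matters when a call returns (completed bool, err error); reconcile and operate below are stand-ins, not the controller's actual functions:

package main

import (
	"errors"
	"fmt"
)

// reconcile stands in for the controller's podReconciliation call: it can
// report "not finished yet" (false, nil) or a hard failure (false, err).
func reconcile(fail bool) (bool, error) {
	if fail {
		return false, errors.New("informer out of sync")
	}
	return false, nil
}

func operate(fail bool) {
	completed, err := reconcile(fail)
	// Error check first: a hard failure must not be misread as
	// "not complete yet" and silently requeued forever.
	if err != nil {
		fmt.Println("marking workflow errored:", err)
		return
	}
	if !completed {
		fmt.Println("pod reconciliation didn't complete, will retry")
		return
	}
	fmt.Println("reconciliation complete")
}

func main() {
	operate(true)  // marking workflow errored: informer out of sync
	operate(false) // pod reconciliation didn't complete, will retry
}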

Contributor:

hopefully after that we should be good, thank you for the iterations!

@shuangkun (Member, Author) commented Jan 28, 2024:

Yes, I changed it. Thanks!

@juliev0 juliev0 merged commit 8f2746a into argoproj:main Jan 28, 2024
27 checks passed
isubasinghe pushed commits referencing this pull request to isubasinghe/argo-workflows on Feb 4, Feb 27, and Feb 28, 2024.