Workflow Retry Does Not Work for Plugin #11489
Comments
To put the plugin node into the Failed state, I deliberately added exit -1 to the plugin script.
Looking forward to your PR!
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs.
I wonder why the controller does not clean up the WorkflowTaskSet when the workflow completes. If it did, patching out the failed node's status on a manual retry might be unnecessary.
argo-workflows/workflow/controller/controller.go Lines 870 to 875 in 13444e6
argo-workflows/workflow/controller/taskset.go Lines 34 to 52 in 13444e6
The controller tries to delete completed nodes of HTTP/Plugin type. However, the node-delete patch has no effect on the status field. I also tried a JSON patch like '[{"op": "remove", "path": "/status/nodes/plugin-demo-g5qfg"}]', and it did not work either.
argo-workflows/workflow/controller/taskset.go Lines 34 to 52 in 13444e6
So spec and status cannot be modified in a single patch JSON; status has to be patched through the status subresource.
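To illustrate the point above, here is a minimal sketch of splitting the cleanup into two JSON merge patches, one for spec and one for status, so that the status part can be sent to the status subresource separately. The function name buildSplitPatches is hypothetical, and the spec.tasks path is an assumption; status.nodes follows the path used in the JSON patch quoted above. The actual client call that submits each patch is not shown.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// buildSplitPatches (hypothetical helper) returns two JSON merge patches that
// remove the given node IDs from a WorkflowTaskSet: one touching only
// spec.tasks (assumed path) and one touching only status.nodes.
// In a JSON merge patch, a null value deletes the key.
func buildSplitPatches(nodeIDs []string) (specPatch, statusPatch []byte, err error) {
	deleted := make(map[string]interface{}, len(nodeIDs))
	for _, id := range nodeIDs {
		deleted[id] = nil // marshals to null, i.e. "remove this entry"
	}

	specPatch, err = json.Marshal(map[string]interface{}{
		"spec": map[string]interface{}{"tasks": deleted},
	})
	if err != nil {
		return nil, nil, err
	}

	statusPatch, err = json.Marshal(map[string]interface{}{
		"status": map[string]interface{}{"nodes": deleted},
	})
	if err != nil {
		return nil, nil, err
	}
	return specPatch, statusPatch, nil
}

func main() {
	spec, status, err := buildSplitPatches([]string{"plugin-demo-g5qfg"})
	if err != nil {
		panic(err)
	}
	fmt.Println(string(spec))   // patch for the main resource
	fmt.Println(string(status)) // patch intended for the status subresource
}
```

In a real controller the second patch would presumably be submitted against the status subresource rather than the main resource; that wiring depends on the client in use and is left out here.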
Instead of fixing the patch bug, it would be much simpler to delete the entire WorkflowTaskSet.
…ixes argoproj#11489 Signed-off-by: oninowang <oninowang@tencent.com>
Pre-requisites
:latest
What happened/what you expected to happen?
What happened:
When using plugins, if a plugin node goes into the Failed phase, workflow retry does not work. On retry, the plugin node is not retried and remains in the Failed phase.
What you expected to happen:
The plugin node should be retried.
Self Diagnosis:
Based on exploring the codebase, I see that the workflow controller loads plugin node state from the WorkflowTaskSet status. Ref.
On retry, the Argo Workflows server does remove the failed plugin node from the workflow's node status, but it does not patch the WorkflowTaskSet. Because of this, the workflow controller loads the stale Failed status for the plugin node back from the WorkflowTaskSet.
IMO, on retry the Argo Workflows server should patch the WorkflowTaskSet (remove the status for the failed plugin node).
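The behaviour described in this diagnosis can be sketched with a small in-memory model. This is not the controller's or server's actual code: the types and the retry and reconcile functions below are hypothetical simplifications that only mirror the description (retry removes the failed node from the workflow status; the controller then copies node results back from the WorkflowTaskSet status).

```go
package main

import "fmt"

// Phase is a simplified stand-in for a node phase.
type Phase string

const Failed Phase = "Failed"

// retry models the server-side retry: failed nodes are removed from the
// workflow's node status. Only when clearTaskSet is true is the matching
// entry also removed from the task set status (the proposed fix).
func retry(wfNodes, taskSetNodes map[string]Phase, clearTaskSet bool) {
	for id, phase := range wfNodes {
		if phase != Failed {
			continue
		}
		delete(wfNodes, id)
		if clearTaskSet {
			delete(taskSetNodes, id)
		}
	}
}

// reconcile models the controller loading plugin node results from the
// task set status back into the workflow's node status.
func reconcile(wfNodes, taskSetNodes map[string]Phase) {
	for id, phase := range taskSetNodes {
		wfNodes[id] = phase
	}
}

func main() {
	const id = "plugin-demo-g5qfg"

	// Current behaviour: the stale task set entry brings the failure back.
	wf := map[string]Phase{id: Failed}
	ts := map[string]Phase{id: Failed}
	retry(wf, ts, false)
	reconcile(wf, ts)
	fmt.Println("without task set cleanup:", wf[id])

	// Proposed behaviour: the entry is cleared, so the node can run again.
	wf = map[string]Phase{id: Failed}
	ts = map[string]Phase{id: Failed}
	retry(wf, ts, true)
	reconcile(wf, ts)
	_, stillPresent := wf[id]
	fmt.Println("with task set cleanup, stale node present:", stillPresent)
}
```

In this model the first run ends with the node back in Failed, while the second leaves the node absent from the workflow status, which is the state a retry needs before the node can be scheduled again.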
Version
latest
Paste a small workflow that reproduces the issue. We must be able to run the workflow; don't enter workflows that use private images.
Logs from the workflow controller
Logs from your workflow's wait container