v2.12: pod deleted + re-apply error = errored workflow
#4798
alexec added the type/bug and type/regression labels on Dec 24, 2020
alexec added a commit to alexec/argo-workflows that referenced this issue on Dec 29, 2020
, argoproj#4806, argoproj#3551 Signed-off-by: Alex Collins <alex_collins@intuit.com>
saranyaeu2987 pushed a commit to saranyaeu2987/argo-1 that referenced this issue on Jan 5, 2021
, argoproj#4806 (argoproj#4808) Signed-off-by: Alex Collins <alex_collins@intuit.com> Signed-off-by: saranyaeu2987 <saranyaeu2987@gmail.com>
Fix for this is out on https://github.com/argoproj/argo/releases/tag/v2.12.3
Summary

In v2.12, we can get a pod deleted error under high load. I believe this is caused by factors interplaying:

Error: pod deleted. The workflow is marked as errored.

Causes:

reapplyUpdate will happily overwrite a completed workflow or node.
DEFAULT_REQUEUE_TIME should be longer, up to 10s.

Solution

reapplyUpdate should check whether it is overwriting a successful workflow or any successful nodes and, if so, error out. This will prevent any future cases of succeeded workflows being marked as error.
TODO to remove.

Relates to #4795, #4634, #4794