Workflow level timeouts #848

vicaire · 2018-05-01T21:00:13Z

Is this a BUG REPORT or FEATURE REQUEST?: FEATURE REQUEST

The following issue proposes to add an exponential backoff retry strategy at the container level:

Would it also make sense to provide retry strategies for whole workflows?

Additionally, would it be possible to have timeouts for both containers and complete workflow so that (independently of retries) the container and/or workflows are cancelled if they do not complete within a specified duration?

In some cases, it might be useful to specify different timeouts for the case where the execution of the container takes too long, or the scheduling of the container takes too long (due to a lack of ressources in the cluster).

jessesuen · 2018-05-01T21:37:07Z

At the container level timeouts already possible via the activeDeadlineSeconds flag which we carry forward to the pod.

At the workflow level, we have built the groundwork for supporting this (but not yet fully implemented) through the /execution annotation which all workflow pod executors will look to, when deciding if a pod should terminate. Currently the only user of this annotation is daemon steps, but it was intended to support all types steps, not just daemon. Workflow level timeouts would be implemented by setting the deadline in this annotation

vicaire · 2018-05-02T04:09:09Z

Sounds good. Thanks Jesse.

vicaire · 2018-08-29T22:15:59Z

Thanks!

jessesuen added this to the v2.2 milestone Aug 6, 2018

jessesuen changed the title ~~Retry and timeouts at the container and workflow level.~~ Workflow level timeouts Aug 25, 2018

jessesuen closed this as completed in 69c390f Aug 29, 2018

zhujl1991 mentioned this issue May 15, 2019

fix: Fix workflow level timeouts #1369

Merged

icecoffee531 pushed a commit to icecoffee531/argo-workflows that referenced this issue Jan 5, 2022

chore: Refine swagger file with referenced k8s specs (argoproj#848)

28a5c2b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Workflow level timeouts #848

Workflow level timeouts #848

vicaire commented May 1, 2018

jessesuen commented May 1, 2018 •

edited

Loading

vicaire commented May 2, 2018

vicaire commented Aug 29, 2018

Workflow level timeouts #848

Workflow level timeouts #848

Comments

vicaire commented May 1, 2018

jessesuen commented May 1, 2018 • edited Loading

vicaire commented May 2, 2018

vicaire commented Aug 29, 2018

jessesuen commented May 1, 2018 •

edited

Loading