Add missing trigger for failed-to-start nodes #13802
Merged
SUMMARY
Connect #2766
For the same scenario, the numbers with this patch are:
Started 4/5/2023, 8:40:01 AM
Finished 4/5/2023, 8:40:02 AM
So this is 1 second, compared to 50 seconds before the patch.
Looking at the code, I tried to pin down exactly which scenario was missing the trigger. I believe it's the case where the start checks fail. In that scenario we never end up waiting for the job to finish (the job never starts in the first place), so unless we re-schedule right away, we run out the workflow manager scheduler's timer. That is the source of the 50 seconds we were hitting before.
This is a very simple patch, and I don't see any risk of over-scheduling. Spawning a node and failing to start it is a processing action that corresponds to the completion of a node. In the general sense, we do need to worry about infinite scheduling loops, but as long as every schedule call corresponds to a tangible and finite form of progress for processing jobs, that can't happen.
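To illustrate the pattern (this is a hedged sketch, not the actual AWX code — the class, queue, and status fields here are hypothetical stand-ins): when a node fails its start checks, no job-completion event will ever arrive, so the spawn path itself must signal the scheduler rather than leaving it to the periodic timer.

```python
import queue


class WorkflowManager:
    """Hypothetical sketch of the scheduling trigger described above."""

    def __init__(self):
        # Pending requests for the scheduler to run another pass.
        self.schedule_requests = queue.Queue()

    def schedule(self, reason):
        # Wake the scheduler immediately instead of waiting for its timer.
        self.schedule_requests.put(reason)

    def spawn_node(self, node, start_checks_pass):
        if not start_checks_pass:
            # The node never starts, so no job-completion event will fire.
            # Without this trigger, the scheduler would only wake up again
            # when its periodic timer runs out (the ~50s observed above).
            node["status"] = "failed"
            self.schedule("node_failed_to_start")  # the added trigger
            return False
        node["status"] = "running"
        return True


mgr = WorkflowManager()
node = {"name": "demo", "status": "pending"}
mgr.spawn_node(node, start_checks_pass=False)
print(node["status"])                      # failed
print(mgr.schedule_requests.get_nowait())  # node_failed_to_start
```

The key property is that each `schedule()` call is tied to a node reaching a terminal state, so the number of extra scheduler passes is bounded by the number of nodes — there is no path to an infinite scheduling loop.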
ISSUE TYPE
COMPONENT NAME