fix(repo): explicitly stop all agents on failure #11123
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Current Behavior
We don't explicitly start CI runs on our CI. If an error happens before the distributed run was initiated, the agents will keep on spinning until they timeout (60min).
Expected Behavior
Agents should be killed immediately when CI run fails
Related Issue(s)
Issue discovered by @rarmatei on #11113
Fixes #