Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix stall on failure to mark as failed #570

Merged
merged 2 commits into from Apr 11, 2021
Merged

Fix stall on failure to mark as failed #570

merged 2 commits into from Apr 11, 2021

Conversation

Mark-Simulacrum
Copy link
Member

The root cause of this bug hasn't yet been tracked down, but the additional logging added in the first and second commits should help track it down in the future as well. It seems pretty likely that there are more cases where bailing early leads to problems (as deletion from the graph is extremely necessary but seems easily missed), but hopefully the added logging will help reduce debugging time in the future. A more exhaustive fix here seems difficult with the current abstractions in the code, so would need more thought - my hope is that the bugs we may have can be squashed out pretty quickly with how many runs we have regularly through crater, and ultimately the rewrite avoided (as it has its own likelihood of adding bugs).

If a task fails to get marked as failed (for whatever reason), we still want to
delete it from the task graph. Otherwise it'll get left in the task graph and
we'll eventually block on its completion (it's marked as running).
This should help debug early exits, by giving a clear indication of when a
worker bails out. This should normally never happen; anytime it does also
decreases our parallelism (as we don't respawn the worker), so should be tracked
and avoided.
@Mark-Simulacrum
Copy link
Member Author

r? @pietroalbini

@pietroalbini
Copy link
Member

@bors r+

@bors
Copy link
Collaborator

bors commented Apr 11, 2021

📌 Commit 580db9b has been approved by pietroalbini

@bors
Copy link
Collaborator

bors commented Apr 11, 2021

⌛ Testing commit 580db9b with merge ba03080...

@bors
Copy link
Collaborator

bors commented Apr 11, 2021

☀️ Test successful - checks-actions
Approved by: pietroalbini
Pushing ba03080 to master...

@bors bors merged commit ba03080 into rust-lang:master Apr 11, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants