-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Frequent clangd test failures on Windows buildbot #1712
Comments
I'm noticing a new message that might explain some of the instability. https://lab.llvm.org/buildbot/#/builders/123/builds/20277 has a failure:
so I'm contacting the bot owner to see if they can address that; hopefully that will resolve the mysterious failures. However, if the only error you get is "no result from call after 10 seconds" but the root cause is disk space issues, perhaps the diagnostic can be improved in clangd? |
Any update on this? Spot-checking https://lab.llvm.org/buildbot/#/builders/clang-x64-windows-msvc shows most of the failures are spurious ones from clangd tests. Given that this has been happening for a few weeks it's become kind of disruptive; should we be exploring disabling these tests? |
Considering that even the So sending out a patch to bump the deadlines. It'd be great to get rid of them completely, but they're the only thing preventing clangd from hanging the buildbots forever if things go wrong. |
patch for LSP tests in https://reviews.llvm.org/D158426, will take a look at ManyUpdates separately as it's a little bit more delicate. |
We seem to be hitting limits in some windows build bots, see clangd/clangd#1712 (comment). So bumping the timeouts to 60 seconds and completely dropping them for sync requests. As mentioned in the comment above, this should improve things, considering even the tests that don't touch any complicated scheduler is failing. Differential Revision: https://reviews.llvm.org/D158426
We started seeing a lot of timeouts that align with the change in lit to execute gtests in shards. The logic there assumes tests are single-threaded, which is the case for most of the LLVM, hence they pick #shards ~ #cores (by slightly overshooting). There are enough unittests in clangd that rely on multi-threading, they can create arbitrarily many threads but we limit amount of meaningful work to ~4 thread per process. This change ensures that we're accounting for that paralelism when executing clangd tests and not overloading test executors. In theory the change overestimates the requirements, not all tests are multi-threaded, but it doesn't seem to be resulting in any regressions on my local runs. Fixes llvm#64964. Fixes clangd/clangd#1712.
We seem to be hitting limits in some windows build bots, see clangd/clangd#1712 (comment). So bumping the timeouts to 60 seconds and completely dropping them for sync requests. As mentioned in the comment above, this should improve things, considering even the tests that don't touch any complicated scheduler is failing. Differential Revision: https://reviews.llvm.org/D158426
We seem to be hitting limits in some windows build bots, see clangd/clangd#1712 (comment). So bumping the timeouts to 60 seconds and completely dropping them for sync requests. As mentioned in the comment above, this should improve things, considering even the tests that don't touch any complicated scheduler is failing. Differential Revision: https://reviews.llvm.org/D158426
We seem to be hitting limits in some windows build bots, see clangd/clangd#1712 (comment). So bumping the timeouts to 60 seconds and completely dropping them for sync requests. As mentioned in the comment above, this should improve things, considering even the tests that don't touch any complicated scheduler is failing. Differential Revision: https://reviews.llvm.org/D158426
We seem to be hitting limits in some windows build bots, see clangd/clangd#1712 (comment). So bumping the timeouts to 60 seconds and completely dropping them for sync requests. As mentioned in the comment above, this should improve things, considering even the tests that don't touch any complicated scheduler is failing. Differential Revision: https://reviews.llvm.org/D158426
We seem to be hitting limits in some windows build bots, see clangd/clangd#1712 (comment). So bumping the timeouts to 60 seconds and completely dropping them for sync requests. As mentioned in the comment above, this should improve things, considering even the tests that don't touch any complicated scheduler is failing. Differential Revision: https://reviews.llvm.org/D158426
We seem to be hitting limits in some windows build bots, see clangd/clangd#1712 (comment). So bumping the timeouts to 60 seconds and completely dropping them for sync requests. As mentioned in the comment above, this should improve things, considering even the tests that don't touch any complicated scheduler is failing. Differential Revision: https://reviews.llvm.org/D158426
We seem to be hitting limits in some windows build bots, see clangd/clangd#1712 (comment). So bumping the timeouts to 60 seconds and completely dropping them for sync requests. As mentioned in the comment above, this should improve things, considering even the tests that don't touch any complicated scheduler is failing. Differential Revision: https://reviews.llvm.org/D158426
We started seeing a lot of timeouts that align with the change in lit to execute gtests in shards. The logic there assumes tests are single-threaded, which is the case for most of the LLVM, hence they pick #shards ~ #cores (by slightly overshooting). There are enough unittests in clangd that rely on multi-threading, they can create arbitrarily many threads but we limit amount of meaningful work to ~4 thread per process. This change ensures that we're accounting for that paralelism when executing clangd tests and not overloading test executors. In theory the change overestimates the requirements, not all tests are multi-threaded, but it doesn't seem to be resulting in any regressions on my local runs. Fixes llvm/llvm-project#64964. Fixes clangd/clangd#1712.
It appears there are frequent clangd test failures on the Windows buildbots.
A few samples:
https://lab.llvm.org/buildbot/#/builders/123/builds/20148/steps/4/logs/stdio:
https://lab.llvm.org/buildbot/#/builders/123/builds/20145/steps/4/logs/stdio
https://lab.llvm.org/buildbot/#/builders/123/builds/20142/steps/4/logs/stdio
https://lab.llvm.org/buildbot/#/builders/123/builds/20137/steps/4/logs/stdio
https://lab.llvm.org/buildbot/#/builders/123/builds/20134/steps/4/logs/stdio
https://lab.llvm.org/buildbot/#/builders/123/builds/20131/steps/4/logs/stdio
https://lab.llvm.org/buildbot/#/builders/123/builds/20022/steps/4/logs/stdio
A common element seems to be the following message printed before the failures:
cc @kadircet @hokein @sam-mccall
The text was updated successfully, but these errors were encountered: