Windows: bazel clean --expunge sometimes causes Bazel server to crash
#3956
Comments
Possible culprit: 4869c4e
git bisect confirmed my suspicion.
I can reproduce this when running
I think we've actually been seeing this in an Ubuntu-based Docker image kubernetes/test-infra#5137, and also now with
@BenTheElder Which Bazel version are you using?
0.7.0, I haven't seen this with previous releases.
Using
I just started seeing this with Bazel 0.8.0rc1 on Linux (Ubuntu). We hadn't seen it previously.
I wasn't able to reproduce it on my Linux box after multiple runs of the loop: for i in {1..10}; do bazel build src:bazel && bazel clean --expunge && bazel shutdown; done
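For anyone retrying this, a variant of the loop above that stops at the first failing iteration and reports which one failed may be easier to work with. This is only a sketch: the repro_loop function and the BAZEL and TARGET variables are illustrative names, not anything from Bazel or from this thread, and it assumes a Bash shell.

```shell
#!/usr/bin/env bash
# Sketch of a reproduction loop that stops at the first failing iteration.
# BAZEL and TARGET are illustrative variables (assumptions), overridable
# from the environment; they default to the command and target used above.
BAZEL="${BAZEL:-bazel}"
TARGET="${TARGET:-src:bazel}"

# repro_loop N: run build, clean --expunge, and shutdown N times (default 10).
# Returns 0 if every iteration succeeds, 1 at the first failing step.
repro_loop() {
  local runs="${1:-10}"
  local i
  for ((i = 1; i <= runs; i++)); do
    if ! "$BAZEL" build "$TARGET" \
        || ! "$BAZEL" clean --expunge \
        || ! "$BAZEL" shutdown; then
      echo "iteration $i failed" >&2
      return 1
    fi
  done
  echo "all $runs iterations passed"
}
```

After sourcing the file, running repro_loop 10 repeats the original sequence, and the iteration number on stderr shows how many runs it took to hit the crash.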
@kush-c we're fairly confident it's related to memory pressure since we only sometimes saw it in our CI but never locally with 0.7 (and the CI nodes are heavily loaded). My best guess is that it was triggered when the job landed on a node under extra memory pressure, though we did not see this with previous Bazel versions as far as I know. At a suggestion from @ixdy we tried flipping on
That's very helpful @BenTheElder. So the issue was present with 0.7 but has just been worsened at HEAD? Commit 4869c4e certainly may be responsible, since it tries to kill the server when the system is low on memory, and perhaps we ran into a race condition when a That also explains why I wasn't able to reproduce this issue even at HEAD on my 48G Linux box, and why @meteorcloudy Given this issue did also occur rarely with 0.7, and using
Hmm.. I doubt they are the same issue on Linux and Windows. The shell script I used:
*** Reason for rollback *** Causing Bazel server to crash when running bazel clean --expunge #3956 *** Original change description *** Delayed rollforward of commit 8fb311b. This was rolled back due to Tensorflow breakage but the patch I exported to gerrit (https://bazel-review.googlesource.com/c/bazel/+/18590) passed Tensorflow (https://ci.bazel.io/job/bazel/job/presubmit/52/Downstream_projects/). Confirmed with jcater@ that the "newly failing" projects in the Global Tests are known issues. I think we can check this in now. Additionally I had attempted to reproduce any tensorflow issues with this by building and testing Tensor... *** ROLLBACK_OF=172361085 RELNOTES:None PiperOrigin-RevId: 175821671
When I run bazel clean --expunge on Windows, I sometimes get
Both error_message and log file are empty.
Bazel version: HEAD