runtime: unresponsive tests with a "running on other thread" goroutine created by runtime.gcenable #64062
Found new dashboard test flakes for:
2023-11-09 21:01 linux-ppc64le-power10osu tools@e3f67986 go@130baf3d x/tools/internal/gcimporter (log)
2023-11-10 13:59 linux-ppc64le-power10osu tools@92a8009c go@abf84221 x/tools/internal/gcimporter (log)
systemstack_switch
@golang/runtime: this looks to me like a runtime deadlock or livelock. Notably, the test binary's internal timer failed to fire.
Huh. Yeah, my attempt at a
systemstack_switch
running on other thread
goroutine created by runtime.gcenable
I believe this is the same issue as #64050. Working on it now.
Wait, actually I see the issue here very clearly in that stack trace. ... Huh. This seems unrelated to basically anything else that has landed recently, but I'm positive it's the problem.
Change https://go.dev/cl/541635 mentions this issue:
Closing as a duplicate of #64067.
@mknyszek thanks for looking into this. Reverting CL 533455 seemed to avoid the issue on ppc64, though as you note, it's likely not the cause. I had to reboot a couple of the PPC64 VMs to unclog them. @bcmills do you know if make.bash is subject to a timeout during CI? Some of the jobs seemed to be stuck there for many hours.
@pmur, I honestly don't know. 😅 You could check the code in
Found new dashboard test flakes for:
2023-11-10 18:46 linux-ppc64le-power10osu tools@3b6876f0 go@31887586 x/tools/internal/gcimporter (log)
@bcmills @pmur I think it might indeed be the case that there's no timeout on In the LUCI world there's always a single timeout for the overall build. I think it's quite high by default at like 2 hours or something, but it is there. We can reconfigure that fairly easily. For LUCI builders that shard out
These functions acquire the heap lock. If they're not called on the systemstack, a stack growth could cause a self-deadlock since stack growth may allocate memory from the page heap.

This has been a problem for a while. If this is what's plaguing the ppc64 port right now, it's very surprising (and probably just coincidental) that it's showing up now.

For #64050.
For #64062.
Fixes #64067.

Change-Id: I2b95dc134d17be63b9fe8f7a3370fe5b5438682f
Reviewed-on: https://go-review.googlesource.com/c/go/+/541635
LUCI-TryBot-Result: Go LUCI <golang-scoped@luci-project-accounts.iam.gserviceaccount.com>
Run-TryBot: Michael Knyszek <mknyszek@google.com>
Auto-Submit: Michael Knyszek <mknyszek@google.com>
TryBot-Result: Gopher Robot <gobot@golang.org>
Reviewed-by: Michael Pratt <mpratt@google.com>
Reviewed-by: Paul Murphy <murp@ibm.com>
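The self-deadlock described in the commit message above can be sketched in ordinary Go as a user-space analogy. This is not runtime code: the `heap` type, `allocPage`, and `update` are hypothetical stand-ins, and `sync.Mutex.TryLock` is used only so the example reports the would-be deadlock instead of hanging.

```go
package main

import (
	"errors"
	"fmt"
	"sync"
)

// heap is a hypothetical stand-in for a structure guarded by a
// non-reentrant lock. It is not the runtime's page heap.
type heap struct {
	mu    sync.Mutex
	pages int
}

// allocPage models an allocation path that needs the lock. It uses TryLock
// so that re-entry is reported as an error rather than blocking forever.
func (h *heap) allocPage() error {
	if !h.mu.TryLock() {
		return errors.New("would self-deadlock: lock already held")
	}
	defer h.mu.Unlock()
	h.pages++
	return nil
}

// update holds the lock while doing its work. When mayGrowStack is true it
// models running on a stack that can grow: the growth triggers a nested
// allocation that needs the same lock. When false it models running where
// no growth (and so no nested allocation) can happen.
func (h *heap) update(mayGrowStack bool) error {
	h.mu.Lock()
	defer h.mu.Unlock()
	if mayGrowStack {
		return h.allocPage()
	}
	return nil
}

func main() {
	h := &heap{}
	fmt.Println("growable stack:", h.update(true))
	fmt.Println("fixed stack:", h.update(false))
}
```

In the real runtime the fix is to run the lock-holding functions on the system stack, where stack growth does not occur; the boolean flag here only imitates that distinction.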
Change https://go.dev/cl/541955 mentions this issue:
… callees on the systemstack

These functions acquire the heap lock. If they're not called on the systemstack, a stack growth could cause a self-deadlock since stack growth may allocate memory from the page heap.

This has been a problem for a while. If this is what's plaguing the ppc64 port right now, it's very surprising (and probably just coincidental) that it's showing up now.

For #64050.
For #64062.
For #64067.
Fixes #64073.

Change-Id: I2b95dc134d17be63b9fe8f7a3370fe5b5438682f
Reviewed-on: https://go-review.googlesource.com/c/go/+/541635
LUCI-TryBot-Result: Go LUCI <golang-scoped@luci-project-accounts.iam.gserviceaccount.com>
Run-TryBot: Michael Knyszek <mknyszek@google.com>
Auto-Submit: Michael Knyszek <mknyszek@google.com>
TryBot-Result: Gopher Robot <gobot@golang.org>
Reviewed-by: Michael Pratt <mpratt@google.com>
Reviewed-by: Paul Murphy <murp@ibm.com>
(cherry picked from commit 5f08b44)
Reviewed-on: https://go-review.googlesource.com/c/go/+/541955
Reviewed-by: Dmitri Shuralyov <dmitshur@golang.org>
Reviewed-by: Dmitri Shuralyov <dmitshur@google.com>
Auto-Submit: Dmitri Shuralyov <dmitshur@google.com>
Issue created automatically to collect these failures.
Example (log):
— watchflakes