-
Notifications
You must be signed in to change notification settings - Fork 38.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
gce pd WaitForAttach never returned #63156
Comments
Weird. But it does appear to be triggered multiple times:
And there is no |
This flake only appears to happen on |
I don't think a logging issue would cause the test to fail due to Mount not happening. I do see other gce pd tests flaking, but I haven't looked into the others to see if they're the same issue: https://k8s-testgrid.appspot.com/sig-storage#gce |
I checked a couple of failed tests. Both of them has the log of "MountVolume.WaitForAttach entering for volume ....", but after that, there is no fail or succeed message. It seems like WaitForAttach was hanging. All the following operationExecutor.MountVolume will fail to start because the first operation still exist. |
Another suspicious operation I see that could block is invoking udevadm here. I think we probably need to manually try to stress and reproduce this, so we have a chance to login and see where it's stuck. |
I ran the test @msau42 suggests the flake may repro if we run multiple volume tests in parallel. Will try that when I have some time. |
Ran the tests with the following command: A good number of iterations have failed tests, and all of the failures I've seen so far are from |
@verult thanks for the test. Maybe we can try to add some logs into WaitForAttach() function since that's where we suspect to have problems. |
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
/remove-lifecycle stale We should at least add some logs. Currently we cannot tell the difference between waitForAttach hanged, or the device path never appeared. |
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
/remove-lifecycle stale |
Is this a BUG REPORT or FEATURE REQUEST?:
@kubernetes/sig-storage-bugs
What happened:
Encountered a strange test failure, where WaitForAttach never returned
Test logs: https://k8s-gubernator.appspot.com/build/kubernetes-jenkins/pr-logs/pull/63045/pull-kubernetes-e2e-gce/32404/
The volume got successfully attached to the node:
Kubelet logs show it detected the attached volume, however WaitForAttach never returned:
The text was updated successfully, but these errors were encountered: