Persistent local volume e2e fail to clean up and break soak suites #68570
Comments
cc: @msau42 |
cc @davidz627 who is looking at PD leaks on e2e master tests, although this failure is on previous releases |
I think there are a few problems in the "PD should be mountable" test:
|
/assign |
|
I can't find the 1.10 soak tests anymore, but these tests appear to be passing consistently in the 1.11 soak suite, so I'm going to mark this as closed. |
@davidz627: Closing this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
1.10 soak moved to non-blocking: |
It still looks to be failing on 1.10, but I would double-check whether the test version used on 1.10 includes your commit. There was an issue where the release build tags were not getting updated. |
/reopen This is still an issue. I think @msau42 and I have zeroed in on the problem: This is a problem because we use the InnerVolumeSpec to generate the paths used to tear down volumes. We are seeing failed volume teardown, which leaves the pod stuck indefinitely. That in turn causes namespace deletion to fail and the soak tests to fail forever. |
@davidz627: Reopening this issue. |
/close |
@davidz627: Closing this issue. |
The soak-gci-gce suites have been blocked for days because failing storage tests seem to fail to clean up. After a failed test, a pod pd-client is left in the terminating state. This in turn leaves the test namespace stuck in the terminating state, which causes a check in BeforeSuite to fail.
Currently the 1.10 test cluster is in this state, so it could be used for debugging.
Project: k8s-jkns-gci-gce-soak-1-7
Zone: us-central1-f
Master VM: bootstrap-e2e-master
As far as I can tell this is connected with "[sig-storage] Volumes PD should be mountable with ext4" failing in https://k8s-gubernator.appspot.com/build/kubernetes-jenkins/logs/ci-kubernetes-soak-gci-gce-stable2/648.
This is a release-blocking suite for 1.10. Can we clean up the offending pod to unblock the patch release?
Impacted suites:
https://k8s-testgrid.appspot.com/sig-release-1.10-blocking#soak-gci-gce-1.10
https://k8s-testgrid.appspot.com/sig-release-1.11-all#soak-gci-gce-1.11
/sig storage
/priority important-soon
/kind bug