New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GKE tests timeout #69891
Comments
Nevermind ignore this |
I think i found the issue. This test case is stopping kubelet:
But because of #69786, the Node lifecycle controller never updated the node ready status, so the test failed. However, the test doesn't seem to have gone through its recovery procedure to restart kubelet. So all the subsequent tests that have pods scheduled to this node will fail. |
/assign @jingxu97 |
@jingxu97 any update on this issue? |
@wangzhen127 @msau42 now that #69786 is fixed, will this help this timeout as well? |
#69944 is merged and from the looks of it the gke jobs seem to be passing (atleast in master which had a recent run). Thanks @jingxu97 and @msau42 for the investigation and fix. https://k8s-testgrid.appspot.com/sig-release-master-blocking#gke-cos-master-serial @jberkus or @mortent to close this issue once upgrade jobs turn green as well |
Now that we're not timing out, we're seeing some other failures. I'll wait for one more consistent run, and then close this and open a new issue for the new failures. |
/close |
@jberkus: Closing this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
Tests are timing out on GKE
Dashboards: https://k8s-testgrid.appspot.com/sig-release-master-blocking#gke-cos-master-serial
https://k8s-testgrid.appspot.com/sig-release-master-upgrade#gke-gci-new-gci-master-upgrade-master
https://k8s-testgrid.appspot.com/sig-release-master-upgrade#gke-gci-new-gci-master-upgrade-cluster
Sample failure: https://k8s-gubernator.appspot.com/build/kubernetes-jenkins/logs/ci-kubernetes-e2e-gci-gke-serial/7208
This might be related to #69597, but it seems to fail in a different way and also more tests seems to be affected.
/sig test-infrastructure
/sig gcp
/kind failing-test
/priority important-soon
/cc @msau42
/cc @justinsb
/cc @bsalamat
The text was updated successfully, but these errors were encountered: