No timeout when Kubelet Calling cni plugin #65743
Comments
/assign liucimin |
/sig node |
We are also affected by this issue (see #45419). Can someone take a look at this PR? We will be happy to test it. |
@huydinhle |
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with /close. Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
containernetworking/cni#568 got merged, this is resolved I presume? @liucimin /close |
@nikopen: Closing this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
Just saw that a fix is needed in kubelet as well, can you confirm @liucimin ? /reopen |
@nikopen: Reopening this issue. In response to this:
Yes kubelet needs to be changed. |
@nikopen |
@bboreham what are the plans for cutting a new CNI release and bumping it in k8s? Does it need a new major k8s release to be bumped, v1.13? Here I can see the latest is v0.7.3, seen as plain 'cni' in fedora, but in the official page the latest is 0.6.0 with some 0.7.0-alpha versions. Maybe because on the v1.12 changelogs I can see k8s is still using 0.6.0, but it's still inconsistent? It's an old issue affecting many ( #45419 ), if the fix in kubelet is straightforward then this can be a great addition, given v1.13 is marked as a stability release. |
/remove-lifecycle stale |
Please can you check that all necessary changes are made in Kubernetes. The CNI change is all in the library so just vendor in whatever commit you want to, and check it all works as you need.
That's the plugins - different thing. |
@liucimin feel free to prepare the Kubelet PR, it will then be easier to coordinate the full fix. |
@nikopen |
There is no relevant change in CNI plugins. The change you are waiting for is in libcni, and is already merged to master. Do not wait for it: test at will. (I expect the calling code will need changing to make use of the new timeout, but I haven’t studied it) |
The .go files under https://github.com/kubernetes/kubernetes/tree/master/vendor/github.com/containernetworking/cni/libcni were all last changed two years ago. Where can we find the merged libcni code? |
@roywangti the repo for libcni is https://github.com/containernetworking/cni/ Part of the next step is to copy updated files into the location you pointed at. |
@bboreham, got it, Thanks! |
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with /close. Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
/remove-lifecycle stale |
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with /close. Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
Stale issues rot after 30d of inactivity. If this issue is safe to close now please do so with /close. Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
Rotten issues close after 30d of inactivity. Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
@fejta-bot: Closing this issue. In response to this:
/reopen |
@liucimin: Reopened this issue. In response to this:
BUG REPORT:
/kind bug
What happened:
When the kubelet creates a pod sandbox through a CNI plugin, it simply uses exec.Cmd to call the CNI binary, with no timeout.
With some CNI plugins, such as contiv, the CNI binary may lock up and never return.
In this case, the first time the kubelet calls RunPodSandbox it gets a ctx timeout. Every later attempt by the kubelet to re-create the PodSandbox reports the same timeout.
What you expected to happen:
The next time the kubelet re-creates the PodSandbox, it should call the CNI plugin again.
How to reproduce it (as minimally and precisely as possible):
Create a pod on a node and make the CNI binary never return.
Environment:
Kubernetes version (use kubectl version): v1.9.2
@kubernetes/sig-node-bugs