-
Notifications
You must be signed in to change notification settings - Fork 257
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Pods stuck in ContainerCreating state #973
Comments
@ionutleca how long are you leaving your node with the containers in "ContainerCreating" state? Do these containers ever get past this stage? In our testing with |
They were stuck in that state >10 minutes. Since that delay is not acceptable for us, we never waited to see how much they actually stay in that state, considering that we experience no such delays with the fixed runC version. |
assigning to jacob assuming so |
@Oats87 no they never pass the ContainerCreating state, I have seen them stuck for more than a day |
Validated this is working on v1.21.1-alpha5+rke2r1 and v1.20.7-rc1+rke2r1Pods no longer get stuck in this state. Can increase pod limit successfully via kubelet arg. |
Environmental Info:
RKE2 Version:
v1.20.6+rke2r1 (da4fc2f)
Node(s) CPU architecture, OS, and Version:
Linux server0 4.18.0-193.47.1.el8_2.x86_64 SMP Thu Mar 4 03:03:32 EST 2021 x86_64 x86_64 x86_64 GNU/Linux
Cluster Configuration:
1 server
Describe the bug:
When trying to deploy more than ~100 pods we get pods stuck in the
ContainerCreating
state.After further digging into the node logs we found that we have results similar to the ones reported here opencontainers/runc#2865
Steps To Reproduce:
Expected behavior:
Being able to create pods up to the 200 limit
Actual behavior:
Pods are stuck in
ContainerCreating
well before the 200 limitAdditional context / logs:
Looks like upgrading the runC version to
1.0.0-rc94
from1.0.0-rc93
(opencontainers/runc#2871) does the trick.We replaced the runC binary with the new version and the rke2 cluster seemed to successfully schedule pods up to the specified limit.
The text was updated successfully, but these errors were encountered: