New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Node hostname is set to pod name #2983
Comments
hey @benceszikora thanks for reporting the issue this bug is known to us, and we're working on a proper fix in CRI-O. Unforunately, most of our developers are at kubecon this week, and are pretty busy. I hope to finish tackling it in the next two weeks. It's definitely my top priority. In the meantime, you can mitigate this by increasing the kernel.pid_max value to the maximum for your system. |
@haircommander Thank you, we'll increase pid_max in the meantime. Do you know if the |
That's part of the same issue 😄 |
Hello, Any updates on the fix? We are seeing the same issue. |
This was fixed in #3042 which is in 1.16.2. backports to 1.15 and 1.14 are also planned |
@haircommander could you provide some details on how that fix is related to the issue with host name changes? I don't think that conmon was OOM killed in our case. |
The only situation in which we have seen this happen recently is described here https://capsule8.com/blog/oomypod-nothin-to-cri-o-bout/, which can only happen with a conmon OOM or conmon being killed before the container is. |
Thanks a lot! And I just confirmed that a container that managed to change host name was indeed OOM killed, so it must be it. |
ah hah, then increasing pids_max and using 1.16.2 should significantly decrease the chance of that happening again. Stay tuned for 1.17 where this situation should be fixed entirely :) |
we now set manage_ns_lifecycle by default in CRI-O, which allows CRI-O to have better control over the namespace lifecycle. Further, we've made conmon have an oom adjust score of -1000, so it is never OOM'd. As such, this issue should be entirely prevented. As such, I'm closing this issue. please reopen if you disagree |
Description
We have noticed 2 nodes in our cluster that had their hostname set to the name of a pod running on them. At first glance it seems similar to issue #2122, but in our case the pod is not privileged and supposedly that issue was fixed already.
I can't really find anything related in the logs, the only interesting lines are like this:
Steps to reproduce the issue:
I can't reliably reproduce the issue. So far it happened on two nodes and both pods are Rook radosgw deployments.
Describe the results you received:
The hostnames of the nodes are set to the name of a pod running on them.
Describe the results you expected:
The hostname does not change
Additional information you deem important (e.g. issue happens only occasionally):
We have updated to crio 1.16.0 last Friday, and we have not seen this issue before.
Output of crio --version:
Example affected pod manifest:
The text was updated successfully, but these errors were encountered: