-
Notifications
You must be signed in to change notification settings - Fork 2.3k
This issue was moved to a discussion.
You can continue the conversation there. Go to discussion →
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
k3s-agent Fails to start with with embedded registry and kill entire OS #10101
Comments
I also experience the same :( |
This error is coming from https://github.com/rancher/dynamiclistener/blob/e590d58b896cc8dd33dde7cec80c52e23ec08189/cert/io.go#L89 - the message suggests that the file was created by a previous startup of k3s, but for some reason the file contents have been lost. Your best bet is probably to just delete the file from disk and let it be recreated on startup. You might be able to find other errors in the logs to suggest why the file has no contents or its contents are corrupted, but given that this node is also rebooting unexpectedly, I suspect that you may have lost data from your filesystem when the system crashed.
That sounds like a problem with your node; K3s shouldn't be capable of doing anything that would cause it to panic and reboot. You'll need to figure that out on your own. |
@brandon agree with you! I manage to switch to an openrc system and test the same k3s version, all work as expected. systemd seems playing devil here. :( |
strange, when rolling back to 1.28.6, it runs ok with no issue. |
I have found another potential cause. As I understand, when running with systemd, the cgroup driver should be systemd, however, I found k3s mistaken it as cgroupfs, not sure if this is the issue. |
I'm not aware of any defect in k3s that would cause it to use cgroupfs instead of systemd, when using the embedded containerd on a systemd-based OS. You're not trying to use docker or another user-provided container runtime, are you? |
no, I use kairos from https://github.com/kairos-io/kairos/, which should have no other runtime available. In addition to that, I add some additional printout and find
INVOCATION_ID is empty, something go wrong with systemd, it should set this value. This is very likely systemd issue in their distribution, I will shout out loud there. :D |
This issue was moved to a discussion.
You can continue the conversation there. Go to discussion →
Describe the bug:
I am experiencing an issue where the k3s-agent fails to start properly. During the startup process, the file
/var/lib/rancher/k3s/agent/containerd/peer.key
is generated but its content is empty, which is quite unusual. When I attempt to delete the/var/lib/rancher/k3s/agent/containerd/peer.key
file and then restart the k3s-agent, the system immediately freezes and then reboots.Environmental Info:
K3s Version: v1.28.9+k3s1
Node(s) CPU architecture, OS, and Version:
cpu: 16
os: ubuntu 24.04 (kairos)
Cluster Configuration:
2 servers, 1 agents
Steps To Reproduce:
--embedded-registry
Additional context / logs:
The text was updated successfully, but these errors were encountered: