Rancher container is restarting every 15 seconds on Ubuntu 22.04 #36238
Comments
Having same issue running v2.6.3 in a single node docker container. |
This has been happening to me for about a month and I cannot solve it. I can restore snapshots via Proxmox and it'll be good for about 3 days and randomly back to the reboot loop. Prior to the random event, everything on the node looks healthy. |
Same thing, |
This has also been happening to me. I installed version 2.4 and it worked, but the latest stable release shows the behavior described above. |
I'm going to give this a shot. Would you mind dropping the tag you used? |
Sorry I wasn't more specific, I installed using the docker container with the tag v2.4.9. It works! |
Unfortunately, using older versions does not help me: my downstream clusters run k8s 1.22, so I can't import them. |
Our team is seeing this behavior as well with single-node Rancher v2.6.3-patch1 running on AlmaLinux 8.5 (one of the Enterprise Linux distros). For the Rancher devs -- this is basically the same behavior reported in #35892 and #36047, which my team is also seeing -- the error messages we get alternate between the ones reported here and the ones in those tickets. The comments on #35892 seem to indicate that this is a problem with updates to various systemd packages in EL 8.5, and that the problems did not occur in EL 8.4. |
We have the same problem on a fresh Debian 11 single node server :( |
Same problem here on a fresh OpenSUSE server for v2.6. Downgrading to v2.4.9 worked. |
^ Worked for me as well. I've briefly tried Rancher so many times in the 1st quarter of this year, and never managed to get one to stay up long enough to USE it. WHY is this STILL going on? |
Hello, I have the same problem on Ubuntu 22.04 LTS, and Rancher is not working.
|
I'm having the same issue ... downgrading to version 2.5 works, however some of my clusters are running 1.22 and I cannot import them. |
Waiting for server to become available: an error on the server ("apiserver not ready") has prevented the request from succeeding |
Same here. Will try Ubuntu 20.04 LTS, as the last time that worked. Edit: Downgraded to Ubuntu 20.04 LTS and that solved the problem for me. So they must fix the issue with Ubuntu 22.04 LTS |
Fresh Debian 11 images. Same issue. |
VMware - Ubuntu server 20.10 to 22.10. same issues. |
Ubuntu 20.04 - no issues with all latest releases of Rancher. This has proven to be the solution for all my issues so far. |
Ubuntu 18.04 Same thing every time. It runs long enough to get itself setup, then restarts every few minutes. I can't understand how this has been happening for months with no engagement from the devs? |
The solution for me was changing from Ubuntu to CentOS 7 (from AWS Marketplace). I'm using AWS and I installed Rancher as a single node (for the main/local cluster). Here are the steps that I followed.
1-) Install docker on CentOS:
2-) Install the vim editor:
3-) Create the file daemon.json:
3.1-) File content:
4-) Restart docker:
5-) Enable docker:
6-) Add the centos user to the group dockerroot:
7-) Last but not least, install Rancher: |
Downgrading from Debian 11 to 10 works fine. |
Also tried with Debian 10 and it works. With debian 11 it doesn't work. |
Rancher 2.6.5 on ubuntu 22.04 (Hetzner Cloud VM) - same problem. |
I had this issue on Ubuntu 22.04 and fixed it by editing the /etc/default/grub file. I added these values to GRUB_CMDLINE_LINUX:
GRUB_CMDLINE_LINUX="cgroup_memory=1 cgroup_enable=memory swapaccount=1 systemd.unified_cgroup_hierarchy=0"
then ran:
sudo update-grub
This resolved my problem, but I spent an entire day on it. Maybe it helps someone else as well. I think the same thing works for Debian 11. |
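The systemd.unified_cgroup_hierarchy=0 parameter in the comment above turns off the unified (v2) cgroup hierarchy at boot. The following is a minimal sketch of that edit, not a tested procedure from the thread. It assumes a GNU userland (sed -i, stat -f) and a double-quoted GRUB_CMDLINE_LINUX line; the function names are hypothetical. The first helper reports which cgroup filesystem is mounted, and the second appends the quoted parameters to a grub defaults file passed as an argument.

```shell
#!/bin/sh
# Hypothetical helpers; names and structure are illustrative only.

# cgroup_fs_type [MOUNTPOINT]: print the filesystem type at the cgroup root.
# "cgroup2fs" indicates the unified (v2) hierarchy; "tmpfs" usually indicates v1.
cgroup_fs_type() {
    stat -fc %T "${1:-/sys/fs/cgroup}"
}

# add_cgroup_v1_params FILE: append the parameters quoted in the comment above
# to the GRUB_CMDLINE_LINUX="..." line of FILE, unless they are already there.
add_cgroup_v1_params() {
    grub_file="$1"
    params="cgroup_memory=1 cgroup_enable=memory swapaccount=1 systemd.unified_cgroup_hierarchy=0"
    if grep -q '^GRUB_CMDLINE_LINUX=.*systemd\.unified_cgroup_hierarchy=0' "$grub_file"; then
        return 0
    fi
    # Keep existing parameters and add the new ones before the closing quote.
    sed -i "s/^GRUB_CMDLINE_LINUX=\"\(.*\)\"/GRUB_CMDLINE_LINUX=\"\1 $params\"/" "$grub_file"
    # Tidy the leading space left behind when the original value was empty.
    sed -i 's/^GRUB_CMDLINE_LINUX=" /GRUB_CMDLINE_LINUX="/' "$grub_file"
}
```

On a real host one might run add_cgroup_v1_params /etc/default/grub as root, then sudo update-grub, and reboot, since kernel command-line changes only take effect at the next boot. Trying it on a copy of the file first is the safer choice.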
@ognjen011 Thank you, this GRUB config worked for me on Ubuntu 21.10. |
After days of struggling with the same problem on 22.04 the GRUB_CMDLINE_LINUX thing fixed it :D Thanks @ognjen011 <3 |
Thank you @ognjen011 ! After days of struggling with Ubuntu 22.04 this fixed the issue! |
It works for Debian 11 (debian-11.3.0-amd64-DVD-1.iso). You saved my day, bro! |
@Sahota1225 not only Ubuntu is impacted, Redhat too. |
Available to test with v2.7-head once https://drone-publish.rancher.io/rancher/rancher/8317/1/1 is green |
@kinarashah it works for me. Thanks |
thaneunsoo said:
Test Environment:
Downstream cluster type: Custom
Testing: Tested this issue with the following steps:
Result: The Rancher container is still not running successfully. I am now seeing the following error in the docker logs and am unable to reach Rancher:
Tracking the issue here and will close this ticket once issue is resolved and rancher is able to run successfully. |
Test Environment: Rancher version: v2.7-head eab28dd. Testing: Tested this issue with the following steps:
Result - Pass |
Backport to 2.6.x ? |
Still not working in el9, correct? I can't get |
Same issue, EL9.1, even with the latest |
I managed to resolve this issue with the following procedure. It turned out that iptables was not installed and the appropriate modules were not loaded into the kernel. I tested it on Rocky 8.7 with Rancher 2.7.3. SOLUTION:
Add both modules to the file
and reboot your host.
TROUBLESHOOTING: Please note that I named the Rancher container "rancher" (the --name=rancher docker run flag). Create a directory on your host:
Run rancher container with the following command. It is essential to mount volumes for debugging purposes.
Execute
If Rancher fails to start with a connection error such as '[INFO] Waiting for server to become available: Get "https://127.0.0.1:6444/version?timeout=15m0s": dial tcp 127.0.0.1:6444: connect: connection refused', then you need to investigate /rancher/k3s.log on your host. I found that iptables was missing on the host.
Add both modules to the file
and reboot your host.
Then I got an issue with an x509 certificate, which was resolved by the following commands (please note that I named my container "rancher"):
When you finish troubleshooting and have the missing iptables modules loaded into the kernel, you can run docker in the way provided in the solution section, without the volumes attached and the host network. The issue with the invalid x509 certificates could be a side effect of troubleshooting. |
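The comment above does not preserve the module names or the file they were added to, so the following is only a sketch of how one might check the two conditions it describes: whether an iptables binary is present, and whether a given kernel module is loaded. The function names and EXAMPLE_MODULE are hypothetical placeholders, not names from the thread.

```shell
#!/bin/sh
# Hypothetical helpers; names are illustrative only.

# has_iptables: succeed if an iptables binary is on PATH.
has_iptables() {
    command -v iptables >/dev/null 2>&1
}

# module_loaded NAME [FILE]: succeed if NAME is in the loaded-module list.
# /proc/modules has one module per line, with the name in the first field.
module_loaded() {
    awk -v m="$1" '$1 == m { found = 1 } END { exit !found }' "${2:-/proc/modules}"
}

# Example (EXAMPLE_MODULE is a placeholder, not a name from the thread):
#   has_iptables || echo "iptables is not installed"
#   module_loaded EXAMPLE_MODULE || echo "EXAMPLE_MODULE is not loaded"
```

Note that modules compiled into the kernel do not appear in /proc/modules, so a negative result here does not always mean the functionality is missing.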
Works fine on Rocky Linux 9.3. |
@hhruszka your solution works on my Oracle Linux 9.3. |
Rancher Server Setup
Describe the bug
After I restarted my Ubuntu VM, my Rancher UI docker container is restarting every 15 seconds.
Here is the log:
Any advice how to make it work again?
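For anyone reproducing the report, a restart loop like this can be confirmed from the Docker side before digging into the Rancher logs. This is a minimal sketch that assumes the container is named "rancher" (the report does not say what it was called); the function names are hypothetical. It relies only on docker inspect with a Go template and docker logs --tail.

```shell
#!/bin/sh
# Hypothetical helpers; the container name is passed in by the caller.

# restart_count CONTAINER: print how many times Docker has restarted it.
restart_count() {
    docker inspect --format '{{.RestartCount}}' "$1"
}

# recent_logs CONTAINER [LINES]: print the last LINES log lines (default 50),
# merging stderr into stdout so the output can be piped to grep.
recent_logs() {
    docker logs --tail "${2:-50}" "$1" 2>&1
}

# Example:
#   restart_count rancher
#   recent_logs rancher 100 | grep -i 'error'
```

A count that keeps climbing between two calls a minute apart is consistent with the 15-second loop described here.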