-
Notifications
You must be signed in to change notification settings - Fork 2.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Rancher network agent stuck in restart loop - DNS lookup issue #4237
Comments
What version did you upgrade from? Anything else you can share about your setup? |
The previous version of Rancher was 0.63.1. On Mar 31, 2016, 10:12 AM +0700, Denisenotifications@github.com, wrote:
|
I am getting a little closer to the issue. From within the Rancher Network container (which is the only container that is partially running) I cannot ping anything, including |
Is it possible that somehow the new Rancher broke the DNS? |
So I spawned a fresh VPS with Docker on DigitalOcean. The host is perfectly healthy, and I can ping any popular domain, including my own Rancher server domain. I then manually added this host to Rancher, and started spinning up contains, and the same issue is happening, that, from within the network agent, I CANNOT ping anything with domain name. Ping by IP address is fine though. |
I have created a new environment, and I am still seeing the same issue, that I cannot ping anything by domain name from within the network agent (agent-instance) container. I am not sure if the same issue applies to other containers, since I cannot start any of them. I think this is a bug with DNS in rancher/agent-instance:0.8.1. Unfortunately there is no way for one to specify a specific version for Rancher network agent image, so I cannot revert to 0.8.0 for testing. All my existing Rancher network agent containers (running 0.8.0) are still functioning normally (and I can ping domain names). This is a serious blocking issue for me since I cannot deploy anything when I am very close to launching my product. |
Just to clarify, right now cross host networking is working, but DNS is not? Can you try launching container and manually editting |
Here is the original content of
I then manually edited the file to only:
Then I was able to ping my domain (and other popular domains). Because of this, the container succeeded in downloading what it needs, and no long stuck in an infinite cycle. Other containers (my service containers) also got deployed successfully as well. Then I try to restart the network agent container to see if this fix persist, but the To answer your first question: yes, cross host networking is working, but DNS is not. |
For reference, here is the content of
So this points towards one of the first three lines in |
This issue is more serious than I thought. I so manually updated the Are you actively investigating this? I think this is a bug rather than just a question. |
Rancher does not support IPv6 yet, but I have the habit of enabling IPv6 on my DigitalOcean VPSs, so the Rancher network agents also add the ipv6 entries in |
@nlhkh Glad to hear you figured it out. Here's the request to support ipv6: #1403 |
Hi, I have exactly the same problem with ipv6 disabled on my host : nano /etc/sysctl.conf After a network restart, I can see than ipv6 is off. Versions :
No problem if I return on Rancher 0.63.1 |
BTW I had the same thing without ipv6. Fixed by tuning my DNS/DHCP boxes and recreating all affected Rancher server/agent/agent-instance/agent-state containers. |
After upgrading to Rancher v1.0.0, and Rancher agent v0.8.1, there seems to be a bug on Rancher agent that prevents the agent from starting, and thus cannot start any other services.
The log is as follow
v1.0.0
Cattle:v0.159.2
UI:v0.100.3
github
admin
container.labels
The text was updated successfully, but these errors were encountered: