Agent keeps failing and host state stucks at "Reconnecting" #3908
Comments
Unable to reproduce. I launched an Azure host using the Azure driver (new in v0.63.0, it has its own dedicated page) and added a service onto it. It launched the containers fine and did not go into reconnecting. I will keep it up for the next 24 hours and try to monitor. Once it goes into reconnecting, does it recover on its own?
@deniseschannon No, I have to restart the rancher-server.
I am unable to reproduce. I left my box up for 24 hours and had no issues where the host went into reconnecting. Why do you need to restart rancher-server? Couldn't you just re-add the agent?
@deniseschannon Because our production environment has many hosts. It is easier to have a bash script that restarts rancher-server every 2 hours and prevents reconnecting.
@deniseschannon At this moment I'm getting a reconnecting host in my test environment. If you want, I can provide you access to this environment. Just send me an email at fernando.neto@junglecloud.com
I'm also experiencing the same issue. I think it started after we rebooted the rancher server host.
Are you still having these issues with Rancher v1.1.1?
@bacheson Would you mind opening a new issue for yours? I know the others were using Azure and it looks like you are using Packet. I'll close this one because of the old version of Rancher as well.
Rancher Version :
v0.59.1
Docker Version:
1.10.2
OS:
Ubuntu 14.04.4 LTS (3.13.0-79-generic)
Steps to Reproduce:
Results:
After some time the host goes to "Reconnecting", and we have to restart the rancher-server container for the host to go to "Active" again.
Expected:
Hosts recover from reconnecting state automatically.
Related:
#2196