Nomad didn't deregister services in Consul #810
Comments
@adrianlop Can you share the Nomad client logs from the machines where the services should have been deregistered?
OK, I'll rerun the reboots. Do you want the logs at DEBUG level?
Yes, please share the debug logs.
Now that you've released v0.3, I'm going to upgrade the whole cluster first and then try to reproduce this again. Thanks!
Hi again, I think what happened is:
So I'm going to close the issue for now. If I reboot the machines again and the problem shows up without any zombie Docker containers around, I'll reopen it and send you the debug logs. Thanks @diptanu!
Hi there,
I have Consul 0.6.3 and Nomad 0.2.3 agents running as system services.
When I rebooted some machines, the services running on these machines reallocated to other available machines. That worked like a charm.
The problem was that the services were still registered in Consul on the old machines (although not running there; I double-checked) as well as on the new ones.
I checked the Nomad logs on the old machines and there is no reference to the allocID (even after restarting the Nomad agent).
My guess is that when the reboot happened, the Nomad cluster saw that the machines were unreachable and reallocated the services, but when the machines came back up, the Nomad agents didn't ask Consul to deregister the services.
Can you please help? Is this fixed in Nomad 0.3, maybe?
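As a stopgap, stale entries like these can be cleaned up by hand through the local Consul agent's HTTP API (`/v1/agent/services` to list, `/v1/agent/service/deregister/<id>` to remove). The following is only a rough sketch of that idea, not how Nomad itself does it. The agent address, the `prefix` filter, and the `running_ids` set are assumptions for illustration; you would have to supply the IDs of the services that are really running on the node.

```python
import json
import urllib.parse
import urllib.request

# Assumption: the Consul agent listens on its default local HTTP address.
CONSUL_ADDR = "http://127.0.0.1:8500"


def stale_service_ids(services, running_ids, prefix=""):
    """Return the sorted service IDs that Consul still lists but that are
    not in running_ids. `prefix` is a hypothetical filter so that only
    IDs of interest (e.g. those created by Nomad) are considered."""
    return sorted(
        sid for sid in services
        if sid.startswith(prefix) and sid not in running_ids
    )


def list_agent_services(addr=CONSUL_ADDR):
    """Fetch the services registered with the local agent, keyed by ID."""
    with urllib.request.urlopen(f"{addr}/v1/agent/services") as resp:
        return json.load(resp)


def deregister(service_id, addr=CONSUL_ADDR):
    """Ask the local agent to deregister one service by its ID."""
    quoted = urllib.parse.quote(service_id, safe="")
    req = urllib.request.Request(
        f"{addr}/v1/agent/service/deregister/{quoted}", method="PUT"
    )
    with urllib.request.urlopen(req) as resp:
        return resp.status
```

On an affected machine, one would call `list_agent_services()`, pass the result to `stale_service_ids(...)` together with the IDs known to be running, and then call `deregister(...)` for each ID returned. Check the list before deregistering anything, since the filter only knows what you tell it.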