Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Services with health checks sometimes get stuck in degraded/initializing #2516

Closed
cjellick opened this issue Nov 2, 2015 · 12 comments
Closed
Labels
kind/question Issues that just require an answer. No code change needd

Comments

@cjellick
Copy link

cjellick commented Nov 2, 2015

In building my kubernetes stack, when I added health checks, about half the time, the services/containers never got passed the degraded/initializing phase.

@cjellick cjellick added the kind/bug Issues that are defects reported by users or that we know have reached a real release label Nov 2, 2015
@cusspvz
Copy link

cusspvz commented Nov 3, 2015

This happens to me with hosts that are going to be or are in "Reconnecting" state.

@alena1108
Copy link

@cusspvz we've put a bunch of fixes related to healthchecks + hosts in reconnecting state combination - all will be available in next rancher release. The problem @cjellick is facing, is different. Today we don't support health checks for instances having Host networking. So if the health check is specified for such instance, the instance will be in initializing state forever.

@janeczku
Copy link
Contributor

This just happened to me with Rancher DNS service on a new DigitalOcean host and i can't get it working. Can you advise how to troubleshoot/debug the stuck in initializing sate?
Update: This was actually caused by IPSec connectivity issue (at least in my case): #2901 (comment)

@will-chan will-chan added kind/question Issues that just require an answer. No code change needd and removed kind/bug Issues that are defects reported by users or that we know have reached a real release labels Jan 6, 2016
@will-chan
Copy link
Contributor

not enough information here so closing issue.

@raphink
Copy link

raphink commented Feb 2, 2016

I'm using rancher 0.56.1 and seeing this behavior:

Services I've set health checks for start fine, work fine (they're even behind a LB and working good). I see the health checks in the container logs passing (and returning a 200 code), but both services are marked as "Initializing".

@raphink
Copy link

raphink commented Feb 2, 2016

Also, the load-balancer itself (using rancher/load-balancer-service, with a health-check) is marked as "Initializing" as well, even though it is working fine.

@raphink
Copy link

raphink commented Feb 2, 2016

screenshot from 2016-02-02 21 06 36

and the bottom of the load-balancer logs (one of the containers):

screenshot from 2016-02-02 21 07 13

@raphink
Copy link

raphink commented Feb 2, 2016

Edit: after a while, the mapserver service was not load-balanced anymore. Stop and restarting it put it behind the LB again. That container is an Apache container. Here's an excerpt of its logs, showing the health check pings passing:

screenshot from 2016-02-02 21 10 15

@raphink
Copy link

raphink commented Feb 3, 2016

I removed switched a TCP check instead of an HTTP check (removed the request_line: parameter) and I'm still seeing the same behavior (services marked as "Initializing")

@raphink
Copy link

raphink commented Feb 3, 2016

Upgrading services to remove the health checks completely sets them to the "Active" state. The load-balancer however is still marked as "Initializing", even after a restart.

@raphink
Copy link

raphink commented Feb 3, 2016

OK, that turned out to be a firewall issue. When I added a rule to my security group to allow all traffic on all protocols from all sources, the health checks passed. However, allowing them from within the security group of the nodes or from the rancher server did not need to suffice.

Which firewall rules are actually necessary for health checks to work properly?

@deniseschannon
Copy link

Health check ports for load balancers are port 42.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/question Issues that just require an answer. No code change needd
Projects
None yet
Development

No branches or pull requests

7 participants