[Carry 22719] healthcheck feature #23218

thaJeztah · 2016-06-02T22:22:16Z

closes #22719
closes #21143
closes #21142

This PR adds support for user-defined health-check probes for Docker containers. It adds a `HEALTHCHECK` instruction to the Dockerfile syntax plus some corresponding "docker run" options. It can be used with a restart policy to automatically restart a container if the check fails. The `HEALTHCHECK` instruction has two forms: * `HEALTHCHECK [OPTIONS] CMD command` (check container health by running a command inside the container) * `HEALTHCHECK NONE` (disable any healthcheck inherited from the base image) The `HEALTHCHECK` instruction tells Docker how to test a container to check that it is still working. This can detect cases such as a web server that is stuck in an infinite loop and unable to handle new connections, even though the server process is still running. When a container has a healthcheck specified, it has a _health status_ in addition to its normal status. This status is initially `starting`. Whenever a health check passes, it becomes `healthy` (whatever state it was previously in). After a certain number of consecutive failures, it becomes `unhealthy`. The options that can appear before `CMD` are: * `--interval=DURATION` (default: `30s`) * `--timeout=DURATION` (default: `30s`) * `--retries=N` (default: `1`) The health check will first run **interval** seconds after the container is started, and then again **interval** seconds after each previous check completes. If a single run of the check takes longer than **timeout** seconds then the check is considered to have failed. It takes **retries** consecutive failures of the health check for the container to be considered `unhealthy`. There can only be one `HEALTHCHECK` instruction in a Dockerfile. If you list more than one then only the last `HEALTHCHECK` will take effect. The command after the `CMD` keyword can be either a shell command (e.g. `HEALTHCHECK CMD /bin/check-running`) or an _exec_ array (as with other Dockerfile commands; see e.g. `ENTRYPOINT` for details). The command's exit status indicates the health status of the container. The possible values are: - 0: success - the container is healthy and ready for use - 1: unhealthy - the container is not working correctly - 2: starting - the container is not ready for use yet, but is working correctly If the probe returns 2 ("starting") when the container has already moved out of the "starting" state then it is treated as "unhealthy" instead. For example, to check every five minutes or so that a web-server is able to serve the site's main page within three seconds: HEALTHCHECK --interval=5m --timeout=3s \ CMD curl -f http://localhost/ || exit 1 To help debug failing probes, any output text (UTF-8 encoded) that the command writes on stdout or stderr will be stored in the health status and can be queried with `docker inspect`. Such output should be kept short (only the first 4096 bytes are stored currently). When the health status of a container changes, a `health_status` event is generated with the new status. The health status is also displayed in the `docker ps` output. Signed-off-by: Thomas Leonard <thomas.leonard@docker.com> Signed-off-by: Sebastiaan van Stijn <github@gone.nl>

Signed-off-by: Sebastiaan van Stijn <github@gone.nl>

icecrime · 2016-06-03T00:23:19Z

Thanks @thaJeztah @crosbymichael! Congratulations @talex5 🎉

thaJeztah · 2016-06-03T12:31:32Z

pinging our awesome duo @albers and @sdurrheimer (for the new flags)

konobi · 2016-06-03T20:32:37Z

doesn't the remote api allow you to access resources on the remote side? This would allow for the client to check for file existence "READY_ON /tmp/container_is_up_and_read"?

For healthchecking on a more ongoing basis, I think something like containerpilot is going to be the better solution going forward.

Thomas Leonard and others added 2 commits June 2, 2016 23:58

Bump engine-api to fa04f66c7871183dd53a5ec666479f49b452743d

76d8b0d

Signed-off-by: Sebastiaan van Stijn <github@gone.nl>

GordonTheTurtle added area/distribution status/0-triage labels Jun 2, 2016

thaJeztah added status/2-code-review and removed status/0-triage labels Jun 2, 2016

thaJeztah mentioned this pull request Jun 2, 2016

Add support for user-defined healthchecks #22719

Closed

thaJeztah added this to the 1.12.0 milestone Jun 2, 2016

thaJeztah added impact/api impact/changelog impact/cli impact/dockerfile labels Jun 2, 2016

crosbymichael merged commit ce255f7 into moby:master Jun 2, 2016

crosbymichael deleted the carry-22719-healthcheck-feature branch June 2, 2016 23:58

This was referenced Jun 3, 2016

Proposal - Application-defined "alive probe" #21142

Closed

Proposal - Image defined probe #21143

Closed

elgalu mentioned this pull request Jun 3, 2016

Replace wait_all_done with docker healthcheck elgalu/docker-selenium#100

Closed

albers mentioned this pull request Jun 3, 2016

bash completion for docker run healthcheck options #23239

Merged

CpuID mentioned this pull request Jun 9, 2016

Is there a way to delay container startup to support dependant services with a longer startup time docker/compose#374

Closed

nishanttotla mentioned this pull request Jun 10, 2016

Healthcheck Support moby/swarmkit#641

Closed

sdurrheimer mentioned this pull request Jun 12, 2016

Add zsh completion for 'docker run' healthcheck options #23472

Merged

zarbis mentioned this pull request Aug 1, 2016

Add HEALTHCHECK entry to Dockerfile to reliably monitor container readiness status docker-library/mysql#196

Closed

vshcherb mentioned this pull request Sep 17, 2016

No possibility to specify grace period in the Healthcheck for service startup #26664

Closed

This was referenced Nov 25, 2016

Improve the docker-compose dependency workaround lucasmauricio/balut#3

Closed

Improve the docker-compose dependency workaround lucasmauricio/arrakis#3

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Carry 22719] healthcheck feature #23218

[Carry 22719] healthcheck feature #23218

thaJeztah commented Jun 2, 2016 •

edited

icecrime commented Jun 3, 2016

thaJeztah commented Jun 3, 2016

konobi commented Jun 3, 2016

[Carry 22719] healthcheck feature #23218

[Carry 22719] healthcheck feature #23218

Conversation

thaJeztah commented Jun 2, 2016 • edited

icecrime commented Jun 3, 2016

thaJeztah commented Jun 3, 2016

konobi commented Jun 3, 2016

thaJeztah commented Jun 2, 2016 •

edited