HEALTHCHECK Instructions? #915

rwmajor2 · 2019-07-30T11:14:42Z

I have a general question. We base some of our Docker images on the Docker-stacks Dockerfiles. We are following CISP Docker Security hardening guidelines and one of the checklist items is:

 Ensure that HEALTHCHECK instructions have been added to container images
	Guidance:  Docker engine periodically checks the running container instances against that instruction to ensure containers are still operational

Can anyone provide any suggestions on what this is and how we may implement it in docker-stacks? According to Docker help documentation, a HEALTHCHECK might be something like:

HEALTHCHECK --interval=5m --timeout=3s CMD curl -f http://localhost/ || exit 1

Thoughts?

@GrahamDumpleton

The text was updated successfully, but these errors were encountered:

GrahamDumpleton · 2019-07-30T11:23:02Z

Can't see what health checks have got to do with security hardening. They are an operations feature to ensure your application is running. Unless they think that your application not running may have been caused by hackers and so qualifies somehow as a security event. Also be aware that defining health checks in a container image itself only really pertains to Docker's container run time. They aren't used by other container platforms such as Kubernetes. In Kubernetes health checks are defined in the separate deployment configuration of Kubernetes where they more rightly belong. So not sure if it really belongs in the container image itself. It should be really part of how you deploy things in the container platform.

rwmajor2 · 2019-07-30T11:32:34Z

Thanks @GrahamDumpleton, that's fair enough. Thanks for the feedback.

FYI, below is the "rationale" from the CISP guideline:

An important security control is that of availability. Adding the HEALTHCHECK instruction to your container image ensures that the Docker engine periodically checks the running container instances against that instruction to ensure that containers are still operational.
Based on the results of the health check, the Docker engine could terminate containers which are not responding correctly, and instantiate new ones.

romainx · 2020-04-30T12:19:46Z

Hello @rwmajor2

It's a late answer however I hope it could be useful to someone.

Standard HTTP probe

As far as I know there is currently no specific "health" end point available on Jupyter, it seems to be confirmed by this issue jupyter/notebook#1857.
Since notebook are protected it's not possible to use a standard HTTP health check like.

# I'm using wget since curl is not available
HEALTHCHECK CMD wget -q --spider http://127.0.0.1:8888 > /dev/null || exit 1

This will always return HTTP 405 and the wget command the 8 exit status, meaning
"Server issued an error response".

$ wget -q --spider http://127.0.0.1:8888/
# [W 12:07:40.913 NotebookApp] 405 HEAD / (127.0.0.1) 0.72ms referer=None

$ echo $?
# 8

An alternative

As an alternative it's possible to check if the Jupyter process is running through pgrep.

HEALTHCHECK CMD pgrep "jupyter" > /dev/null || exit 1

Here is the result

$ docker ps             
# CONTAINER ID        IMAGE                   COMMAND                  CREATED             STATUS                             PORTS                    NAMES
# 162adb12d75d        jupyter/base-notebook   "tini -g -- start-no…"   29 seconds ago      Up 28 seconds (health: starting)   0.0.0.0:8888->8888/tcp   vibrant_agnesi

$ docker ps
# CONTAINER ID        IMAGE                   COMMAND                  CREATED             STATUS                    PORTS                    NAMES
# 162adb12d75d        jupyter/base-notebook   "tini -g -- start-no…"   34 seconds ago      Up 33 seconds (healthy)   0.0.0.0:8888->8888/tcp   vibrant_agnesi

You can change the HEALTHCHECK settings (interval, timeout, etc.) as explained in the documentation.

Hope it helps. Please tell us if this is the case.
Best

lmeyerov · 2021-07-13T05:18:47Z

We used to do pgrep "jupyter" as described above, but find Jupyter kernels are prone to getting wedged without crashing, e.g., 100% CPU or IO, so we find this check not so great for availability in practice. Don't have an alt, however.

mathbunnyru · 2022-03-15T22:29:20Z

I think this is actually possible in a reliable way!

jupyter/notebook#1857 (comment)

It should be something like this:
HEALTHCHECK CMD curl --fail http://localhost:8888/api || exit 1

I checked that /api returns 200 for lab, notebook, nbclassic, server and retro jupyter commands.

Also, I'm not sure we should add this to our docker files at this point - people might using custom command, which do not actually launch jupyter subcommand.
So, I think, I will add this to docs and it should be fine.

lmeyerov · 2022-03-16T00:59:15Z

Ah, so curl -f (so responsive to HTTP error codes), nice, thanks!

mathbunnyru · 2022-03-16T13:04:35Z

Ah, so curl -f (so responsive to HTTP error codes), nice, thanks!

Thanks, fixed 👍

carlosefr · 2022-04-20T17:39:51Z

This change doesn't seem to work when the image is used in Jupyter Hub. The URL used in the health check fails with a 404.

parente added the type:Question A question about the use of the docker stack images label Aug 4, 2019

mathbunnyru mentioned this issue Mar 17, 2022

Add healthcheck command to Dockerfile #1660

Merged

mathbunnyru closed this as completed in #1660 Mar 18, 2022

yacchin1205 mentioned this issue Apr 22, 2022

Fix HEALTHCHECK command for JupyterHub #1687

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HEALTHCHECK Instructions? #915

HEALTHCHECK Instructions? #915

rwmajor2 commented Jul 30, 2019

GrahamDumpleton commented Jul 30, 2019

rwmajor2 commented Jul 30, 2019

romainx commented Apr 30, 2020

lmeyerov commented Jul 13, 2021 •

edited

Loading

mathbunnyru commented Mar 15, 2022 •

edited

Loading

lmeyerov commented Mar 16, 2022 •

edited

Loading

mathbunnyru commented Mar 16, 2022

carlosefr commented Apr 20, 2022

HEALTHCHECK Instructions? #915

HEALTHCHECK Instructions? #915

Comments

rwmajor2 commented Jul 30, 2019

GrahamDumpleton commented Jul 30, 2019

rwmajor2 commented Jul 30, 2019

romainx commented Apr 30, 2020

Standard HTTP probe

An alternative

lmeyerov commented Jul 13, 2021 • edited Loading

mathbunnyru commented Mar 15, 2022 • edited Loading

lmeyerov commented Mar 16, 2022 • edited Loading

mathbunnyru commented Mar 16, 2022

carlosefr commented Apr 20, 2022

lmeyerov commented Jul 13, 2021 •

edited

Loading

mathbunnyru commented Mar 15, 2022 •

edited

Loading

lmeyerov commented Mar 16, 2022 •

edited

Loading