Stop container health check also on kill event #1699

nevalla · 2017-01-22T13:50:46Z

When a container is stopped (docker stop <container>), Docker sends a container event with status kill. Currently the agent stops the container health check worker only on the die event.

nevalla · 2017-01-22T15:22:41Z

@jakolehm @jnummelin Do you see any side effects because of this?

nevalla · 2017-01-23T06:48:50Z

Hmm... or should we only track the kill event?

jnummelin · 2017-01-23T11:53:31Z

Hmm, stopping the health check on both die and kill does not really make sense to me. At least according to the Docker event states described in moby/moby#12164 (comment), all containers should get a die event when they are being stopped. I.e. when you stop a container, you should get the events kill -> die -> stop.
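
To make the kill -> die -> stop sequence concrete, here is a minimal sketch of how a worker could react to it when both kill and die are listed as stop events. HealthCheckTracker and its methods are hypothetical stand-ins for illustration, not the agent's actual classes; only the START_EVENTS and STOP_EVENTS values mirror the diff in this PR.

```ruby
# Hypothetical bookkeeping class, for illustration only (not Kontena agent code).
class HealthCheckTracker
  START_EVENTS = ['start'].freeze
  STOP_EVENTS = ['die', 'kill'].freeze

  attr_reader :log

  def initialize
    @workers = {} # container id => true while a health check is active
    @log = []     # records which event actually started/stopped the check
  end

  def on_event(container_id, status)
    if START_EVENTS.include?(status)
      @workers[container_id] = true
      @log << [:started, container_id, status]
    elsif STOP_EVENTS.include?(status)
      # Hash#delete returns nil when nothing is tracked, so a second
      # stop event for the same container is a no-op.
      @log << [:stopped, container_id, status] if @workers.delete(container_id)
    end
  end

  def checking?(container_id)
    @workers.key?(container_id)
  end
end

tracker = HealthCheckTracker.new
%w[start kill die stop].each { |status| tracker.on_event('abc123', status) }
```

Under these assumptions the check is stopped by the first kill, and the later die finds nothing left to stop.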

SpComb · 2017-01-23T12:20:22Z

It definitely makes more sense to stop health-checking on kill, because the process may stop responding to health checks while shutting down, and we don't want to restart it while it's shutting down... Consider a process that traps the SIGTERM signal, closes its listening socket, waits for any pending transactions to complete, and flushes out any write buffers before exiting... it may take an arbitrarily long time between the Docker kill and the die events (assuming that die = the container pid=1 process has exited).
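
A rough way to see the size of that window is to replay a timeline of events against a fixed probe interval. The sketch below is purely illustrative: the timestamps, the 10 second interval, and the probes_while_active helper are made-up assumptions, not measurements or agent code.

```ruby
# Returns the probe times that fire while a health check would still be active,
# given which Docker event statuses are treated as "stop" events.
def probes_while_active(events, stop_events, probe_times)
  start_at = events.find { |e| e[:status] == 'start' }&.fetch(:at)
  return [] if start_at.nil?

  stop_at = events.find { |e| stop_events.include?(e[:status]) }&.fetch(:at)
  probe_times.select { |t| t >= start_at && (stop_at.nil? || t < stop_at) }
end

# Hypothetical timeline (seconds): the process traps SIGTERM at t=30
# and needs 25 seconds to drain before pid 1 exits.
events = [
  { status: 'start', at: 0 },
  { status: 'kill',  at: 30 },
  { status: 'die',   at: 55 },
  { status: 'stop',  at: 55 }
]
probe_times = (10..60).step(10).to_a

only_die     = probes_while_active(events, ['die'], probe_times)
die_and_kill = probes_while_active(events, ['die', 'kill'], probe_times)
```

With these made-up numbers, stopping only on die leaves probes at t=30, 40 and 50 hitting a process that is already shutting down, while stopping on kill ends probing after t=20.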

Even with the kill coming before the die, the Kontena::Workers::ContainerHealthCheckWorker is still racy with the Kontena::ServicePods::Stopper/Terminator, though... the agent should not be health-checking once it has stopped the container, but this process currently relies on the Docker::Container.stop -> Docker event -> Celluloid::Notifications container:event -> Celluloid::Actor.kill chain completing before the ContainerHealthCheckWorker periodic interval kicks in, and that's just a question of timing.
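
One possible way to close that race, sketched under loud assumptions, is for the stopper to record its intent synchronously before asking Docker to stop the container, and for the periodic check to consult that record. StoppingRegistry and run_health_check are hypothetical names invented for this illustration; nothing here is existing Kontena or Celluloid API.

```ruby
require 'set'

# Hypothetical registry shared by the stopper and the health checker.
class StoppingRegistry
  def initialize
    @mutex = Mutex.new
    @stopping = Set.new
  end

  # Called by the stopper *before* it issues the Docker stop.
  def mark_stopping(container_id)
    @mutex.synchronize { @stopping.add(container_id) }
  end

  def stopping?(container_id)
    @mutex.synchronize { @stopping.include?(container_id) }
  end
end

# Hypothetical periodic tick: skips the probe once a stop is in progress,
# regardless of whether the Docker event has been delivered yet.
def run_health_check(registry, container_id)
  return :skipped if registry.stopping?(container_id)

  yield
end

registry = StoppingRegistry.new
before_stop = run_health_check(registry, 'abc123') { :probed }
registry.mark_stopping('abc123')
after_stop = run_health_check(registry, 'abc123') { :probed }
```

In this sketch the probe runs before the stop is requested and is skipped afterwards, so the outcome no longer depends on how quickly the event chain completes.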

SpComb · 2017-01-23T13:26:18Z

agent/lib/kontena/workers/health_check_worker.rb

@@ -11,7 +11,7 @@ class HealthCheckWorker
    finalizer :terminate_workers

    START_EVENTS = ['start']
-    STOP_EVENTS = ['die']
+    STOP_EVENTS = ['die', 'kill']
    ETCD_PREFIX = '/kontena/log_worker/containers'


Mmm stale copy-pasta :)

Stop container health check on kill event

d81153f

nevalla added the agent and enhancement labels Jan 22, 2017

jakolehm added this to the 1.1.0 milestone Jan 23, 2017

jakolehm approved these changes Jan 23, 2017

jakolehm added the status/merge label Jan 23, 2017

SpComb reviewed Jan 23, 2017

kke merged commit cb8c17b into master Jan 23, 2017

SpComb mentioned this pull request Feb 28, 2017

Refactor agent to pull services desired state from the master #1873

Merged
