
Issues when forced to rebuild corrupted index #206

Closed
stigok opened this issue Sep 28, 2018 · 1 comment

stigok commented Sep 28, 2018

Found an issue when Kafka nodes come back up after having failed. If a node wakes up to a corrupted index, it will attempt to fix it itself. This seems to have two major implications:

  • Memory consumption goes through the roof, and the pod gets killed due to OOM (because of the resource limits, of course)
  • The readiness probe fails, which will kill the pod if it hasn't already OOM'ed

Any thoughts on how to remedy this?
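For context, this is roughly the shape of the broker container spec that both failure modes interact with. A minimal sketch only; the probe type, port and values here are assumptions for illustration, not the actual kubernetes-kafka manifests:

```yaml
# Illustrative only: container name, limit and probe values are assumed.
containers:
  - name: broker
    resources:
      limits:
        memory: 1Gi            # index rebuild can exceed this, so the kubelet OOM-kills the pod
    readinessProbe:
      tcpSocket:
        port: 9092             # broker isn't serving while it rebuilds, so the probe keeps failing
      initialDelaySeconds: 15
      timeoutSeconds: 1
```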

stigok changed the title from "OOM when rebuilding corrupted index" to "Issues when forced to rebuild corrupted index" on Sep 28, 2018

solsson commented Sep 29, 2018

We haven't gotten around to implementing it yet, but the idea is to fix the default image to support the ./kafka-server-stop.sh command (solsson/dockerfiles@4fb7b5d) and to use a preStop pod lifecycle hook to invoke it.
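As a rough sketch of that idea (hypothetical, since the hook isn't implemented yet; the script path inside the image is an assumption):

```yaml
# Hypothetical preStop hook on the kafka container; the script location
# depends on the image layout and is assumed here.
lifecycle:
  preStop:
    exec:
      command: ["/bin/sh", "-c", "./bin/kafka-server-stop.sh"]
```

Paired with a sufficient terminationGracePeriodSeconds, the intent would be a clean broker shutdown so the index isn't left corrupted in the first place.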

solsson added this to the 4.0 milestone on Sep 29, 2018
solsson added a commit to StreamingMicroservicesPlatform/docker-kafka that referenced this issue on Sep 29, 2018:
the kafka image used in current stable github.com/Yolean/kubernetes-kafka

See:
 - #6
 - Yolean/kubernetes-kafka#206