[ECS] [request]: Ability to disable task restart behaviour on Healthcheck Failure #1373

raags · 2021-05-14T11:23:35Z

Community Note

Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
Please do not leave "+1" or "me too" comments, they generate extra noise for issue followers and do not help prioritize the request
If you are interested in working on this issue or have submitted a pull request, please leave a comment

Tell us about your request
What do you want us to build?

Right now, if a container health check fails for an essential container in an ECS task, the task is restarted. This is undesirable in most cases, because the health-check failure could be transient, caused by a traffic spike, network unavailability, a resource constraint, a misconfiguration, etc.

One would also want the failed task to remain running for debugging, to understand why the health check is failing.

This is related to ALB health checks, which behave in the same way (#1271 and #289), but this ticket is about container health checks.

I think a flag to control this behaviour (for container and ALB health-check restart behaviour) would help, so that even if the health check fails, ECS does not stop the task.

Which service(s) is this request for?
Fargate, ECS

Tell us about the problem you're trying to solve. What are you trying to do, and why is it hard?
Explained above

Are you currently working around this issue?
By not using ECS container health checks, and instead using a third-party sidecar to do the same.
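
As a rough illustration of this workaround (not the actual sidecar in use), a minimal Python sketch of a sidecar-style checker could look like the following. It assumes the sidecar runs as a non-essential container with no ECS `healthCheck` of its own, and that the application exposes an HTTP health endpoint. The names `probe`, `HealthTracker`, `watch`, and the URL are hypothetical. The checker only reports state changes; it never stops the task.

```python
import http.client
import time
import urllib.error
import urllib.request


def probe(url: str, timeout: float = 2.0) -> bool:
    """Return True if the endpoint answers with a 2xx status within the timeout."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return 200 <= resp.status < 300
    except (urllib.error.URLError, http.client.HTTPException, OSError, ValueError):
        # Connection errors, timeouts, non-2xx responses and bad URLs all count as failures.
        return False


class HealthTracker:
    """Counts consecutive failures and reports only state transitions."""

    def __init__(self, unhealthy_after: int = 3) -> None:
        self.unhealthy_after = unhealthy_after
        self.failures = 0
        self.healthy = True

    def record(self, ok: bool):
        """Return 'unhealthy' or 'recovered' on a transition, otherwise None."""
        if ok:
            self.failures = 0
            if not self.healthy:
                self.healthy = True
                return "recovered"
            return None
        self.failures += 1
        if self.healthy and self.failures >= self.unhealthy_after:
            self.healthy = False
            return "unhealthy"
        return None


def watch(url: str, interval: float = 10.0, unhealthy_after: int = 3) -> None:
    """Probe forever and print transitions; swap the print for a real alert."""
    tracker = HealthTracker(unhealthy_after)
    while True:
        event = tracker.record(probe(url))
        if event:
            print(f"health check {event}: {url}", flush=True)
        time.sleep(interval)


# Example invocation (hypothetical endpoint):
# watch("http://localhost:8080/health")
```

Requiring several consecutive failures before reporting keeps a single transient failure from raising an alert, and the task stays up for debugging either way.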

Additional context
Health-check failures already generate a CloudWatch event, which can be used for alerting. With this alert, the engineer can investigate the issue and decide whether to restart or provision additional tasks, without losing the running task, which remains available for debugging.

whelanp · 2023-03-13T09:13:59Z

Hello @raags, could you share the name of your third-party sidecar? We are having the same issue and would like to apply this workaround too!

raags · 2023-03-13T09:56:56Z

Hi @whelanp - I was using a Sensu sidecar for the health checks. There are others that can do the same.

raags added the Proposed Community submitted issue label May 14, 2021
