Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Watchdog reach limit when internet is down #3439

Closed
tyjtyj opened this issue Jan 28, 2024 · 1 comment
Closed

Watchdog reach limit when internet is down #3439

tyjtyj opened this issue Jan 28, 2024 · 1 comment
Labels

Comments

@tyjtyj
Copy link

tyjtyj commented Jan 28, 2024

Describe the issue you are experiencing

Watchdog attempt to restart the addons such as Duck DNS and Cloudflared during internet down causing the addons to failed.

After 10 attemps while internet still down, watchdog reached limit

What type of installation are you running?

Home Assistant OS

Which operating system are you running on?

Home Assistant Operating System

Which add-on are you reporting an issue with?

Duck DNS

What is the version of the add-on?

1.15.0

Steps to reproduce the issue

  1. Disconnect the internet for more than 10 mins

...

System Health information

There are currently no repair available

Anything in the Supervisor logs that might be useful for us?

24-01-26 11:06:01 INFO (MainThread) [supervisor.docker.addon] Starting Docker add-on ghcr.io/brenner-tobias/cloudflared/aarch64 with version 5.1.0
24-01-26 11:06:06 WARNING (MainThread) [supervisor.addons.addon] Watchdog found addon Duck DNS is failed, restarting...
24-01-26 11:06:06 INFO (SyncWorker_5) [supervisor.docker.manager] Cleaning addon_core_duckdns application
24-01-26 11:06:10 INFO (MainThread) [supervisor.docker.addon] Starting Docker add-on homeassistant/aarch64-addon-duckdns with version 1.15.0
24-01-26 11:06:22 WARNING (MainThread) [supervisor.addons.addon] Watchdog found addon Cloudflared is failed, restarting...
24-01-26 11:06:22 INFO (SyncWorker_0) [supervisor.docker.manager] Cleaning addon_9074a9fa_cloudflared application
24-01-26 11:06:25 INFO (MainThread) [supervisor.docker.addon] Starting Docker add-on ghcr.io/brenner-tobias/cloudflared/aarch64 with version 5.1.0
24-01-26 11:06:30 WARNING (MainThread) [supervisor.addons.addon] Watchdog found addon Duck DNS is failed, restarting...
24-01-26 11:06:30 INFO (SyncWorker_2) [supervisor.docker.manager] Cleaning addon_core_duckdns application
24-01-26 11:06:34 INFO (MainThread) [supervisor.docker.addon] Starting Docker add-on homeassistant/aarch64-addon-duckdns with version 1.15.0
24-01-26 11:06:45 WARNING (MainThread) [supervisor.addons.addon] Watchdog found addon Cloudflared is failed, restarting...
24-01-26 11:06:45 INFO (SyncWorker_1) [supervisor.docker.manager] Cleaning addon_9074a9fa_cloudflared application
24-01-26 11:06:49 INFO (MainThread) [supervisor.docker.addon] Starting Docker add-on ghcr.io/brenner-tobias/cloudflared/aarch64 with version 5.1.0
24-01-26 11:06:54 ERROR (MainThread) [asyncio] Task exception was never retrieved
future: <Task finished name='Task-20606' coro=<Addon.watchdog_container() done, defined at /usr/src/supervisor/supervisor/addons/addon.py:1346> exception=AddonsJobError('Rate limit exceeded, more than 10 calls in 0:30:00')>
Traceback (most recent call last):
  File "/usr/src/supervisor/supervisor/addons/addon.py", line 1360, in watchdog_container
    await self._restart_after_problem(event.state)
  File "/usr/src/supervisor/supervisor/jobs/decorator.py", line 276, in wrapper
    raise on_condition(
supervisor.exceptions.AddonsJobError: Rate limit exceeded, more than 10 calls in 0:30:00
24-01-26 11:07:08 ERROR (MainThread) [asyncio] Task exception was never retrieved
future: <Task finished name='Task-21164' coro=<Addon.watchdog_container() done, defined at /usr/src/supervisor/supervisor/addons/addon.py:1346> exception=AddonsJobError('Rate limit exceeded, more than 10 calls in 0:30:00')>
Traceback (most recent call last):
  File "/usr/src/supervisor/supervisor/addons/addon.py", line 1360, in watchdog_container
    await self._restart_after_problem(event.state)
  File "/usr/src/supervisor/supervisor/jobs/decorator.py", line 276, in wrapper
    raise on_condition(
supervisor.exceptions.AddonsJobError: Rate limit exceeded, more than 10 calls in 0:30:00

Anything in the add-on logs that might be useful for us?

No response

Additional information

Watchdog attempt to restart the service 10 times within 4 mins. This resulting the limit reached.

Work around, dont restart when internet is down ?
Exponential restart, eg 1,5,10,30,60 mins and repeat 60 over 24 hours or more ?

The watchdog purpose is to ensure the container is working.

My current work around
Enable automation to monitor running Cloudflared container every 10 mins, if it is down, restarted the container.

Copy link

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant