fix: retry sooner on transient DNS failures during device update by mfncl9991 · Pull Request #161 · iprak/winix

mfncl9991 · 2026-04-18T18:25:27Z

Summary

During live testing of PR #158, occasional transient DNS failures were observed:

HomeAssistantError: Error communicating with Winix: Cannot connect to host us.api.winix-iot.com:443 ssl:default [Timeout while contacting DNS servers]

DNS recovers on its own, but with no retry logic the coordinator waits the full scan interval (30s) before trying again, leaving entities unavailable in the meantime.

Change

Catch ClientConnectorDNSError in _async_update_data and raise UpdateFailed(retry_after=timedelta(seconds=15)), letting the DataUpdateCoordinator framework retry sooner rather than waiting the full interval.

ClientConnectorDNSError is already wrapped into HomeAssistantError by the driver, so it's detected via __cause__. All other HomeAssistantError cases (HTTP errors, timeouts) are re-raised unchanged.

try:
    for device_wrapper in self._device_wrappers:
        await device_wrapper.update()
except HomeAssistantError as err:
    if isinstance(err.__cause__, aiohttp.ClientConnectorDNSError):
        raise UpdateFailed(retry_after=timedelta(seconds=15)) from err
    raise

References

Discussed in PR fix: update auth and device control for Winix API v1.5.7 #158 comments

When a DNS lookup for us.api.winix-iot.com fails transiently, the driver raises HomeAssistantError wrapping aiohttp.ClientConnectorDNSError. Previously this propagated unhandled, leaving entities unavailable until the next full coordinator interval (30s). Catch the DNS-specific case in _async_update_data and raise UpdateFailed(retry_after=15s) so the coordinator retries sooner rather than waiting the full scan interval. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

iprak

Thanks for looking into this and getting to it before me.

Per review feedback, replace the isinstance(__cause__) check with a dedicated exception class that makes retry intent explicit. - Add WinixTransientError(HomeAssistantError) to driver.py; raised from get_state for ClientError (covers DNS, connection) and TimeoutError - manager.py catches WinixTransientError and retries once via UpdateFailed(retry_after=15s); on second consecutive failure resets the flag and raises UpdateFailed() to resume normal poll interval - LOGGER.info added for both the retry trigger and the give-up case - Remove now-unused aiohttp and HomeAssistantError imports from manager.py Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

mfncl9991 mentioned this pull request Apr 18, 2026

fix: update auth and device control for Winix API v1.5.7 #158

Merged

5 tasks

iprak requested changes Apr 18, 2026

View reviewed changes

Comment thread custom_components/winix/manager.py Outdated

iprak merged commit 705e7cd into iprak:main Apr 19, 2026
1 check passed

iprak mentioned this pull request Apr 19, 2026

fix: use seconds value for retry_after #163

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: retry sooner on transient DNS failures during device update#161

fix: retry sooner on transient DNS failures during device update#161
iprak merged 2 commits intoiprak:mainfrom
mfncl9991:fix/dns-retry-on-update

mfncl9991 commented Apr 18, 2026

Uh oh!

iprak left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

mfncl9991 commented Apr 18, 2026

Summary

Change

References

Uh oh!

iprak left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants