Memory leaking (when recovering from endpoint failures?) #505
Comments
Please try the 1.32.3. |
This might be worse? In 10 minutes the process was taking up over 100M of resident memory and was well on its way to getting killed off by the oom-killer. Worth mentioning: when these broken pipe messages are coming through the
|
What is the output of |
Same as in #506:
|
I'm also seeing this same issue, running on NextDNS v1.32.3, also EdgeRouter X v2.0.9-hotfix.1. Diag output: https://nextdns.io/diag/7b4e8710-c46f-11eb-82a5-a3ebfef2ae33 dmesg:
nextdns config:
|
Facing the same on an EdgeRouter Lite for the past 48hrs:
Running latest version:
Config:
Some parts of the log (
|
Facing a very similar issue, except that I am not seeing OOM kills. Instead, the nextdns process starts using all 4 CPU cores on my poor EdgeRouter X, and my entire network becomes unresponsive until I reboot the router.
|
Can you please run |
|
In order to debug this, we rolled back the cert change. Can you please restart the process and report whether it fixes the issue? It would help us a lot. Thanks. |
Thanks Olivier! @rs I had upgraded to 1.32.3 after sending my last comment. I have now restarted the process using |
Can you also please show the output of this command: |
From my EdgeRouter X:
|
Took forever to run, but diag completed: https://nextdns.io/diag/4e513a80-c4d0-11eb-ba3e-9583b47c3fcd |
I have not seen the issue again since the cert rollback and the upgrade to 1.32.3. I think this issue can be closed. |
The issue is still there and a fix will be pushed soon. |
Is this issue already fixed? I still have this problem on CLI 1.37.2. Memory usage grows by about 35 MB in just a minute because of endpoint failure (?). The log is full of cache fallback HTTP/2.0: doh resolve: context deadline exceeded. |
It's normal that memory rises, but it should be bounded by the max number of allowed parallel queries. |
Still not fixed. Having the same issue on Xiaomi Mi Wifi Mini on latest OpenWRT. NextDNS on OpenWRT takes 543% of memory and 50% of CPU.
Tue Dec 14 00:10:36 2021 daemon.warn nextdns[28407]: Endpoint failed: https://dns.nextdns.io#116.203.147.209,2a01:4f8:c0c:fa3f::1: roundtrip: context deadline exceeded
Tue Dec 14 00:10:41 2021 daemon.warn nextdns[28407]: Endpoint failed: https://dns.nextdns.io#217.146.22.163,2a00:11c0:e:ffff:1::d: roundtrip: context deadline exceeded
Tue Dec 14 00:10:46 2021 daemon.warn nextdns[28407]: Endpoint failed: https://dns1.nextdns.io#45.90.28.0,2a07:a8c0::: roundtrip: context deadline exceeded
Tue Dec 14 00:10:52 2021 daemon.warn nextdns[28407]: Endpoint failed: https://dns2.nextdns.io#45.90.30.0,2a07:a8c1::: roundtrip: context deadline exceeded |
Context
For days I've had a lot of problems with the NextDNS client failing. Lots of
cached HTTP/2.0: doh resolve: context deadline exceeded
in the logs. Starting yesterday, the process is getting killed by the OOM killer. This issue is going to focus on that. I'll open another to focus on why the failures are happening in the first place. It appears that when the following kind of dance occurs, the NextDNS resident memory size grows:
After enough of those, the process eventually is killed:
How fast this occurs seems related to how much DNS traffic is happening on my network. Last night I restarted things before bed and it took about 4 hours. This morning after restarting, it took about 90 minutes.
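For anyone who wants to put numbers on how fast the resident size grows, here is a small Go sketch that reads `VmRSS` from `/proc/<pid>/status`. It assumes a Linux system; `parseVmRSS` is a helper name made up for this example and is not part of the nextdns CLI.

```go
package main

import (
	"bufio"
	"fmt"
	"os"
	"strconv"
	"strings"
)

// parseVmRSS extracts the resident set size in kB from the text of
// /proc/<pid>/status. The expected line shape is "VmRSS:   12345 kB".
func parseVmRSS(status string) (int, error) {
	sc := bufio.NewScanner(strings.NewReader(status))
	for sc.Scan() {
		fields := strings.Fields(sc.Text())
		if len(fields) >= 2 && fields[0] == "VmRSS:" {
			return strconv.Atoi(fields[1])
		}
	}
	return 0, fmt.Errorf("VmRSS not found")
}

func main() {
	// Pass the PID of the process to inspect; defaults to this process.
	pid := "self"
	if len(os.Args) > 1 {
		pid = os.Args[1]
	}
	data, err := os.ReadFile("/proc/" + pid + "/status")
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1)
	}
	kb, err := parseVmRSS(string(data))
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1)
	}
	fmt.Printf("pid %s VmRSS %d kB\n", pid, kb)
}
```

Running it every minute or so against the nextdns PID and lining the readings up with the "Endpoint failed" log timestamps would show whether growth tracks the failure episodes or the overall query volume.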