-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
http: Accept error: accept tcp [::]:9115: accept4: too many open files #288
Comments
Do you know which probe is causing this? |
Unfortunately not, but wild guess, I'd say there is a higher chance it is caused by the ssh prober since that is the most exotic one, if it was caused by one of the others there is a high chance someone else already experienced the issue. |
The |
I am not sure I get your point. |
If you only send one type of probe to a blackbox exporter at a time, you could help narrow it down. |
So after I hit the max fd limit another two times I took the time to investigate. The fd leak seems to be caused by this job, because if I leave it on as the only job that uses the blackbox_exporter it happens, if I turn it off and keep all other jobs running it does not happen:
The module config looks like this:
Other stuff that may or may not be of relevance:
|
That narrows things down a good bit. Does it seem to go with failed probes, successful probes, or both? |
Only happens with successful probes, if they fail the open_fds stay constant. |
I can't reproduce this. What are the fds which are being leaked? |
So, I couldn't reproduce this locally either but perfectly well on the actual setup. Found out if this only happens when using that ipv4-to-ipv6 translating haproxy. What resolved the issue was upgrading the haproxy from 1.8.0 to 1.8.4 - I guess it had a bug that caused it to not close connections under some circumstances which got triggered by using the Thanks a lot for your help! |
we just reproduced it for the postgres exporter , and saw ulimits was 1024 while the exporter opened about one thousand sockets hope that helps |
Host operating system: output of
uname -a
Linux 3.10.0-693.el7.x86_64 #1 SMP Tue Aug 22 21:09:27 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux
blackbox_exporter version: output of
blackbox_exporter -version
What is the blackbox.yml module config.
What is the prometheus.yml scrape config.
What did you do that produced an error?
Run the blackbox_exporter for about 50 days
What did you expect to see?
No errors
What did you see instead?
http: Accept error: accept tcp [::]:9115: accept4: too many open files; retrying in 1s
Restarting the blackbox_exporter fixes the issue, however that is not a real solution. Maybe noteworthy is that my Prometheus scrapes the blackbox_exporter via an IPv6 address.
The text was updated successfully, but these errors were encountered: