Too many open File Descriptors (SYN_SENT) #1726
Comments
Well, I increased the limits drastically, and Prometheus is now using a LOT of files.
My queries are now timing out :(
What does
I found the root cause: my SD mechanism had a bug that left terminated instances lying around. I have patched the code, and a test under a small load is running as expected. I have just started the test under production load and will keep you posted. That said, is there any way to make sure that unacknowledged connections do not pile up?
So your file SD actually contained hundreds of thousands of targets in the end? If so, then it's not really a piling-up issue in Prometheus; Prometheus simply needs to connect to too many targets at once?
Yep, you were right. We kill instances and bring up new ones every hour, and the killed instances are not cleaned up for 5-10 minutes, during which the open connections pile up. The weird thing is that with 7K-10K services we are still hitting 90K open connections. While this might be expected, is there any flag or config I could use to reduce that number? The good thing is that the count falls from 90K to 4K very quickly once the cleanup is done.
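One way to shorten the window in which terminated instances stay in the target list is to prune them before the file SD file is written. The following is only a minimal sketch of that idea, not the fix that was actually applied here. It assumes the targets are published as a file SD JSON list of `{"targets": [...], "labels": {...}}` groups, and that a set of currently running hosts is available from whatever manages the instances. `prune_terminated`, `write_file_sd`, and `running_hosts` are hypothetical names introduced for illustration.

```python
import json
import os
import tempfile
from typing import Dict, List, Set


def prune_terminated(groups: List[Dict], running_hosts: Set[str]) -> List[Dict]:
    """Keep only targets whose host part is in running_hosts.

    running_hosts is an assumed input: the set of hosts that the
    instance manager currently reports as running.
    Groups left with no targets are dropped; the input is not modified.
    """
    pruned = []
    for group in groups:
        alive = [
            target
            for target in group.get("targets", [])
            # "10.0.0.1:9100" -> "10.0.0.1"
            if target.rsplit(":", 1)[0] in running_hosts
        ]
        if alive:
            pruned.append({**group, "targets": alive})
    return pruned


def write_file_sd(path: str, groups: List[Dict]) -> None:
    """Write groups as JSON through a temp file plus rename,
    so a reader never observes a half-written file."""
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    try:
        with os.fdopen(fd, "w") as handle:
            json.dump(groups, handle, indent=2)
        # mkstemp creates the file as 0600; relax it so another user can read it.
        os.chmod(tmp_path, 0o644)
        os.replace(tmp_path, path)
    except BaseException:
        if os.path.exists(tmp_path):
            os.unlink(tmp_path)
        raise
```

Taking the set of running hosts as an argument keeps the pruning step independent of any particular cloud or orchestration API. Targets that are merely unhealthy but still running stay in the list, so they continue to be scraped and reported as down.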
gouthamve closed this on Jun 16, 2016
lock bot commented Mar 24, 2019
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.
gouthamve commented Jun 11, 2016
What did you do?
Ran Prometheus on production load.
What did you expect to see?
Prometheus runs fine
What did you see instead? Under which circumstances?
Prometheus starts erroring with too many open files
Environment
System information:
Linux 3.13.0-74-generic x86_64
Prometheus version:
build user: root@dfc6307dc40d
build date: 20160526-01:42:25
go version: go1.6.2
Prometheus configuration file:
I increased the open file limits, but Prometheus is eating up everything. The output of lsof (<pid> is the Prometheus PID):
sudo lsof | grep <pid> | wc -l ==> 61165
sudo lsof | grep <pid> | grep SYN_SENT | wc -l ==> 60149
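For a breakdown by connection state rather than two separate grep counts, the lsof output can be tallied in one pass. This is a rough sketch that assumes lsof's usual layout, where TCP socket lines carry `TCP` in the NODE column and end with the state in parentheses. `count_tcp_states` is a hypothetical helper, and `SAMPLE_LSOF` is made-up illustrative input, not output from the system above.

```python
import re
from collections import Counter

# Matches the trailing "(STATE)" that lsof appends to TCP socket lines.
_STATE_RE = re.compile(r"\(([A-Z_0-9]+)\)\s*$")


def count_tcp_states(lsof_output: str) -> Counter:
    """Count TCP sockets per state in lsof text output."""
    counts = Counter()
    for line in lsof_output.splitlines():
        # Skip the header and non-TCP entries such as regular files.
        if "TCP" not in line.split():
            continue
        match = _STATE_RE.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts


# Made-up sample in lsof's column layout, for illustration only.
SAMPLE_LSOF = """\
COMMAND    PID USER   FD   TYPE DEVICE SIZE/OFF NODE NAME
prometheu 1234 prom    3r   REG    8,1     4096  131 /data/chunk
prometheu 1234 prom   10u  IPv4 100001      0t0  TCP 10.0.0.5:40001->10.0.1.7:9100 (SYN_SENT)
prometheu 1234 prom   11u  IPv4 100002      0t0  TCP 10.0.0.5:40002->10.0.1.8:9100 (SYN_SENT)
prometheu 1234 prom   12u  IPv4 100003      0t0  TCP 10.0.0.5:40003->10.0.1.9:9100 (ESTABLISHED)
prometheu 1234 prom   13u  IPv4 100004      0t0  TCP 10.0.0.5:40004->10.0.2.1:9100 (SYN_SENT)
"""

print(count_tcp_states(SAMPLE_LSOF).most_common())
```

Real input could come from something like `sudo lsof -nP -a -p <pid> -i TCP`. Selecting the process with `-p` avoids a pitfall of `lsof | grep <pid>`, which also matches lines where the same digits appear in another column.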