Nagios freezing when using Query Handler #635
Comments
Hello. With this little Python script, I can reproduce that Nagios is not releasing nagios.qh. Before executing the script: After executing the script: And this number never goes down, so after a while it reaches Nagios's limit on open files. Can you help me?
It seems that the issue is around the handling of force: when force is true, the issue goes away.
Up?
Up?
Hi @ToToL, thanks for reporting this issue. I'll need to do some more testing before using your suggested changes (most likely, there's just one flag that's not getting set properly, rather than that whole variable being incorrectly assigned).
I've had a little time to look into this. For now, I've decided to make a slightly different change: in workers.c, I changed line 752. My sense of this, however, is that the force variable can probably just be removed from wproc_destroy() entirely. @hedenface what do you think about this?
Hello. Isn't WPROC_FORCE 0 by default? Thanks for digging into my issue :) Regards
WPROC_FORCE itself is set to 1 in include/workers.h. Either way, thank you for reporting the issue! It looks like we should have a fix for this in 4.4.4.
Resolve #635: When there is no more data to read from a socket, release it
The patch has been merged into the maint branch and will be released with 4.4.4.
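For readers unfamiliar with the pattern named in the commit title, here is a minimal sketch of "release a socket when there is no more data to read". It is not the Nagios patch (which is C code in workers.c); it only illustrates the general idea in Python, and the names `handle_readable` and `demo` are hypothetical.

```python
import selectors
import socket


def handle_readable(sel, conn):
    """Read once from conn and release it if the peer has closed.

    Returns the bytes read, b"" if the socket was released on EOF,
    or None if there was nothing to read yet.
    """
    try:
        data = conn.recv(4096)
    except BlockingIOError:
        return None  # no data right now; keep the socket registered
    if data == b"":
        # recv() returning zero bytes means EOF: the peer closed its end.
        # Without this branch the descriptor would stay open forever.
        sel.unregister(conn)
        conn.close()
        return b""
    return data


def demo():
    sel = selectors.DefaultSelector()
    server_side, client_side = socket.socketpair()
    server_side.setblocking(False)
    sel.register(server_side, selectors.EVENT_READ)

    client_side.sendall(b"hello")
    client_side.close()  # the client "stops", as in the report

    results = []
    while sel.get_map():  # loop until every socket has been released
        for key, _ in sel.select(timeout=1):
            results.append(handle_readable(sel, key.fileobj))
    sel.close()
    # fileno() is -1 once the socket object has been closed
    return results, server_side.fileno()
```

If the EOF branch is left out, the server side keeps one descriptor per departed client, which matches the kind of growth described in this issue.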
Hello
I'm using Nagios 4.4.3 under CentOS 7.
I enabled the query handler in Nagios with the default configuration.
On the other side, I have a Perl program that opens a socket, registers with the query handler, and stops.
Register command : wproc register name=col-;pid=;max_jobs=1;plugin=Collector
The placeholder is replaced with the PID of my Perl process.
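A rough Python equivalent of the "open a socket, register, stop" sequence might look like the sketch below. Several things here are assumptions rather than facts from this report: the socket path (a common default for source installs), the leading `@` and the trailing NUL byte (my understanding of how query handler requests are framed), and the example name `col-1234`. The helper names are hypothetical.

```python
import os
import socket

# Assumed default location; the real path depends on the local install.
QH_SOCKET = "/usr/local/nagios/var/rw/nagios.qh"


def build_register_query(name, pid, max_jobs=1, plugin="Collector"):
    """Build the register request as bytes.

    The '@' prefix and the NUL terminator are assumptions about the
    query handler framing, not something shown in the report.
    """
    body = f"wproc register name={name};pid={pid};max_jobs={max_jobs};plugin={plugin}"
    return b"@" + body.encode("ascii") + b"\0"


def register_and_stop(name, path=QH_SOCKET):
    """Connect, send one register request, read the reply, then disconnect."""
    query = build_register_query(name, os.getpid())
    with socket.socket(socket.AF_UNIX, socket.SOCK_STREAM) as sock:
        sock.connect(path)
        sock.sendall(query)
        return sock.recv(4096)  # socket is closed when the with-block exits
```

Calling `register_and_stop("col-1234")` repeatedly against a test instance would be one way to try reproducing the descriptor growth.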
After a few hours, Nagios freezes. When running strace on it, I get endless lines with this message:
write(7170, "job_id=306\0type=0\0command=/usr/l"..., 104) = -1 EAGAIN (Resource temporarily unavailable)
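As background on the strace line above: EAGAIN on write() is what a non-blocking socket returns when its send buffer is full because the other end is not reading. Whether that is exactly what happens inside Nagios here is an assumption on my part. The sketch below only reproduces the generic behaviour with a local socket pair, and the function name is hypothetical.

```python
import errno
import socket


def fill_until_eagain(chunk=b"x" * 4096):
    """Write to a non-blocking socket whose peer never reads.

    Returns (bytes_sent, got_eagain).
    """
    reader, writer = socket.socketpair()
    writer.setblocking(False)
    sent = 0
    try:
        while True:
            sent += writer.send(chunk)  # succeeds until the buffer is full
    except BlockingIOError as exc:
        # Python raises BlockingIOError when the OS reports EAGAIN/EWOULDBLOCK
        return sent, exc.errno in (errno.EAGAIN, errno.EWOULDBLOCK)
    finally:
        reader.close()
        writer.close()
```

A caller that retries such a write immediately, without waiting for the peer to drain the buffer, would produce a stream of identical failing write() calls like the one shown.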
lsof tells me that Nagios has over 7000 open files, many of them nagios.qh. I tried allowing Nagios to open more files, but that only makes the issue take longer to appear.
Is this a known issue? Can you help me investigate this?
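To watch the descriptor count over time without running lsof by hand, something like the following sketch could be used. It assumes Linux (it reads `/proc/<pid>/fd`) and enough privileges to inspect the Nagios process; `count_open_fds` is a hypothetical helper name.

```python
import os


def count_open_fds(pid):
    """Return the number of open file descriptors for a process (Linux only)."""
    return len(os.listdir(f"/proc/{pid}/fd"))
```

Sampling this value before and after each run of the registering client should show whether the count returns to its baseline or keeps climbing.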
Thanks