New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Issues with snoopy and keepalived #100
Comments
Too many variables here:
I am using keepalived with snoopy for couple of years now, no problems. |
Snoopy version is: 2.4.7 Thanks Chris. |
I guess logging is hanging on a syscall to write to file, or something similar. Can you provide strace and lsof outputs for relevant processes (either bash or killall, maybe something else)? You will probably notice that one of these processes is stuck at some syscall or sth. Does using devlog output provider work? b. |
Let me write a check that keeps calling killall, as when I have been testing by hand it never fails. The devlog is available but not using it at the moment. Thanks Chris |
Also test with similarly-configured environment, and under same user. b. |
We have about 20 dev environments that are all the same, what it looks like is the split second that snoopy is adding is taking the whole process over 1 second. This isn't a fault of snoopy but that killall is expensive and clearly adding another layer however small is causing the process to slow down just enough to trip keepalived. Thanks for all your help. Chris. |
Tnx for reporting back the cause of your problems. b. |
All
We have deployed snoopy to our development environment however we have the following configuration in keepalived:
vrrp_script chk_haproxy { # Requires keepalived-1.1.13
script /root/scripts/chk_haproxy.sh
interval 2 # check every 2 seconds
}
Since installing snoopy this seems to fail and cause keepalived to flip.
Nov 16 08:27:22 HOSTNAME Keepalived_vrrp[5139]: VRRP_Script(chk_haproxy) timed out
Nov 16 08:27:22 HOSTNAME Keepalived_vrrp[5139]: Process [6810] didn't respond to SIGTERM
The script is doing:
killall -0 haproxy
Any suggestions are welcome.
Thanks
Chris
The text was updated successfully, but these errors were encountered: