This issue has been migrated from Redmine: https://dev.icinga.com/issues/10002
Created by aledermueller on 2015-08-26 13:34:46 +00:00
Agents (zones): approx. 400 (mixed versions with 2.3.8 and 2.3.9)
After a while, Icinga2 on one master hangs without using resources such as CPU or I/O. netstat shows full Recv-Qs (data from the agents) and empty Send-Qs. About 2/3 of the connections are in CLOSE_WAIT; the other 1/3 are ESTABLISHED.
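For anyone trying to confirm the same symptom, the netstat check described above can be tallied with a small script. This is only a sketch: it assumes Linux `netstat -tan` column order (Proto, Recv-Q, Send-Q, Local Address, Foreign Address, State), and the function name `summarize_netstat` and the sample addresses and queue sizes are made up for illustration, not taken from the affected masters.

```python
from collections import Counter


def summarize_netstat(output):
    """Tally TCP states and count sockets with unread data in `netstat -tan` text."""
    states = Counter()   # number of sockets per TCP state
    backlog = Counter()  # number of sockets per state with a non-empty Recv-Q
    for line in output.splitlines():
        fields = line.split()
        # Assumed row layout: Proto Recv-Q Send-Q Local Foreign State
        if len(fields) < 6 or not fields[0].startswith("tcp"):
            continue  # skip headers and non-TCP rows
        recv_q, state = int(fields[1]), fields[5]
        states[state] += 1
        if recv_q > 0:
            backlog[state] += 1
    return states, backlog


# Hypothetical sample output; real data would come from `netstat -tan`.
sample = """\
Active Internet connections (servers and established)
Proto Recv-Q Send-Q Local Address           Foreign Address         State
tcp        0      0 0.0.0.0:5665            0.0.0.0:*               LISTEN
tcp    65536      0 10.0.0.1:5665           10.0.0.11:40312         CLOSE_WAIT
tcp    65536      0 10.0.0.1:5665           10.0.0.12:51844         CLOSE_WAIT
tcp    40960      0 10.0.0.1:5665           10.0.0.13:38110         ESTABLISHED
"""

states, backlog = summarize_netstat(sample)
print("states:", dict(states))
print("sockets with pending Recv-Q:", dict(backlog))
```

A large share of CLOSE_WAIT sockets with non-empty Recv-Q would match the pattern reported here: the agents have sent data and closed their side, but the master is no longer reading from or closing those sockets.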
A stacktrace is attached: gdb -p xxx -ex 'thread apply all bt full' -ex deta -ex q -batch > debug
The debug log mainly contains the following entries. The counter for pending tasks keeps growing.
2015-09-02 05:46:30 +00:00 by (unknown) 5c77e6e
2015-09-02 07:16:20 +00:00 by (unknown) 35acba7
2015-10-15 13:16:51 +00:00 by (unknown) e480af3
2015-10-15 13:18:02 +00:00 by (unknown) c8d24b6
Updated by aledermueller on 2015-08-27 07:10:51 +00:00
The same thing happened again. Now the second master shows the same behavior and logs. Stack traces of both are attached; master1 is the host writing to the ido-master.
Updated by mfriedrich on 2015-09-14 08:22:08 +00:00
According to Achim and Blerim, the fixes made it work again (2.3.10 without the fixes causes trouble; the snapshot packages have been running fine for nearly a week now). I'd say we test this a little more and possibly backport the fixes into 2.3.11 next week.
Updated by mfriedrich on 2015-10-15 13:19:22 +00:00