New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[dev.icinga.com #10208] Eventhandler trigger on all endpoints in high available zone #3431
Comments
Updated by mfriedrich on 2015-09-24 19:59:26 +00:00
Please add the debug logs from both ends as well as the configuration objects (services). |
Updated by dgoetz on 2015-09-25 07:30:39 +00:00
Configuration object:
Added only the lines matching the hostname su01003910 because debuglog exceeds upload limit although it was only running less than 5 minutes! |
Updated by mfriedrich on 2015-10-15 12:47:37 +00:00
|
Updated by mfriedrich on 2016-03-18 17:09:10 +00:00
Event handlers are triggered locally when the check result is processed. If multiple instances are involved each of them will of course trigger the event handler then. Imho this is not a bug but a feature request to deal with event handlers inside a cluster zone somehow. Though I have no idea for a possible implementation. Leaving this open for suggestions. |
I'm not sure why this is marked low/enhancement, using an
I thought checks were scheduled between cluster members (forgive me if I'm wrong), so the same check result would only be processed by one cluster member for any given check execution? If so, this wouldn't be a problem. (I have logged check state, state id, check_attempt to verify duplication.) Ref: https://lists.icinga.org/pipermail/icinga-users/2017-February/011844.html Edit: The OP describes the result being replicated. So this is "of course" the expected behavior. The issue is that I do not believe it should be the expected behavior (executing the same command multiple times in response to the same service check result). Other than requiring |
Hi all, |
@MarcusCaepio: i talked to @dgoetz, as far as he knows this isn't fixed. |
I've got the same question from @MarcusCaepio and I believe no-one looked into this yet. That's what I've told him at OSMC. |
Since questions came up why no-one looked into this for a long period of time: Event handlers were ported from 1.x where no cluster or HA feature was yet implemented. Event handlers in the current feature-set would need a re-design, especially when it comes to command_endpoint enabled clients (see #5658 for details) or how parameters can be passed. That is the reason why I've tagged this as a feature a while ago. In terms of the problem itself, each instance determines whether to execute event handlers upon received check results in an HA-enabled zone. The object authority for checkable objects can be determined by the TestsConfiguration
FixGit Mastericinga2a
icinga2b
Appliedicinga2a
icinga2b
Note: The critical logging will be turned into notice inside the patch. Single instance test
There won't be any 2.8.x minor release anymore, and since this change also requires further tests with the snapshot packages, I'll schedule this for 2.9 as an exception (actually the issue set is frozen). |
This issue has been migrated from Redmine: https://dev.icinga.com/issues/10208
Created by dgoetz on 2015-09-24 07:54:05 +00:00
Assignee: (none)
Status: New
Target Version: (none)
Last Update: 2016-03-18 17:09:10 +00:00 (in Redmine)
Eventhandlers are triggerd on all endpoints in a high available setup.
We see an Eventhandler command triggered on both systems in the high available zone. First it is triggerd on the check source than after replication on the other endpoint.
Worst case scenario the eventhandler restarts a problematic service twice with some seconds delay which could cause more problems than fix!
Plattform is RHEL 7 64 bit, Icinga 2 Release 2.3.10 from packages.icinga.org
Attachments
The text was updated successfully, but these errors were encountered: