-
Notifications
You must be signed in to change notification settings - Fork 136
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
dcache-resilience: avoid spamming alarms with abort messages
Motivation: When a file replication operation is fatally aborted, an alarm notification is raised. This notification currently is keyed to the file's pnfsid. While such failures are usually infrequent, under certain conditions (e.g., network congestion failures, no more available pools for further copies), whole successions of such failures can occur, in which cases hundreds or even thousands of alarms are sent. Though this is not a problem for the server to handle, it is unfriendly to the user, both when scanning alarms using the various web inferfaces, or potentially when receiving email alerts (since the key for the alarm is the pnfsid, each alarm will generate an email alert as well). Modification: Key the alarm to the file's storage unit instead of the pnfsid. Add a suffix to the key based on a hourly timestamp. The alarm will thus be incremented during the hour but a new alarm will be created every hour; in this way, those receiving email alerts will receive them once an hour. Log this alarm to the domain log at WARN level. Create a new logger and appender for resilience, and log the full file information to that at the ERROR level. Result: Friendlier behavior wrt replication failure (fatal abort) alarms. Target: master Require-notes: yes Require-book: no Request: 4.0 Request: 3.2 Request: 3.1 Request: 3.0 Request: 2.16 Acked-by: Paul
- Loading branch information
Showing
4 changed files
with
62 additions
and
6 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters