Skip to content

Backup Job Failure Rate Monitor

Daniele Grandini edited this page Nov 11, 2015 · 1 revision

Backup Job Failure Rate Monitor

Summary

The monitor, disabled by default, checks for an excessive failure rate for backup jobs. If the number of failures across the specified run count exceeds the threshold an alert is set.

What can I do?

To troubleshoot the issue you must go to the Azure Portal and review the job failures causes.

Configuration

Option Definition Default
Interval seconds Amount of time in seconds between checks for connectivity!3611 seconds
Sync Time Optional time within the hour to sync the check execution to
Timeout Seconds Amount of time in seconds to wait for script execution and hance for API Search response 30 seconds
TraceLevel Debugging level (0-5) anything above 2 will log clear text password in eventlog 2
LastNJobs Number of runs to check for failures 5
MaxFailures Number of maximum sustainable failures across the runs checked. 0 means a single failure raises an alert 1
FailureCondition RegEx to be cheked against the job result status for failures Failed
LookbackDays Number of days to search runs for. It basically instruct the code to look back this number of days to look for runs 15