Backup Job Failure Rate Monitor

Summary

The monitor, disabled by default, checks for an excessive failure rate for backup jobs. If the number of failures across the specified run count exceeds the threshold an alert is set.

What can I do?

To troubleshoot the issue you must go to the Azure Portal and review the job failures causes.

Configuration

Option	Definition	Default
Interval seconds	Amount of time in seconds between checks for connectivity!3611 seconds
Sync Time	Optional time within the hour to sync the check execution to
Timeout Seconds	Amount of time in seconds to wait for script execution and hance for API Search response	30 seconds
TraceLevel	Debugging level (0-5) anything above 2 will log clear text password in eventlog	2
LastNJobs	Number of runs to check for failures	5
MaxFailures	Number of maximum sustainable failures across the runs checked. 0 means a single failure raises an alert	1
FailureCondition	RegEx to be cheked against the job result status for failures	Failed
LookbackDays	Number of days to search runs for. It basically instruct the code to look back this number of days to look for runs	15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Backup Job Failure Rate Monitor

Backup Job Failure Rate Monitor

Summary

What can I do?

Configuration

Clone this wiki locally