Backup Job Failure Rate Monitor
Daniele Grandini edited this page Nov 11, 2015
·
1 revision
The monitor, disabled by default, checks for an excessive failure rate for backup jobs. If the number of failures across the specified run count exceeds the threshold an alert is set.
To troubleshoot the issue you must go to the Azure Portal and review the job failures causes.
Option | Definition | Default |
---|---|---|
Interval seconds | Amount of time in seconds between checks for connectivity!3611 seconds | |
Sync Time | Optional time within the hour to sync the check execution to | |
Timeout Seconds | Amount of time in seconds to wait for script execution and hance for API Search response | 30 seconds |
TraceLevel | Debugging level (0-5) anything above 2 will log clear text password in eventlog | 2 |
LastNJobs | Number of runs to check for failures | 5 |
MaxFailures | Number of maximum sustainable failures across the runs checked. 0 means a single failure raises an alert | 1 |
FailureCondition | RegEx to be cheked against the job result status for failures | Failed |
LookbackDays | Number of days to search runs for. It basically instruct the code to look back this number of days to look for runs | 15 |