Skip to content

gregberns/MonitoringAndAlertingEssentials

Repository files navigation

Monitoring And Alerting Essentials

Purpose

The following is a guide to understand improving system reliability through logging, monitoring, and alerting, and a process of continuous improvement.

If we want to consider ourselves 'engineers' our systems need to work reliably. When they do not work, we need to know when they are failing and why they are failing.

Content

There are several parts to this documentation.

Scope

This documentation is primarily limited to application logging. OS, web service, and other types of logging will not be covered.

References

The majority of the ideas in this repo have been taken from the places where I learned them:

About

Writings and Talk on Logging, Metrics, Monitoring, and Alerting

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages