Skip to content

Commit

Permalink
Merge pull request #179 from sweir27/on-call-slo
Browse files Browse the repository at this point in the history
RFC: Clarify when we promise to be on-call
  • Loading branch information
dleve123 committed Apr 15, 2019
2 parents ab49d26 + 3bc1b01 commit 0dcf4d0
Show file tree
Hide file tree
Showing 4 changed files with 34 additions and 2 deletions.
3 changes: 2 additions & 1 deletion playbooks/support/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -69,7 +69,8 @@ participating in ongoing sprint and team activities.

While we would _like_ to have 24/7 on-call coverage, we recognize that we don't yet have a good solution for
notifying engineers during non-work hours. Do your best to keep an eye out for issues, and lean on those in more
favorable timezones where appropriate.
favorable timezones where appropriate. See our [On-Call SLO document](on-call-slo.md) for a detailed description of
the coverage we currently promise.

## Responding to an incident

Expand Down
Binary file added playbooks/support/images/on-call-slo.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
31 changes: 31 additions & 0 deletions playbooks/support/on-call-slo.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,31 @@
# On-Call SLO

This document describes a
["Service-Level Objective"](https://landing.google.com/sre/sre-book/chapters/service-level-objectives/) for our
on-call response time to incidents.

## What we promise

- Incidents should have a response time of under 10 minutes during hours of **full support**.
- Incidents should have a response time of under 30 minutes during hours of **partial support**.

![visualization of our on-call SLO](images/on-call-slo.jpg)

- Work days (excludes
[holidays](https://docs.google.com/document/d/16FQ0nKIJ_Eb11kWQd70neB7V3i51ixUsf-CB5f_rAMo/edit#heading=h.d93f947ory))
- 9am — 5pm ET full support
- 7am — 7pm ET partial support
- 7pm — 7am ET no official support
- Non-work days
- 9am — 5pm ET partial support

### Measuring performance against this SLO

This SLO targets the time between when an incident is first reported in #incidents and a Jira Ops ticket is
created.

### Live Auctions

There is currently no explicit support for live auctions outside of the on-call schedule defined above. For special
cases where an auction must be monitored during hours not covered above, the relevant product teams should be
directly consulted.
2 changes: 1 addition & 1 deletion playbooks/support/responding.md
Original file line number Diff line number Diff line change
Expand Up @@ -50,7 +50,7 @@ respond to these issues.
this possible issue
- Link the Jira Ops ticket to any relevant bugs that are tracked (and create an additional one if necessary)
- Mark the ticket as **CANCELED**


## Step 4: Address Incident

Expand Down

0 comments on commit 0dcf4d0

Please sign in to comment.