Launch: On-call rotation + incident runbook #16

@psdjungpulzze

Before going live:

  • Set up on-call schedule (PagerDuty / Opsgenie / plain phone rotation)
  • Define SEV1/SEV2/SEV3 severity levels and response SLAs
  • Write incident runbook: how to roll back a bad deploy, restart the worker, and flush the Redis queue
  • Add runbook link to status page and internal wiki
  • Test an alert end-to-end (fire a fake alert, confirm the right person is paged)
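
One way to make the severity and alert-test items concrete is sketched below. It is only an illustration: the SLA durations, the weekly rotation, the names in `ROTATION`, and the helpers `on_call`, `route_alert`, and `within_sla` are all hypothetical placeholders, not values or tooling this issue has decided on. A real setup would delegate routing to PagerDuty, Opsgenie, or whatever rotation is chosen.

```python
from dataclasses import dataclass
from datetime import date, timedelta
from enum import Enum


class Severity(Enum):
    SEV1 = "SEV1"
    SEV2 = "SEV2"
    SEV3 = "SEV3"


# Placeholder time-to-acknowledge SLAs; the issue does not specify real values.
RESPONSE_SLA = {
    Severity.SEV1: timedelta(minutes=15),
    Severity.SEV2: timedelta(hours=1),
    Severity.SEV3: timedelta(hours=24),
}

# Hypothetical weekly rotation starting on Monday 2024-01-01.
ROTATION = ["alice", "bob", "carol"]
ROTATION_START = date(2024, 1, 1)


def on_call(day: date) -> str:
    """Return who is on call on `day`, rotating once per week."""
    weeks_elapsed = (day - ROTATION_START).days // 7
    return ROTATION[weeks_elapsed % len(ROTATION)]


@dataclass
class Page:
    recipient: str
    severity: Severity
    ack_deadline: timedelta
    summary: str


def route_alert(severity: Severity, summary: str, day: date) -> Page:
    """Build the page that would be sent for an alert raised on `day`."""
    return Page(on_call(day), severity, RESPONSE_SLA[severity], summary)


def within_sla(severity: Severity, time_to_ack: timedelta) -> bool:
    """True if the alert was acknowledged within its response SLA."""
    return time_to_ack <= RESPONSE_SLA[severity]


if __name__ == "__main__":
    # Fire a fake alert and confirm the expected person would be paged.
    page = route_alert(Severity.SEV1, "[TEST] fake alert", date(2024, 1, 8))
    print(page.recipient, page.ack_deadline)
```

Running the same check against the real paging tool (with a clearly marked test alert) is what the last checklist item asks for; the sketch only shows the shape of the data to agree on first.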

Owner: Eng lead

Labels: launch-checklist (Production launch checklist items)