Gerd by Onyx is a light-weight chaos monkey implementation for k8s (kubernetes)
-
Updated
Aug 12, 2020 - C#
Gerd by Onyx is a light-weight chaos monkey implementation for k8s (kubernetes)
A .Net Standard library for working with the Uptime Robot API.
Use Grafana k6, Dynatrace business events, workflows and site reliability guardian to validate software releases
Overall map of topics to cover for my “Engineering for Site Reliability” blog series.
🔖 Daily-updated reading list for designing High Scalability 🍒, High Availability 🔥, High Stability 🗻 back-end systems - Pull requests are greatly welcome 👬 I hope you will find this project helpful 🍀 Please help me share it to more and more people ❤️ Thank you - 谢谢 - धन्यवाद - ধন্যবাদ - Спасибо - شكرا - Merci - Gracias - Danke - Cảm ơn! 🙇
An ongoing & curated collection of awesome SRE software and tools, libraries and frameworks, engineering books and blogs, philosophical principles, technical guidelines, practical tools about the field of Site Reliablity Engineering (SRE)
A list of common Disaster Recovery (DR) scenarios for software companies
A collection templates ported from the SRE Workbook
Calculate how much downtime should be permitted in your Service Level Agreement or Objective
A curated list of awesome Site Reliability and Production Engineering resources.
A party card game for engineers caring about reliability. Based on Cards Against Humanity.
A role-playing game for incident management training
A collection of postmortem templates
A curated list of Site Reliability and Production Engineering resources.
Add a description, image, and links to the site-reliability topic page so that developers can more easily learn about it.
To associate your repository with the site-reliability topic, visit your repo's landing page and select "manage topics."