SRE
Site reliability engineering (SRE) is a set of principles and practices that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems. Site reliability engineering is closely related to DevOps, a set of practices that combine software development and IT operations, and SRE has also been described as a specific implementation of DevOps.
Here are 81 public repositories matching this topic...
The Open Source DevOps Assistant - solve problems twice as fast with an AI teammate
-
Updated
May 31, 2024 - Python
Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions
-
Updated
May 27, 2024 - Python
A curated list of awesome DevOps platforms, tools, practices and resources
-
Updated
May 27, 2024 - Python
StackStorm (aka "IFTTT for Ops") is event-driven automation for auto-remediation, incident responses, troubleshooting, deployments, and more for DevOps and SREs. Includes rules engine, workflow, 160 integration packs with 6000+ actions (see https://exchange.stackstorm.org) and ChatOps. Installer at https://docs.stackstorm.com/install/index.html
-
Updated
May 25, 2024 - Python
log data pre processing in python
-
Updated
May 28, 2024 - Python
Analyses your database queries and schema and suggests indices and schema improvements
-
Updated
May 29, 2024 - Python
Learning Shell,Python,Golang,System,Network
-
Updated
May 20, 2024 - Python
A very small digitalized primate responsible for randomly preventing something from continuing as usual or as expected.
-
Updated
May 14, 2024 - Python
专注于 SRE 运维、云原生、稳定性、高可用性、可观测性、DevOps 等技术
-
Updated
May 10, 2024 - Python
A CLI tool designed for CI/CD processes, enabling automatic service versioning and changelog generation.
-
Updated
May 7, 2024 - Python
Chaos Engineering Toolkit & Orchestration for Developers
-
Updated
May 1, 2024 - Python
This is repo that contains Platform Services Team SRE documentation and tools
-
Updated
Apr 29, 2024 - Python
Sample applications of supported integrations by Last9 Products
-
Updated
May 21, 2024 - Python
A tool that interfaces with multiple services to collect insightful key metrics.
-
Updated
Apr 29, 2024 - Python
Reliably CLI - Accelerate your Resilience Engineering Adoption
-
Updated
Mar 18, 2024 - Python
Used to automatically merge terraform dependencies pull requests in github that have no differences.
-
Updated
May 27, 2024 - Python
linux echo but with webhooks! ⚓
-
Updated
Mar 2, 2024 - Python
- Followers
- 114 followers
- Wikipedia
- Wikipedia