About

The SRE Gemini CLI Extension is a dedicated toolkit comprising specialized Skills and Agents designed to augment Site Reliability Engineers (SREs). By integrating deeply with the Gemini CLI, this extension empowers SREs to investigate outages, configure MCP servers, formulate mitigations, and detect anomalies more rapidly.

Note

🧪 Experimental Status: The included Agents (such as outage-gke-investigator) are currently experimental. They are designed to accelerate your workflow, but their outputs should always be independently verified during live incidents.

INSTALL

You need Gemini CLI installed.
Then type:

gemini extensions install https://github.com/gemini-cli-extensions/sre

Available Skills and Agents

🤖 Agents

outage-gke-investigator: An agent tuned specifically for GKE outage and incident investigations (experimental).

🛠️ Core SRE Skills

investigation-entrypoint: Primary entrypoint for investigating production outages, orchestrating SRE response, and mitigating incidents. Start here when an incident occurs!
gcp-playbooks: Follows established SRE playbooks for GCP/GKE investigations, including infrastructure discovery and common mitigation steps.
gcp-mcp-setup: Automates enabling services, Google Managed MCP (OneMCP) servers, generating API keys, and configuring ~/.gemini/settings.json.
gcp-slo-management: Discover Monitoring Services, list existing SLOs, or create new SLOs (Availability/Latency) via the REST API.
postmortem-generator: Creates a generated PostMortem given enough context about a resolved incident/outage.

☁️ Cloud Capabilities

cloud-build-investigation: Expert-level SRE skill for Google Cloud Build (GCB) and Cloud Run investigations. Correlates git commits with build failures and analyzes logs.
cloud-logging: Skill for interacting with and analyzing Google Cloud Logging and Error Reporting. Processes large JSON logs or converts them to Apache format.
cloud-monitoring: Interacts with Google Cloud Monitoring via APIs to avoid large context bloat. Exports time-series data and helps setup SLOs.

📊 Detection, Graphs & Mitigations

generic-mitigations: Generic Mitigations high-level classification logic and actuation plan.
monitoring-graphs: Generates high-quality, annotated incident graphs for post-mortems using Python to visualize outages and error rates (nice graphs visible here).
anomaly-detection: Detects anomalies in time-series data from various sources (Isolation Forest, KNN, Z-score).
data-ingestion: Fetches and parses time-series data from various sources for downstream analysis.

Warning

The safe-sre-investigator skill is deprecated and has been merged into native playbooks.

Quickstart

Install this extension via the Gemini CLI's install command: gemini extensions install https://github.com/gemini-cli-extensions/sre

Only for the first time, use gcp-mcp-setup skill to setup your GCP project, and MCP servers:

$ gemini 
Use the gcp-mcp-setup skill to setup my GCP project "foo-bar-123" with email jane-doe-sre@credible-company.com

Invoke the entrypoint skill with your incident request. For example:

$ gemini
Invoke the investigation entrypoint skill with this new incident: cluster gKE with ip 1.2.3.4 is reported down by numerous customers, please investigate.

The agent will take it from there—fetching context, querying metrics, and formulating mitigations.

For detailed instructions on setup and usage, please refer to the User Manual.

Contributing

Check CONTRIBUTING.md.

Feedback

For feedback, please report bugs and feature requests in the issue tracker. Any other intelligible feedback should be sent to this form: SRE Extension Survey

Thanks

Program Lead: Riccardo

Co-authors and contributors:

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
.gemini		.gemini
.github		.github
docs		docs
policies		policies
skills		skills
test		test
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
GEMINI.md		GEMINI.md
LICENSE		LICENSE
README.md		README.md
USER_MANUAL.md		USER_MANUAL.md
gemini-extension.json		gemini-extension.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

INSTALL

Available Skills and Agents

🤖 Agents

🛠️ Core SRE Skills

☁️ Cloud Capabilities

📊 Detection, Graphs & Mitigations

Quickstart

Contributing

Feedback

Thanks

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

About

INSTALL

Available Skills and Agents

🤖 Agents

🛠️ Core SRE Skills

☁️ Cloud Capabilities

📊 Detection, Graphs & Mitigations

Quickstart

Contributing

Feedback

Thanks

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages