Reliability testing for AI agents — Chaos Engineering meets LLM systems.
Ruptor helps you find out how your AI agent behaves when things go wrong: tool timeouts, invalid JSON, rate limits, empty responses. Before your users do.
| Module | What it does |
|---|---|
| `ruptor run` | Injects failures into tool calls and observes agent behavior |
| `ruptor simulate` | Simulates real users to evaluate goal completion and conversation quality |
```sh
# Homebrew
brew install ruptor-dev/tap/ruptor

# Install script
curl -fsSL https://ruptor.dev/install.sh | sh

# From source (requires Go 1.22+)
go install github.com/ruptor-dev/cli/cmd/ruptor@latest
```
On macOS, Gatekeeper may block the binary on first run with "Apple could not verify ruptor is free of malware."
This is expected for unsigned OSS binaries. Remove the quarantine attribute and run normally:
```sh
xattr -d com.apple.quarantine $(which ruptor)
ruptor --version
```

This is a one-time step. It does not affect subsequent runs.
1. Point your agent's tools at Ruptor:

```sh
export TOOL_BASE_URL=http://localhost:8080
```

2. Create a `chaos.yaml`:
```yaml
agent:
  name: my_agent
  entrypoint: python agent.py
  env:
    TOOL_BASE_URL: http://localhost:8080

proxy:
  port: 8080
  passthrough_url: https://my-real-tool-api.com

tests:
  - id: timeout_on_search
    tool: /search
    fault: tool_timeout
    delay_ms: 30000
    probability: 1.0

evaluation:
  max_iterations: 20
  timeout_s: 60
  llm_judge: true

output:
  format: both
  path: ./reports/
```

3. Run:
```sh
ruptor run chaos.yaml
ruptor run chaos.yaml --output report.html
ruptor run chaos.yaml --test timeout_on_search
ruptor run chaos.yaml -v   # tail runner/proxy logs in a TUI panel
```

All runner and proxy events land in `~/.ruptor/runs/<TIMESTAMP>/ruptor.log` whether or not `-v` is set. The flag only decides whether the stream is also surfaced live: as a TUI panel in an interactive terminal, or on stderr when output is piped or running in CI.
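For context, the only agent-side integration the quick start assumes is that tool calls read `TOOL_BASE_URL` instead of hard-coding the real API. A minimal sketch of such a tool in Python (the `search` function and its `requests` usage are illustrative, not part of Ruptor):

```python
# Illustrative agent-side tool: routes through whatever TOOL_BASE_URL
# points at (the Ruptor proxy during chaos runs, the real API otherwise).
import os
import requests

BASE_URL = os.environ["TOOL_BASE_URL"]

def search(query: str) -> dict:
    # Under the timeout_on_search test the proxy holds /search for
    # 30000 ms; a client-side deadline shorter than that is exactly
    # the defense the test checks for.
    resp = requests.get(f"{BASE_URL}/search", params={"q": query}, timeout=10)
    resp.raise_for_status()
    return resp.json()
```

Without the `timeout=` argument, the call would simply hang for the full injected delay, which is the failure mode this test is designed to surface.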
1. Create a `simulate.yaml`:

```yaml
agent:
  name: support_agent
  base_url: http://localhost:3000

simulations:
  - id: frustrated_user
    persona: "Frustrated user who wants to resolve their issue in under 3 messages"
    goal: "Cancel subscription"
    max_turns: 10
    success_criteria: "Agent completed the cancellation"

evaluation:
  goal_completion: true
  tone_quality: true

output:
  format: both
  path: ./reports/
```

2. Run:
```sh
ruptor simulate simulate.yaml
ruptor simulate simulate.yaml --sim frustrated_user
```

| Fault | Description |
|---|---|
| `tool_timeout` | No response — does the agent have its own timeout? |
| `slow_response` | Responds after N ms — does the agent wait or cut the call off? |
| `tool_error` | Returns 5xx — does the agent retry or fail gracefully? |
| `invalid_json` | Returns malformed JSON — does the agent handle the parse error? |
| `empty_response` | Returns 200 OK with an empty body — common and rarely tested |
| `rate_limit` | Returns 429 — does the agent back off or spam retries? |
| `llm_error` | LLM backend returns 503 — does the agent have a fallback? |
| `llm_timeout` | LLM call hangs indefinitely — does the agent have a deadline? |
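Most of these faults come down to a few client-side defenses: a deadline on every call, bounded retries with backoff on 429s and 5xx, and never assuming the body parses as JSON. A sketch of those patterns in Python (the wrapper name, retry counts, and timeouts are illustrative, not Ruptor APIs):

```python
# Illustrative defensive tool-call wrapper covering the fault table:
# deadlines, bounded retries, backoff on 429, and JSON parse guards.
import time
import requests

def call_tool(url: str, payload: dict, retries: int = 3):
    for attempt in range(retries):
        try:
            # tool_timeout / slow_response: always set a deadline
            resp = requests.post(url, json=payload, timeout=10)
        except requests.Timeout:
            continue  # retry, or fall back to another tool
        if resp.status_code == 429:
            time.sleep(2 ** attempt)  # rate_limit: back off instead of spamming
            continue
        if resp.status_code >= 500:
            continue  # tool_error / llm_error: retry, then degrade gracefully
        try:
            # invalid_json / empty_response: never trust the body blindly
            return resp.json()
        except ValueError:
            return None  # malformed or empty JSON: signal a soft failure
    return None  # retries exhausted: let the agent fail gracefully
```

Each fault exercises a branch of this wrapper: `rate_limit` hits the 429 backoff, `invalid_json` and `empty_response` hit the parse guard, and `tool_timeout` hits the `requests.Timeout` path.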
| Variable | Required | Description |
|---|---|---|
| `TOOL_BASE_URL` | Yes | Points your agent's tool calls at the Ruptor proxy |
| `OPENAI_API_KEY` | Only for the LLM judge | API key for the LLM judge evaluator |
| `RUPTOR_TOKEN` | Only for `--cloud` | Auth token for cloud reporting (coming soon) |
```sh
make build                  # compile
make test                   # run tests with race detector
make lint                   # run golangci-lint
make check                  # build + test + vet
make run-example            # run chaos example
make run-simulate-example   # run simulate example
make tools                  # install dev tools
make help                   # list all targets
```

- 8 fault types (`tool_timeout`, `slow_response`, `tool_error`, `invalid_json`, `empty_response`, `rate_limit`, `llm_error`, `llm_timeout`)
- `ruptor run` — chaos proxy with Robustness Score + HTML report
- `ruptor simulate` — user simulation with goal completion scoring
- `ruptor auth` — OAuth device flow (cloud, coming soon)
- `ruptor doctor` — environment diagnostics
- `ruptor update` — self-update
- `ruptor sync` — sync run results to cloud (coming soon)
- Cloud dashboard — run history, team reports, CI/CD integration
- MCP proxy support
- `cascade_failure` and `partial_degradation` enterprise scenarios