AgentBreak

AgentBreak lets you test how your app behaves when an OpenAI-compatible provider is slow, flaky, or down.

It sits between your app and the provider, then randomly:

  • returns errors like 429 or 500
  • adds latency
  • lets some requests pass through normally

Use it to check whether your app retries, falls back, or breaks.

PyPI package: agentbreak

Quick Start

Requirements:

  • Python 3.10+

Install from PyPI:

pip install agentbreak

Start AgentBreak in mock mode:

agentbreak start --mode mock --scenario mixed-transient --fail-rate 0.2

Point your app to it:

export OPENAI_BASE_URL=http://localhost:5000/v1
export OPENAI_API_KEY=dummy

That is enough to start testing retry and fallback behavior locally.
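The retry behavior under test can be as simple as a bounded loop around the client call. A minimal sketch, where `call_model`, `TransientError`, and `flaky_call` are illustrative stand-ins (not part of AgentBreak or the OpenAI SDK):

```python
import time

class TransientError(Exception):
    """Stands in for a 429/500/503 response from the provider."""

def call_with_retry(call_model, max_attempts=3, base_delay=0.0):
    """Retry a flaky call a bounded number of times, then give up."""
    for attempt in range(1, max_attempts + 1):
        try:
            return call_model()
        except TransientError:
            if attempt == max_attempts:
                raise
            time.sleep(base_delay * attempt)  # simple linear backoff

# A fake model call that fails twice, then succeeds -- similar to what
# AgentBreak injects with --fail-rate 0.2, but deterministic for testing.
attempts = {"n": 0}
def flaky_call():
    attempts["n"] += 1
    if attempts["n"] < 3:
        raise TransientError("injected 429")
    return "ok"
```

With AgentBreak running, the fake `flaky_call` would be replaced by a real client call to `http://localhost:5000/v1`.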

How It Works

Normal path:

your app -> AgentBreak -> provider

Mock path:

your app -> AgentBreak -> fake response / injected failure

Real Provider Mode

If you want real upstream calls plus randomly injected failures:

agentbreak start \
  --mode proxy \
  --upstream-url https://api.openai.com \
  --scenario mixed-transient \
  --fail-rate 0.2

Then keep your app pointed at:

export OPENAI_BASE_URL=http://localhost:5000/v1
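Proxy mode is a good place to exercise fallback logic: when an injected failure exhausts retries, the app should degrade rather than crash. A sketch of that pattern, with illustrative function names:

```python
def answer(primary, fallback):
    """Try the primary model call; on any failure, return a degraded answer."""
    try:
        return primary()
    except Exception:
        return fallback()

def primary_call():
    # Simulates AgentBreak injecting a 503 on the proxied request.
    raise RuntimeError("injected 503")

def cached_fallback():
    return "The assistant is temporarily unavailable."
```

In a real app, `fallback` might hit a secondary provider or return a cached response.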

Why This Matters

Even major hosted providers have non-zero downtime.

As of March 15, 2026, the official status pages report roughly:

  • OpenAI APIs: 99.76% uptime over the last 90 days
  • Anthropic API: 99.4% uptime over the last 90 days

Annualized, that works out to:

  • 99.76% uptime annualizes to about 21 hours of downtime per year
  • 99.4% uptime annualizes to about 53 hours of downtime per year

Self-hosted systems often have more moving parts to break: your own gateway, networking, autoscaling, auth, rate limiting, queues, and model serving stack. In practice, that can fail more often than a top-tier managed API.

That is why resilience testing matters. You want to know whether your agent retries correctly, falls back correctly, avoids loops, and degrades gracefully before a real outage or brownout happens.
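Degrading gracefully usually means capped exponential backoff with jitter rather than a tight retry loop that hammers a struggling provider. A minimal sketch of the delay schedule (the constants are illustrative, not a recommendation from this project):

```python
import random

def backoff_delay(attempt, base=0.5, cap=30.0, rng=random.random):
    """Full-jitter backoff: uniform in [0, min(cap, base * 2**attempt)]."""
    return rng() * min(cap, base * (2 ** attempt))
```

Passing a fixed `rng` makes the schedule deterministic for tests; in production the default `random.random` spreads retries out so clients do not retry in lockstep.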

What It Supports

  • POST /v1/chat/completions
  • failure injection: 400, 401, 403, 404, 413, 429, 500, 503
  • latency injection
  • duplicate request tracking
  • a simple scorecard
  • mock mode and proxy mode
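A client should treat those injected status codes differently: 429, 500, and 503 are typically transient and worth retrying, while 400, 401, 403, 404, and 413 signal a bug or auth problem that retrying will not fix. A sketch of that split:

```python
# Transient statuses worth retrying; everything else should fail fast.
RETRYABLE = {429, 500, 503}

def should_retry(status: int) -> bool:
    """Retry only transient server/rate-limit errors, not client errors."""
    return status in RETRYABLE
```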

Useful Endpoints

curl http://localhost:5000/_agentbreak/scorecard
curl http://localhost:5000/_agentbreak/requests

Stop the server with Ctrl+C to print the final scorecard in the terminal.

Config File

AgentBreak will load config.yaml from the current directory if it exists.

Fastest setup:

cp config.example.yaml config.yaml
agentbreak start

CLI flags override config values.
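config.example.yaml in the repo shows the actual schema; purely as a hypothetical illustration, a config mirroring the CLI flags used above might look like this (the key names are assumptions, not verified against the package):

```yaml
# Hypothetical keys mirroring the CLI flags; see config.example.yaml for the real schema.
mode: mock
scenario: mixed-transient
fail_rate: 0.2
```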

Examples

Run the sample LangChain app:

cd examples/simple_langchain
pip install -r requirements.txt
OPENAI_API_KEY=dummy OPENAI_BASE_URL=http://localhost:5000/v1 python main.py

More examples: examples/README.md

Development

Install locally in editable mode:

pip install -e .

Run tests:

pip install -e '.[dev]'
pytest -q

Claude Code

Install the slash command into your project:

mkdir -p .claude/commands
curl -sSL https://raw.githubusercontent.com/mnvsk97/agentbreak/main/.claude-plugin/commands/agentbreak.md \
  -o .claude/commands/agentbreak.md

Then in Claude Code:

/agentbreak run my app in mock mode and check the scorecard
/agentbreak start proxy mode against https://api.openai.com with rate-limited scenario

The command teaches Claude how to start the server, choose scenarios, write config files, and interpret the scorecard.

Agent Skill

This repo includes a portable skill in the Agent Skills format at skills/agentbreak-testing/SKILL.md.
