Ralph

              --.                        .-+.
              -+++--.                 .-+++#.
              +++++++-.            .-+++++++-
              +++++++++-. .....   .+++++++++-
              +++++++++++-++++++--++++++++++-
              ++++++++++++++++++++++++++++++-
              +++++++++++++++++++----+++++++-
              -++++++++++++++++++.   ..--+++.
              .+++++-...--+++++++     ...-+-
               -+++-      ..---++...   ...+.
               .++++.     +#+ -++.+#+ ...--
                -++++..   -++.-++-+#-...-+.
                  -++++-     .###-  .-++.
                    .-+.       .     .+
                     .+-.   --.    ...
                    .++++       ..-+-. ..
                    .++++      ...-.-+--.
                    .++++-.    +#+-..-----
                    -+++++++--######++-- .
       ...          -++++---++#####+++-+-.
       ..     .-.   -++++.    ...-+--...
        .    .-++--..+++++-.. ...-+++-
        ...  .-+++++-+++++++----+++++.
          ...++++++++++++--....--+++-
           .-+++++++++++-        .++.
         .......-----##+..........+##-.....
              ........................

A small, opinionated Go CLI that drives an iterative loop against the GitHub Copilot SDK. One prompt in, many turns of progress out — until Ralph decides he's done, you run out of iterations, or you hit Ctrl+C.

Why Ralph

The "Ralph Wiggum" technique is dead simple: keep poking the model with the same prompt until the work is actually done. Ralph wraps that idea in a single binary so you can hand a task to Copilot and watch it iterate without babysitting the chat window.

Stream first. Tokens, reasoning, and tool calls land on stdout the moment they arrive.
Promise-based completion. The model wraps its sign-off in <promise>...</promise>; Ralph ends the loop cleanly when it sees one.
Hard limits. --max-iterations and --timeout keep runaway loops in check.
Verifiable progress. --verify-cmd runs your build/test suite after every iteration and feeds failures back as context for the next turn.
Auto-commit. --auto-commit snapshots each iteration as a git commit, optionally tagged, so you can bisect if something goes wrong.
Second opinion. --oracle-model consults a second Copilot model between iterations for an independent assessment.
No magic. Plain CLI, plain logs, no TUI, no wizard.

Install

Requires Go 1.26+ and the GitHub Copilot CLI on $PATH.

# Install the latest release directly
go install github.com/patbaumgartner/copilot-ralph/cmd/ralph@latest

# Or grab a binary from the Releases page
# https://github.com/patbaumgartner/copilot-ralph/releases

# Or build from source
git clone https://github.com/patbaumgartner/copilot-ralph.git
cd copilot-ralph
make build   # produces ./bin/ralph

Quick start

# Inline prompt — Ralph iterates until the model promises it's done
ralph run "Add unit tests for the parser module"

# Markdown file as prompt
ralph run task.md

# Read the prompt from stdin
echo "Fix all TODO comments" | ralph run -
cat task.md | ralph run -

# Cap iterations and runtime
ralph run --max-iterations 5 --timeout 10m "Refactor the auth module"

# Run your test suite after every iteration; feed failures back automatically
ralph run --verify-cmd "make test" "Fix all failing tests"

# Auto-commit each iteration and run the oracle every 3 turns
ralph run --auto-commit --oracle-model gpt-4o --oracle-every 3 "Optimise hot path"

# Show the resolved config without calling the model
ralph run --dry-run "Implement OAuth"

Features

Env-var overrides

Every major flag reads a RALPH_* environment variable as its default, so Ralph can be configured once (e.g. in a .env file or CI secret) without repeating flags on every invocation.

export RALPH_MAX_ITERATIONS=20
export RALPH_VERIFY_CMD="make test"
export RALPH_ON_COMPLETE="./scripts/notify.sh"
ralph run "Implement the feature"

Supported variables: RALPH_MAX_ITERATIONS, RALPH_TIMEOUT, RALPH_ITERATION_TIMEOUT, RALPH_PROMISE, RALPH_MODEL, RALPH_WORKING_DIR, RALPH_STREAMING, RALPH_SYSTEM_PROMPT, RALPH_CARRY_CONTEXT, RALPH_NO_RATE_LIMIT_WAIT, RALPH_VERIFY_CMD, RALPH_CHECKPOINT_FILE, RALPH_ORACLE_MODEL, RALPH_BLOCKED_PHRASE, RALPH_STALL_AFTER, RALPH_ITERATION_DELAY, RALPH_ON_COMPLETE, RALPH_ON_BLOCKED.

Loop control

Flag	Default	Purpose
`-m`, `--max-iterations`	`10`	Stop after N loops.
`-t`, `--timeout`	`30m`	Hard wall-clock deadline.
`--iteration-timeout`	`0`	Per-iteration soft deadline (0 disables).
`--stop-on-no-changes`	`0`	Halt after N iterations with no git changes.
`--stop-on-error`	`0`	Halt after N iterations emitting errors.
`--stall-after`	`0`	Halt after N consecutive identical responses (0 disables).
`--iteration-delay`	`0`	Pause between iterations (e.g. `2s`).
`--model`	`gpt-4`	Copilot model id.
`--working-dir`	cwd	Directory where the assistant runs tools.
`--log-level`	`info`	`debug` / `info` / `warn` / `error`.
`--dry-run`	`false`	Print resolved config and exit.

Prompt & context

Flag	Default	Purpose
`--promise`	`I'm special!`	Phrase the model wraps in `<promise>`.
`--blocked-phrase`	(none)	Phrase the model wraps in `<blocked>` to signal it cannot proceed.
`--streaming`	`true`	Stream deltas vs. wait for full messages.
`--system-prompt`	(built-in)	Inline text or path to a Markdown file.
`--system-prompt-mode`	`append`	`append` or `replace` Ralph's prompt.
`--carry-context`	`summary`	`off` / `summary` / `verbatim`.
`--carry-context-max-runes`	`4000`	Max runes carried into the next prompt.
`--prompt-stack`	(none)	Extra prompts prepended each iteration.
`--plan-file`	(none)	Shared Markdown scratchpad injected every turn.
`--specs`	(none)	Directory of spec files listed each turn.

Build verification

Flag	Default	Purpose
`--verify-cmd`	(none)	Shell command run after each iteration.
`--verify-timeout`	`5m`	Timeout for a single verify run.
`--verify-max-bytes`	`16384`	Max bytes captured per stream.

Auto-commit & git

Flag	Default	Purpose
`--auto-commit`	`false`	`git add -A && commit` after each iteration.
`--auto-commit-message`	`ralph: iteration %d`	Commit message format (`%d` = iteration).
`--auto-commit-on-failure`	`false`	Commit even when verify failed.
`--auto-tag`	(none)	Annotated tag format (e.g. `ralph/iter-%d`).
`--diff-stat`	`false`	Emit `git diff --stat HEAD` each iteration.

Output sinks

Flag	Default	Purpose
`--json`	`false`	Emit JSON Lines to stdout.
`--json-output`	(none)	Also write JSON Lines to a file.
`--log-file`	(none)	Append a one-line summary of every event.
`--webhook`	(none)	POST every event as JSON to this URL.
`--webhook-timeout`	`5s`	Timeout for a single webhook delivery.

Checkpoint & resume

Flag	Default	Purpose
`--checkpoint-file`	(none)	Persist loop state after every iteration.

# Pause a long run with Ctrl+\ and resume later
ralph run --checkpoint-file state.json "Big refactor"
ralph resume --checkpoint-file state.json

Oracle (second opinion)

Flag	Default	Purpose
`--oracle-model`	(none)	Second Copilot model consulted between iters.
`--oracle-every`	`0`	Consult every N iterations (0 disables).
`--oracle-on-verify-fail`	`false`	Consult whenever verify fails.

Lifecycle hooks

Flag	Default	Purpose
`--on-complete`	(none)	Shell command run when the loop completes successfully.
`--on-blocked`	(none)	Shell command run when the model emits the blocked signal.

Hooks receive RALPH_STATE and RALPH_ITERATIONS environment variables. Hook errors are printed as warnings and do not change Ralph's exit code.

Rate limiting

Flag	Default	Purpose
`--no-rate-limit-wait`	`false`	Fail fast instead of waiting for reset.

Commands

Command	Purpose
`ralph run <prompt>`	Run the iteration loop. Prompt may be text, a file path, or `-` for stdin.
`ralph resume`	Resume from a `--checkpoint-file`.
`ralph reset`	Delete a checkpoint file (`--force` skips confirmation).
`ralph doctor`	Check environment health (Copilot CLI, git, writable cwd).
`ralph version`	Print build metadata.
`ralph completion <shell>`	Print a shell completion script for `bash`, `zsh`, `fish`, or `powershell`.

How it works

┌─────────────────────────────────────────────────────────────┐
│  ralph run "Your task"                                      │
│                                                             │
│  1. Build prompt ← system prompt + plan-file + specs +      │
│                    previous response (carry-context) + user │
│  2. Send to Copilot SDK (streaming)                         │
│  3. Print assistant tokens, reasoning, and tool events      │
│  4. Run --verify-cmd (if set) and capture output            │
│  5. Optionally consult oracle model for second opinion      │
│  6. Auto-commit (if --auto-commit)                          │
│  7. Check stop conditions:                                  │
│       • <promise>...</promise> detected → done ✓            │
│       • --max-iterations reached → exit 4                   │
│       • --timeout elapsed → exit 3                          │
│       • --stop-on-no-changes or --stop-on-error triggered   │
│  8. Loop back to step 1 with the next iteration             │
└─────────────────────────────────────────────────────────────┘

The loop engine lives in internal/core. The SDK wrapper (internal/sdk) handles sessions, retries, and rate-limit backoff. The CLI layer (cmd/ralph, internal/cli) wires flags to a LoopConfig struct and prints typed events as they stream in. Output sinks (JSON, log file, webhook) plug into internal/eventsink without touching engine internals. There is no shared mutable state beyond Cobra flag bindings.

Carry context modes

Mode	Behaviour
`off`	Each iteration only sees the original prompt.
`summary`	The assistant's last response is summarised and prepended (default).
`verbatim`	The raw last response is prepended (up to `--carry-context-max-runes`).

Exit codes

Code	Meaning
`0`	Loop finished cleanly (promise received or no-op).
`1`	Generic failure / SDK error.
`2`	Cancelled (`Ctrl+C` or invalid args).
`3`	`--timeout` exceeded.
`4`	`--max-iterations` reached without a promise.
`5`	Model signalled it cannot proceed (`--blocked-phrase` detected).

Development

make all        # tidy + fmt + vet + lint + test + build
make test       # go test -race -cover ./...
make build      # ./bin/ralph

Conventions, architecture, and "how to add X" live in .github/copilot-instructions.md. Contribution workflow is in CONTRIBUTING.md. User-visible changes belong in CHANGELOG.md. Security disclosures go through SECURITY.md.

Acknowledgements

The original Ralph Wiggum plugin for Claude Code that inspired the loop pattern.
The GitHub Copilot SDK team.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.github		.github
cmd/ralph		cmd/ralph
internal		internal
pkg/version		pkg/version
.editorconfig		.editorconfig
.gitattributes		.gitattributes
.gitignore		.gitignore
.golangci.yml		.golangci.yml
.goreleaser.yaml		.goreleaser.yaml
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
SECURITY.md		SECURITY.md
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Ralph

Why Ralph

Install

Quick start

Features

Env-var overrides

Loop control

Prompt & context

Build verification

Auto-commit & git

Output sinks

Checkpoint & resume

Oracle (second opinion)

Lifecycle hooks

Rate limiting

Commands

How it works

Carry context modes

Exit codes

Development

Acknowledgements

License

About

Uh oh!

Releases 5

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Ralph

Why Ralph

Install

Quick start

Features

Env-var overrides

Loop control

Prompt & context

Build verification

Auto-commit & git

Output sinks

Checkpoint & resume

Oracle (second opinion)

Lifecycle hooks

Rate limiting

Commands

How it works

Carry context modes

Exit codes

Development

Acknowledgements

License

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 5

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages