angry-ralph

A Claude Code plugin that transforms a feature spec into a fully implemented, reviewed, and tested codebase, using adversarial multi-LLM review and TDD-gated execution.

Built on the shoulders of giants. angry-ralph unifies and extends three foundational projects by piercelamb:

deep-project - Feature decomposition and stakeholder interviews

deep-plan - Detailed implementation planning with section-level specs

deep-implement - TDD execution with strict red-green cycles

angry-ralph combines all three into a single self-contained plugin, adds adversarial review via Gemini and Codex CLIs, and enforces execution with a built-in Ralph Wiggum Loop, a Stop hook that blocks exit until tests pass.

What It Does

Give angry-ralph a spec file and it runs a 6-phase pipeline:

Phase	What Happens
1. DECOMPOSE	Reads your spec, interviews you to clarify scope, identifies planning units
2. PLAN	Writes a detailed implementation plan with numbered sections
3. ADVERSARIAL REVIEW	Dispatches Gemini and Codex CLIs to review the plan, triages findings, iterates up to N times
4. SPLIT	Finalizes the plan into individual section spec files
5. EXECUTE	TDD Ralph Loop per section: tests first, implement, all tests pass, inline review gate, atomic commit
6. FINAL REVIEW	Self-healing review loop: external LLMs review, fix, re-review until clean or capped

Every phase produces persistent artifacts on disk. If the session is interrupted, re-running the command detects prior state and offers to resume.

Prerequisites

Required:

Claude Code (claude in PATH)
git
python3

Optional (upgrades review tier):

Gemini CLI (gemini in PATH)
Codex CLI (codex in PATH)

angry-ralph works out-of-the-box with only the required tools. External CLIs upgrade the review from Self-Reflection (Claude reviewing its own work in a fresh session) to Adversarial (cross-model scrutiny).

Verify your environment:

bash /path/to/angry-ralph/scripts/checks/validate-env.sh

Quick Start

Option A: Install via Marketplace

/plugin marketplace add Custos/angry-ralph
/plugin install angry-ralph@custos-plugins

Option B: Load from Local Path

git clone https://github.com/Custos/angry-ralph.git
claude --plugin-dir /path/to/angry-ralph

Or for fully autonomous execution (no permission prompts):

claude --plugin-dir /path/to/angry-ralph --dangerously-skip-permissions

Warning: --dangerously-skip-permissions gives Claude unrestricted access to your filesystem, shell, and network. The Ralph Loop is designed to run autonomously and will hit permission prompts repeatedly without this flag, but you should understand the risk. Use in isolated environments or repositories you trust. The TDD gate and fail-closed Stop hook provide safety at the execution level, but they do not replace filesystem-level caution.

Run against your spec

/angry-ralph @path/to/your-spec.md

With a custom review iteration cap:

/angry-ralph @path/to/your-spec.md --max-review-iterations 5

With a custom section review iteration cap:

/angry-ralph @path/to/your-spec.md --max-section-review-iterations 3

The current working directory should be a git repo (or angry-ralph will offer to git init one for you). This is where the code gets built.

Commands

Command	Description
`/angry-ralph @spec.md [--auto]`	Start the full 6-phase pipeline against a spec file
`/angry-architect @spec.md [--auto]`	Run Phases 1-2 only (decompose + plan)
`/angry-review [plan\|code\|section <name>]`	Adversarial review — pipeline Phase 3 or on-demand
`/angry-execute [--auto] [--rebuild <section>]`	Run Phases 4-6 (split, TDD execute, final review)
`/angry-fix [context] [prompt]`	Surgical TDD strike on a specific file or bug
`/angry-diagnose [@ctx] "problem"`	Adversarial bug diagnosis with differential diagnosis and TDD fix
`/cancel-ralph`	Cancel an active Ralph Loop and remove state
`/angry-status`	Show current pipeline state (read-only)

All commands are thin wrappers that delegate to scripts/lib/pipeline.sh and scripts/lib/mechanical-gates.sh.

Architecture

.
├── .claude-plugin/
│   └── plugin.json              # Plugin manifest
├── agents/
│   └── external-reviewer.md     # Subagent: invokes gemini + codex CLIs (Bash + Read only)
├── commands/
│   ├── angry-ralph.md           # Full 6-phase pipeline (thin wrapper)
│   ├── angry-architect.md       # Phases 1-2: decompose + plan
│   ├── angry-review.md          # Phase 3 + on-demand review
│   ├── angry-execute.md         # Phases 4-6: split, TDD, final review
│   ├── angry-fix.md             # Surgical TDD strike
│   ├── angry-diagnose.md        # Adversarial bug diagnosis
│   ├── angry-status.md          # Pipeline state display
│   └── cancel-ralph.md          # Loop cancellation
├── hooks/
│   ├── hooks.json               # Stop + SubagentStop hook registration
│   └── stop-hook.sh             # TDD-gated exit interceptor
├── scripts/
│   ├── checks/
│   │   └── validate-env.sh      # Prerequisite validation
│   └── lib/
│       ├── state.sh             # State file CRUD (YAML frontmatter)
│       ├── pipeline.sh          # Pipeline state engine (JSON config, .done markers)
│       └── mechanical-gates.sh  # Stub grep, test verify, spec compliance gates
├── skills/
│   └── angry-ralph/
│       ├── SKILL.md             # Master orchestration skill
│       └── references/
│           ├── planning-protocol.md
│           ├── review-protocol.md
│           ├── tdd-protocol.md
│           ├── loop-protocol.md
│           ├── section-review-protocol.md
│           ├── final-review-protocol.md
│           └── diagnosis-protocol.md
└── tests/
    ├── test-state.sh
    ├── test-validate-env.sh
    ├── test-stop-hook.sh
    ├── test-plugin-structure.sh
    ├── test-mechanical-gates.sh
    └── test-pipeline.sh

Key Components

External Reviewer Agent - A sandboxed subagent restricted to Bash and Read tools only. It invokes gemini and codex via CLI subprocesses, passing file paths (never stdin). The CLIs read spec/plan/code files from disk and write review output to the .planning/reviews/ directory.

Ralph Loop (Stop + SubagentStop Hooks) - During the EXECUTE phase, both the Stop and SubagentStop hooks intercept exit attempts. Each section is dispatched to a fresh subagent with clean context, and the SubagentStop hook gates its completion. The hook checks a state file (.ralph-state/loop.md) and the session transcript for a completion promise (SECTION_COMPLETE). If the promise isn't found in the last assistant message, the hook blocks exit and feeds the section prompt back, forcing the loop to continue until tests pass. The hook is fail-open before confirming an active execute-phase loop (safe default), and fail-closed once gating a confirmed TDD loop. The main session stays lean, managing only coordination and section transitions.

State Management - All state lives in .ralph-state/: loop.md (YAML-frontmatter TDD loop state), pipeline.json (config, phase tracking, sections), and .done marker files (architect.done, review.done, execute.done) for idempotent phase gating. The state.sh, pipeline.sh, and mechanical-gates.sh libraries in scripts/lib/ provide all state operations.

How the Review Loop Works

┌──────────────┐
│  Plan/Code   │
└──────┬───────┘
       │
       v
┌──────────────┐    ┌──────────────┐
│  gemini CLI  │    │  codex CLI   │
│  (review)    │    │  (review)    │
└──────┬───────┘    └──────┬───────┘
       │                   │
       v                   v
┌──────────────────────────────────┐
│     Claude Triages Findings      │
│  ┌───────────┐  ┌──────────────┐ │
│  │ Auto-fix  │  │ AskUser for  │ │
│  │ clear     │  │ genuine      │ │
│  │ issues    │  │ ambiguity    │ │
│  └───────────┘  └──────────────┘ │
└────────────────┬─────────────────┘
                 │
                 v
          Iterate or proceed
       (max N iterations, default 3)

Findings are triaged by Claude: clear issues get auto-fixed, genuine ambiguities are surfaced to you via AskUserQuestion. The loop runs until both reviewers approve or the iteration cap is reached.

Testing

Run all tests:

bash tests/test-state.sh
bash tests/test-validate-env.sh
bash tests/test-stop-hook.sh
bash tests/test-plugin-structure.sh

Or all at once:

for t in tests/test-*.sh; do echo "=== $t ==="; bash "$t"; echo; done

151 tests across 6 suites covering state management, environment validation, stop hook behavior (fail-closed on corrupt transcripts, promise swap gating, TDD iteration cap), plugin structure integrity, mechanical gates (stub grep, test verification, spec compliance), and pipeline state engine (JSON config, .done markers, phase transitions).

Planning Artifacts

When angry-ralph runs, it creates two hidden, gitignored directories for runtime state and artifacts:

.ralph-state/
├── pipeline.json               # Session config (phase, mode, sections, timestamps)
├── loop.md                     # TDD loop state (YAML frontmatter + prompt body)
└── phases/
    ├── architect.done           # Phases 1-2 completion marker
    ├── review.done              # Phase 3 completion marker
    └── execute.done             # Phases 4-6 completion marker

.planning/
├── angry-ralph-plan.md         # The implementation plan
├── angry-ralph-interview.md    # Decomposition interview notes
├── reviews/
│   ├── iteration-1/
│   │   ├── gemini-review.md
│   │   └── codex-review.md
│   ├── sections/
│   │   └── section-01-name/
│   │       ├── review-1.md
│   │       └── review-2.md
│   ├── on-demand/
│   │   ├── code-<timestamp>/
│   │   ├── plan-<timestamp>/
│   │   └── section-<name>-<timestamp>/
│   └── final/
│       ├── gemini-review.md
│       └── codex-review.md
└── sections/
    ├── index.md
    ├── section-01-name.md
    └── section-02-name.md

These artifacts persist across sessions, enabling resume after interruptions.

Hard Constraints

These are non-negotiable design decisions baked into the plugin:

External reviewer tools: Bash and Read only - no file writes from review agents
CLI invocation: File paths as arguments, never stdin piping
Triage with user input: Ambiguous review findings always surface via AskUserQuestion
TDD enforcement: Tests written before implementation, must fail first, must pass before commit
Atomic commits: One commit per completed section
Fail-closed loop: Stop hook blocks exit if transcript parsing fails
Zero external plugin dependencies: Everything is self-contained

License

MIT

Contributing

See CONTRIBUTING.md for development setup, testing, and contribution guidelines.

Acknowledgments

This project would not exist without the deep-series by piercelamb:

deep-project - The decomposition and interview methodology
deep-plan - The section-based planning approach and multi-LLM review concept
deep-implement - The TDD execution discipline and atomic commit strategy

angry-ralph stands on these foundations and adds the Ralph Loop execution engine, unified plugin packaging, and cross-platform hardening.

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
.claude-plugin		.claude-plugin
agents		agents
commands		commands
docs/plans		docs/plans
hooks		hooks
scripts		scripts
skills/angry-ralph		skills/angry-ralph
tests		tests
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

angry-ralph

What It Does

Prerequisites

Quick Start

Option A: Install via Marketplace

Option B: Load from Local Path

Run against your spec

Commands

Architecture

Key Components

How the Review Loop Works

Testing

Planning Artifacts

Hard Constraints

License

Contributing

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

angry-ralph

What It Does

Prerequisites

Quick Start

Option A: Install via Marketplace

Option B: Load from Local Path

Run against your spec

Commands

Architecture

Key Components

How the Review Loop Works

Testing

Planning Artifacts

Hard Constraints

License

Contributing

Acknowledgments

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages