Forge

A Claude Code plugin for executing ambitious projects through design specifications, without losing sight of your constraints.

Forge has replaced my use of ralph-loops.

The Problem

When agents tackle multi-step projects, requirements drift. An instruction like "no external dependencies" or "match the existing tone" gets buried in context and ignored by step 10. Forge fixes this.

How It Works

Forge takes a design document and runs it through a pipeline:

Council — determines the agent roles (perspectives) needed for the project
Pipeline Design — captures the project's constraints, conventions, and quality bar
Agent Generation — generates project-specific agents for the council
Plan Decomposition — breaks work into small, self-contained tasks
Execution — runs each task through an experiment → verify → save loop, syncing with the remote between tasks
Report — summarizes what was built

The filesystem is the source of truth. Tasks move through todo/ → working/ → done/ (or blocked/). Runs are always resumable — re-run the same command and forge picks up where it left off.

Installation

Forge is a Claude Code plugin. See the Claude Code documentation for how to install plugins from the marketplace or load them locally via --plugin-dir.

Workflow

Write a design doc — describe what to build and any non-negotiable constraints
Open a Claude Code session — in your project directory
Run forge — /forge design.md works through the full pipeline automatically
If tasks block — forge creates design-blocked.md summarizing each failure and its reason
Resolve the issues — edit design-blocked.md to clarify requirements, add context, or adjust constraints
Run forge again — /forge design-blocked.md starts a fresh run targeting only the failures

Always forward — completed work is never re-run.

Usage

Open a Claude Code session in your project directory, then:

/forge path/to/design.md           # run fully automated (no prompts)
/forge path/to/design.md --ask     # pause for approval at each phase
/forge path/to/design.md --clean   # delete the .forge/ state and start over

With --ask, forge pauses at the council, pipeline, and agent generation phases, letting you review and request changes before proceeding. Without it, forge auto-approves everything and runs to completion. Interrupted runs resume automatically on the next invocation.

Your design document should describe what you want to build and any non-negotiable constraints (e.g., no external dependencies, match the existing voice and tone, all decisions must be reversible).

Testing forge itself:

./tests/test.sh --reset  # wipe state, run forge, then verify
./tests/test.sh          # verify only (after forge has already run)

Key Concepts

Global Constraints — defined once in pipeline.md, injected into every task. Constraints are verified after each task, not just at the end.

Council Deliberation — before execution, the task agent reasons through each council role's perspective in a single context. This catches issues before work begins.

council/*.md files — generated in Phase 3, one per role (e.g., programmer.md, tester.md). These are project-specific agent instructions tailored to the design and pipeline. They are used in two ways:

Phase 4 (Plan Decomposition): the plan-decomposer reads all of them to understand each role's scope and assign tasks to the right role.
Phase 5 (Execution): the file matching the task's role becomes that agent's primary instructions. All council files are also passed together so the agent can deliberate from every perspective before acting.

Dynamic Verification — every task that produces output with observable behavior includes a check that exercises it directly: starting a server and calling an endpoint, invoking a CLI with real arguments, running a script against real data. Static checks (file exists, pattern absent) confirm structure; dynamic checks confirm the output actually works.

Sync — before each task, forge pulls the latest changes so work done by others is visible. After each task completes, forge pushes so others receive it immediately. If a pull fails (conflict, no connectivity), the run stops cleanly for manual resolution. If there is no remote, sync is skipped silently.

Attempt Tracking — each task gets up to 3 attempts (configurable). After max attempts, the task moves to blocked/ for manual review rather than silently failing.

Task Context — each task agent receives its role's generated instructions, pipeline.md, the task file, and all council member files for deliberation. It does not receive the original design.md directly — by execution time, everything relevant should be captured in the task and pipeline.

Not Just for Code — Forge works for any file-based project. If no tech stack is detected, the council is inferred from the design document alone. Default roles (programmer, tester, product-manager) can be replaced during the approval step with whatever fits the project (e.g., writer, editor, strategist).

Docs

goal.md — problem statement and motivation
plan.md — full implementation blueprint

Requirements

Claude Code CLI
A git repository for the target project

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
.claude-plugin		.claude-plugin
agents		agents
commands		commands
docs		docs
hooks		hooks
tests		tests
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Forge

The Problem

How It Works

Installation

Workflow

Usage

Key Concepts

Docs

Requirements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Forge

The Problem

How It Works

Installation

Workflow

Usage

Key Concepts

Docs

Requirements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages