Evoloop

A self-contained multi-agent workflow for planning, building, reviewing, testing, and deploying software with strict quality gates and minimal context windows.

Overview

Evoloop orchestrates AI agents (Claude, Codex, Gemini) through a two-phase software delivery pipeline:

Planning - Exhaustive, area-based planning with quality gates, requirement traceability, and red-team review.
Implementation - Story-by-story execution loop: build, review/test, deploy, with deploy retry tracking and rollback handled by the deploy agent contract.

Each agent runs with a minimal, scoped context window. Fresh agents handle each step, so no single agent needs to hold the entire project in memory.

Need the basics only? See QuickUserGuide.md.

Prerequisites

jq - JSON processor
ripgrep (rg) - fast search
AI CLI tool(s) matching your configured agents/runners.json commands:
- Claude Code (claude)
- Codex (codex)
- Gemini CLI (gemini)
If you use gpt-5.3-codex, install Codex CLI 0.98.0 or later.

The default agents/runners.json uses Codex (gpt-5.3-codex, xhigh reasoning effort) for all agents. The default Codex runner passes --skip-git-repo-check, so planning and implementation can run before the repository is initialized.
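
A check along these lines can confirm the prerequisites before a first run. It is a minimal sketch, not part of Evoloop: `missing_tools` and `version_at_least` are hypothetical helper names, and the version comparison assumes a `sort` that supports `-V` (GNU coreutils and recent BSD).

```shell
#!/usr/bin/env bash
# Hypothetical helper: print each named tool that is not found on PATH.
missing_tools() {
  local tool
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

# Hypothetical helper: succeed when version $1 is >= minimum version $2.
# Assumes sort -V (version sort) is available.
version_at_least() {
  [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n 1)" = "$2" ]
}

# Report missing core tools without aborting.
missing="$(missing_tools jq rg)"
if [ -n "$missing" ]; then
  echo "missing tools: $missing"
fi
```

A call such as `version_at_least "$installed" 0.98.0` could then guard runs that use gpt-5.3-codex, where `$installed` is whatever version string you extract from your Codex CLI.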

Quickstart

```shell
# 1. Bootstrap the scaffold
./scripts/bootstrap-plan.sh

# 2. Drop your source material into .init/
#    (requirements, design docs, architecture notes, etc.)

# 3. Validate readiness
./scripts/doctor.sh --planning-only

# 4. Run planning pipeline (start -> user checkpoint -> all areas -> review -> redteam)
./orchestrator.sh plan

# 5. Run delivery pipeline (pm -> doctor -> implementation loop)
./orchestrator.sh run
```

Architecture

```text
orchestrator.sh                # Pipeline orchestrator (combined plan/run flows)
  ├── scripts/plan.sh          # Planning phase runner
  ├── scripts/implement.sh     # Implementation phase runner
  └── scripts/lib/common.sh    # Shared helpers, validation, gates
```

Directory Layout

| Directory | Purpose |
| --- | --- |
| `.init/` | User-provided source inputs (read-only) |
| `.plan/` | Planning artifacts (areas, registers, runbook) |
| `.plan/templates/` | Schema templates for stories, areas, registers |
| `.plan/areas/` | Per-area deep-dive documents |
| `.log/` | Per-run logs and context packs |
| `.state/` | Pipeline metadata (pipeline.json, lock file) |
| `agents/` | Role prompts and runner configuration |
| `prd/` | Story JSON specs and markdown trackers |
| `scripts/` | All executable scripts |
| `flowchart/` | Workflow diagrams |

Planning Phase

Planning is explicit, sequenced, and gate-driven through five subphases:

| Step | Agent | What It Does |
| --- | --- | --- |
| `start` | Planning Coordinator | Reads `.init/`, writes 4-5 clarifying questions to `.plan/questions.md`, creates area map and planning registers |
| `area` | Area Agent | Deepens one area at a time through `draft -> probing -> in_review -> approved -> locked` |
| `review` | Planning Reviewer | Finds gaps, produces `work-breakdown.md` with `REQ-###` IDs and `traceability.md` |
| `redteam` | Red-Team Agent | Stress-tests scale, security, rollback realism, critical-path coverage |
| `pm` | PM Agent | Converts work breakdown into story specs in `prd/` |

Planning Exit Gate

Before PM runs and before implementation, the orchestrator enforces:

All 8 planning docs exist (areas, work-breakdown, traceability, runbook, decisions, assumptions, dependencies, risk-register)
runbook.md, dependencies.md, and work-breakdown.md contain no TODO/TBD/FIXME
work-breakdown.md and traceability.md include REQ-### IDs
dependencies.md includes a critical path section
No area is still draft, probing, or in_review
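
Several of these checks can be approximated with plain `grep`. The sketch below is illustrative only and is not the orchestrator's actual gate: `check_exit_gate` is a hypothetical function, it covers only the four files named explicitly above, and it assumes the critical path section can be found by searching for the phrase "critical path".

```shell
#!/usr/bin/env bash
# Hypothetical sketch of a subset of the planning exit gate.
# $1 = planning directory (for example .plan). Returns 0 only if every check passes.
check_exit_gate() {
  local plan_dir="$1" doc failures=0

  # The named planning docs must exist.
  for doc in runbook dependencies work-breakdown traceability; do
    if [ ! -f "$plan_dir/$doc.md" ]; then
      echo "missing: $doc.md"
      failures=$((failures + 1))
    fi
  done

  # No placeholder text in the docs that must be final.
  for doc in runbook dependencies work-breakdown; do
    if grep -Eq 'TODO|TBD|FIXME' "$plan_dir/$doc.md" 2>/dev/null; then
      echo "placeholder text in: $doc.md"
      failures=$((failures + 1))
    fi
  done

  # Requirement IDs of the form REQ-### must be present.
  for doc in work-breakdown traceability; do
    if ! grep -Eq 'REQ-[0-9]{3}' "$plan_dir/$doc.md" 2>/dev/null; then
      echo "no REQ-### IDs in: $doc.md"
      failures=$((failures + 1))
    fi
  done

  # Assumption: the critical path section mentions the words "critical path".
  if ! grep -iq 'critical path' "$plan_dir/dependencies.md" 2>/dev/null; then
    echo "no critical path section in: dependencies.md"
    failures=$((failures + 1))
  fi

  [ "$failures" -eq 0 ]
}
```

Collecting every failure before returning, rather than stopping at the first, gives a complete list of gaps in one pass.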

During ./orchestrator.sh plan, the orchestrator pauses after start and collects answers interactively, then writes them to .plan/answers.md for downstream planning agents.

Implementation Phase

Each story progresses through strict stages handled by fresh agents:

```text
build --> review/test --> deploy --> complete
  ^          |              |
  |          |              +--> blocked (after 3 failures)
  +----------+--------------+
       (on failure)
```

| Stage | Agent | Behavior |
| --- | --- | --- |
| `build` | Builder | Implements the story, commits on branch, runs build command |
| `review`/`test` | Reviewer-Test | Reviews code, fixes minor issues, adds tests, verifies requirements |
| `deploy` | Deploy | Validates deploy safety contract, deploys, verifies, rolls back on failure |

Stage Transitions (enforced)

```text
build -> build | review
review/test -> build | deploy
deploy -> build | complete | blocked
```
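
The transition table above maps naturally onto a `case` statement. This is a sketch under assumptions: `is_valid_transition` is a hypothetical name, and the review/test stage is represented by the single identifier `review`, which may differ from how the scripts store it.

```shell
#!/usr/bin/env bash
# Hypothetical helper: return 0 when moving from stage $1 to stage $2 is allowed.
is_valid_transition() {
  case "$1:$2" in
    build:build|build:review) return 0 ;;
    review:build|review:deploy) return 0 ;;
    deploy:build|deploy:complete|deploy:blocked) return 0 ;;
    *) return 1 ;;
  esac
}

# Example: reject a jump that skips review.
if ! is_valid_transition build deploy; then
  echo "rejected: build -> deploy"
fi
```

Anything not listed falls through to the default branch, so new or misspelled stage names are rejected rather than silently accepted.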

Deploy Safety

Every story carries a deploy safety contract:

strategy - deployment approach (rolling, blue-green, etc.)
healthChecks - post-deploy health verification steps
rollbackTrigger - condition that triggers rollback
rollbackCommand - exact rollback command
verification - post-deploy verification steps

Deploy failures are tracked by the orchestrator. After 3 failed attempts, the story is set to blocked.
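
The retry rule can be pictured as a small decision function. The sketch below is not taken from the orchestrator: `stage_after_deploy_failure` and `MAX_DEPLOY_FAILURES` are hypothetical names, and it assumes a failed deploy returns the story to `build` until the limit is reached.

```shell
#!/usr/bin/env bash
# Assumed limit, matching the "3 failed attempts" rule described above.
MAX_DEPLOY_FAILURES=3

# Hypothetical helper: print the stage a story moves to after a failed deploy.
# $1 = number of failed deploy attempts so far, including the one that just failed.
stage_after_deploy_failure() {
  if [ "$1" -ge "$MAX_DEPLOY_FAILURES" ]; then
    echo blocked
  else
    echo build
  fi
}

stage_after_deploy_failure 1   # prints "build"
stage_after_deploy_failure 3   # prints "blocked"
```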

Gated Deploys

Stories with autonomy: "gated_deploy" require explicit approval:

```shell
./orchestrator.sh run --approve-deploy US-001
./orchestrator.sh run --approve-deploy all
```

Preflight Checks

Run doctor.sh before unattended runs:

```shell
./scripts/doctor.sh                      # Full check
./scripts/doctor.sh --planning-only      # Skip implementation gate
./scripts/doctor.sh --skip-runner-tools  # Skip CLI availability checks
./scripts/doctor.sh --verbose            # Show details on failures
```

Checks include:

Directory scaffold and executable entrypoints
Shell syntax validation for all scripts
JSON validity for runners and story files
Core tool availability (jq, rg)
Runner CLI availability from agents/runners.json
Codex CLI version compatibility for configured model requirements
Story schema and deploy-safety contract validation
Circular and broken dependency detection
Planning exit gate readiness

Agent Routing

Agents are routed via agents/runners.json. Each entry maps an agent name to a CLI command:

```json
{
  "default": {
    "cmd": ["codex", "exec", "--skip-git-repo-check", "--full-auto", "--model", "gpt-5.3-codex", "-c", "model_reasoning_effort=\"xhigh\""]
  }
}
```

If no matching runner is configured, the orchestrator falls back to the -agent/--agent flag. Passing -agent/--agent explicitly forces that provider for the run. See agents/runners.example.json for a multi-provider example.

Use {{PROMPT}} in the cmd array for tools that take the prompt as an argument instead of stdin.
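
To illustrate the difference between the two delivery modes, the sketch below runs a command with the prompt either substituted for `{{PROMPT}}` or piped on stdin. It is an assumption about the behavior, not the orchestrator's code: `run_with_prompt` is a hypothetical helper, and it only replaces array elements that are exactly `{{PROMPT}}`.

```shell
#!/usr/bin/env bash
# Hypothetical helper: run a runner command with a prompt.
# $1 = prompt text, remaining arguments = the runner's cmd array.
run_with_prompt() {
  local prompt="$1" n used=0 arg
  shift
  n=$#
  # Rotate through the arguments once, rebuilding the list with the prompt substituted.
  while [ "$n" -gt 0 ]; do
    arg="$1"
    shift
    if [ "$arg" = '{{PROMPT}}' ]; then
      set -- "$@" "$prompt"
      used=1
    else
      set -- "$@" "$arg"
    fi
    n=$((n - 1))
  done
  if [ "$used" -eq 1 ]; then
    "$@"                          # prompt was passed as an argument
  else
    printf '%s' "$prompt" | "$@"  # no placeholder: prompt goes to stdin
  fi
}

run_with_prompt 'hello from argv' echo '{{PROMPT}}'   # prints "hello from argv"
```

Keeping the command as separate arguments instead of one string avoids re-parsing by the shell, so prompts containing quotes or spaces pass through unchanged.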

CLI Reference

Planning

```shell
./orchestrator.sh plan [-agent claude|codex|gemini] [--runners <file>] [--areas area1,area2,...] [--skip-user-checkpoint]
```

Implementation

```shell
./orchestrator.sh run [-agent claude|codex|gemini]
                      [--story US-XXX]
                      [--max-iterations N]
                      [--approve-deploy US-XXX|all]
                      [--resume]
                      [--reset]
                      [--runners <file>]
```

Granular Planning (legacy/manual)

```shell
./orchestrator.sh plan start   [-agent claude|codex|gemini] [--runners <file>]
./orchestrator.sh plan area    --area <name> [-agent claude|codex|gemini] [--runners <file>]
./orchestrator.sh plan review  [-agent claude|codex|gemini] [--runners <file>]
./orchestrator.sh plan redteam [-agent claude|codex|gemini] [--runners <file>]
./orchestrator.sh plan pm      [-agent claude|codex|gemini] [--runners <file>]
```

Direct Entrypoints

```shell
./scripts/plan.sh start -agent codex
./scripts/implement.sh -agent codex --story US-001
```

Environment Variables

| Variable | Default | Description |
| --- | --- | --- |
| `CLAUDE_MODEL` | `opus` | Claude model name |
| `CODEX_MODEL` | `gpt-5.3-codex` | Codex model name |
| `CODEX_EFFORT` | `xhigh` | Codex reasoning effort |
| `GEMINI_MODEL` | `gemini-2.0-flash` | Gemini model name |
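
Defaults like these are commonly applied with shell parameter expansion, which keeps any value already present in the environment. The following is a sketch of that pattern, not the scripts' actual code: `apply_model_defaults` is a hypothetical function name.

```shell
#!/usr/bin/env bash
# Hypothetical helper: assign the documented default to each variable
# that is unset or empty, leaving user-supplied values untouched.
apply_model_defaults() {
  : "${CLAUDE_MODEL:=opus}"
  : "${CODEX_MODEL:=gpt-5.3-codex}"
  : "${CODEX_EFFORT:=xhigh}"
  : "${GEMINI_MODEL:=gemini-2.0-flash}"
}

apply_model_defaults
echo "codex model: $CODEX_MODEL (effort: $CODEX_EFFORT)"
```

Under this pattern, exporting a variable before invoking the orchestrator would override the default for that run only.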

Safety Features

Atomic state writes - All JSON state updates use temp file + mv to prevent corruption
Pipeline locking - mkdir-based lock prevents concurrent orchestrator instances
Trap handlers - Graceful cleanup on crash or interruption (logs failure, releases lock, cleans temp files)
Stall detection - Implementation exits if a story fails to advance after 3 consecutive iterations
Stage transition enforcement - Invalid stage transitions are rejected
Dependency cycle detection - doctor.sh catches circular and broken story dependencies
Deploy contract validation - Placeholder text (TODO/TBD/FIXME) is rejected in deploy safety fields
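
The first two mechanisms follow well-known shell patterns, sketched below under assumptions. The function names `atomic_write`, `acquire_lock`, and `release_lock` are hypothetical, and the sketch relies on `mv` within one directory being a rename and on `mkdir` failing when the directory already exists.

```shell
#!/usr/bin/env bash
# Hypothetical helper: write stdin to file $1 atomically.
# The temp file lives next to the target so that mv is a rename on the same filesystem.
atomic_write() {
  local target="$1" tmp
  tmp="$(mktemp "${target}.tmp.XXXXXX")" || return 1
  if cat > "$tmp" && mv "$tmp" "$target"; then
    return 0
  fi
  rm -f "$tmp"   # clean up the partial temp file on failure
  return 1
}

# Hypothetical helper: take the lock by creating directory $1.
# mkdir fails if the directory exists, so only one caller can succeed.
acquire_lock() {
  mkdir "$1" 2>/dev/null
}

# Hypothetical helper: release the lock by removing the directory.
release_lock() {
  rmdir "$1" 2>/dev/null
}
```

A caller would typically pair `acquire_lock` with `trap 'release_lock "$lock_dir"' EXIT` so the lock is released even when the script is interrupted; `$lock_dir` here is a placeholder for wherever the lock is kept.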
