Agent Workflow

Stop letting your AI coding agent freestyle your codebase.

Lightweight workflow skills for Claude Code, Codex, and other AI coding agents.
Turns a "vibe coder" into a disciplined project manager that clarifies requirements,
delegates to subagents, runs reviews, and delivers evidence — before claiming it's done.

English · 中文 · Comparison with Aegis & Superpowers

Why This Exists

AI coding agents are powerful. But give them a complex task and watch what happens:

They start coding before understanding the requirement
The main thread does everything — chatting, coding, testing, reviewing — all in one bloated context
Context gets compressed, goals get lost, behavior drifts
They say "done" without running a single test
The frontend looks like every other AI-generated page: Inter font, purple gradient, 3 equal columns
You ask "did you test the error case?" and they start over from scratch

Agent Workflow fixes this by adding one constraint: the main thread never touches code.

It becomes an Orchestrator — a project manager that talks to you, clarifies requirements, delegates to specialized subagents, and delivers results with evidence. Every coding, testing, and review task goes to an independent subagent with a self-contained context packet. No conversation history leaking. No context bloat. No drift.

Before / After

Without Agent Workflow

User:  给个人中心加手机号修改功能
AI:    好的，我来改几个文件
AI:    [修改了 6 个文件]
AI:    已完成
User:  测了吗？
AI:    理论上应该可以
User:  旧手机号要验证吗？
AI:    好的，我加上
AI:    [又改了 4 个文件]
AI:    已完成
User:  UI 怎么还是三等分布局...
AI:    我来重新设计
AI:    [上下文已压缩，忘记了之前的讨论]

With Agent Workflow

User:  用重任务流程处理：个人中心增加修改手机号功能

Orchestrator:
  1. "需要哪些验证方式？" → 短信验证码
  2. "旧手机号需要验证吗？" → 需要
  3. 写 requirements.md → 用户确认
  4. 写 acceptance.md → 用户确认
  5. 写 plan.md

  ── 派发 Implementation Subagent ──
     返回: DONE (4 files, tests passing)

  ── 派发 Spec Compliance Reviewer ──
     结果: PASS — all requirements covered

  ── 派发 Code Quality Reviewer ──
     结果: PASS

  ── 派发 Verification Subagent ──
     结果: PASS — 12 tests, 0 failures

  ── 派发 QA Subagent ──
     结果: PASS — 8/8 acceptance criteria

  → 交付：功能完成，附带测试证据

The user answered 3 questions. The Orchestrator managed the rest. Every step has evidence.

How It Works

graph LR
    User["User"] <-->|"clarify, confirm, handoff"| O["Orchestrator<br/>(main thread)"]
    O -->|"writes"| R["requirements.md"]
    O -->|"writes"| A["acceptance.md"]
    O -->|"writes"| P["plan.md"]

    O -->|"dispatches"| IMP["Implementation<br/>Subagent"]
    O -->|"dispatches"| SR["Spec Reviewer"]
    O -->|"dispatches"| QR["Code Quality<br/>Reviewer"]
    O -->|"dispatches"| UI["UI Reviewer<br/>(frontend only)"]
    O -->|"dispatches"| VER["Verification<br/>Subagent"]
    O -->|"dispatches"| QA["QA Subagent"]

    IMP -->|"status + evidence"| O
    SR -->|"pass/fail"| O
    QR -->|"pass/fail"| O
    UI -->|"AI Slop Score"| O
    VER -->|"test results"| O
    QA -->|"criteria check"| O

    style O fill:#2563EB,stroke:#1D4ED8,color:#fff
    style IMP fill:#10B981,stroke:#059669,color:#fff
    style SR fill:#F59E0B,stroke:#D97706,color:#fff
    style QR fill:#F59E0B,stroke:#D97706,color:#fff
    style UI fill:#EC4899,stroke:#DB2777,color:#fff
    style VER fill:#8B5CF6,stroke:#7C3AED,color:#fff
    style QA fill:#8B5CF6,stroke:#7C3AED,color:#fff

The Orchestrator never edits code directly. It only:

Talks to the user — requirement clarification, confirmations, final handoff
Manages state — reads/writes state.json, requirements, acceptance, plan
Dispatches subagents — builds self-contained context packets, delegates via Agent tool
Synthesizes results — handles subagent status, decides next action

Why Subagents?

This isn't just architectural aesthetics. It solves real problems:

Problem	How subagents fix it
Context bloat	Each subagent gets only what it needs — a focused context packet, not the entire conversation
Goal drift	Subagents have explicit stop conditions; they don't wander
"Done" without evidence	Verification and QA are separate subagents that run real tests, not vibes
Reviewer bias	The reviewer is a different subagent than the implementer — it reads the actual code, not the report
Main thread overload	The Orchestrator stays lightweight; code, tests, and reviews happen in parallel isolation

Workflow Stages

graph TD
    A["requirement_clarification"] --> B["requirements"]
    B --> C["acceptance"]
    C --> D["plan"]
    D --> E["implementation"]
    E --> F["spec_compliance_review"]
    F --> G["code_quality_review"]
    G --> H{"Frontend task?"}
    H -->|Yes| I["ui_review"]
    H -->|No| J["verification"]
    I --> J
    J --> K["qa"]
    K --> L["final_handoff"]

    style A fill:#3B82F6,stroke:#2563EB,color:#fff
    style E fill:#10B981,stroke:#059669,color:#fff
    style F fill:#F59E0B,stroke:#D97706,color:#fff
    style G fill:#F59E0B,stroke:#D97706,color:#fff
    style I fill:#EC4899,stroke:#DB2777,color:#fff
    style J fill:#8B5CF6,stroke:#7C3AED,color:#fff
    style K fill:#8B5CF6,stroke:#7C3AED,color:#fff
    style L fill:#3B82F6,stroke:#2563EB,color:#fff

Stage	Who runs	What happens
`requirement_clarification`	Orchestrator	Talks to user, clarifies ambiguities
`requirements`	Orchestrator	Writes requirements.md, user confirms
`acceptance`	Orchestrator	Writes acceptance.md with testable criteria, user confirms
`plan`	Orchestrator	Writes plan.md with executable task breakdown
`implementation`	Subagent	Implements code (worktree isolation for high-risk)
`spec_compliance_review`	Subagent	Reads actual code, compares to requirements line by line
`code_quality_review`	Subagent	Checks structure, correctness, maintainability
`ui_review`	Subagent	Catches AI slop — fonts, gradients, layout, responsiveness
`verification`	Subagent	Runs tests, lint, build
`qa`	Subagent	Verifies every acceptance criterion against code
`final_handoff`	Orchestrator	Reports results with evidence bundle

The UI Reviewer: Killing AI Slop

AI-generated frontends have a distinctive look: Inter font everywhere, purple-blue gradients, 3 equal columns, heavy shadows, placeholder content. We call this AI slop.

The UI Reviewer is a dedicated subagent that catches what code review misses:

graph LR
    UI["UI Reviewer"] --> T["Typography<br/>No Inter/Roboto<br/>No #000000 text"]
    UI --> C["Color<br/>No neon gradients<br/>Max 1 accent"]
    UI --> L["Layout<br/>No 3 equal columns<br/>Generous whitespace"]
    UI --> M["Motion<br/>Custom cubic-bezier<br/>Respect reduced-motion"]
    UI --> R["Responsive<br/>375px mobile<br/>44px touch targets"]
    UI --> A["Accessibility<br/>WCAG AA contrast<br/>Focus states"]

    style UI fill:#EC4899,stroke:#DB2777,color:#fff

It outputs an AI Slop Score (0-10): 0 = looks handcrafted, 10 = maximum AI slop.

The frontend implementer also gets design constraints injected into its prompt: typography rules, color palette limits, layout patterns, motion guidelines, icon choices, and content rules (no placeholder names, no em-dashes, real copy only).

Key Features

Feature	What it does
Orchestrator-subagent separation	Main thread coordinates, subagents execute. The Orchestrator never writes code.
SubagentContextPacket	Self-contained prompts with task, goal, files, non-goals, verification. No conversation history leaking.
Two-stage review	Spec compliance (did you build the right thing?) + code quality (did you build it well?)
UI review	AI Slop Score (0-10), responsive check, accessibility audit, design constraint enforcement
Frontend design constraints	Typography, color, layout, motion rules injected into implementation prompts
Implementer 4-status return	`DONE` / `DONE_WITH_CONCERNS` / `NEEDS_CONTEXT` / `BLOCKED` — Orchestrator handles each
Checkpoint & resume	Survives context resets via handoff.md. Never resumes from memory alone.
Drift detection	After each stage, verifies work still serves original intent
Risk-based isolation	High-risk tasks use git worktree isolation; medium-risk shares working directory

Quick Start

AI-Assisted Install (Recommended)

Paste this to your AI coding agent:

请阅读 https://github.com/xzh20121116/agent-workflow，帮我全局安装 agent-workflow 技能。

The agent will detect your host (Claude Code, Codex, etc.), clone the repo, set up the correct skill paths, and verify the installation.

Manual Install

# Clone to a central location
git clone https://github.com/xzh20121116/agent-workflow.git ~/.agent-workflow

# Symlink to your host's skill directory
# Claude Code:
ln -s ~/.agent-workflow/skills/agent-workflow-init ~/.claude/skills/agent-workflow-init
ln -s ~/.agent-workflow/skills/agent-workflow-start ~/.claude/skills/agent-workflow-start

# Codex App:
ln -s ~/.agent-workflow/skills/agent-workflow-init ~/.codex/skills/agent-workflow-init
ln -s ~/.agent-workflow/skills/agent-workflow-start ~/.codex/skills/agent-workflow-start

Usage

Initialize a project

帮我用 agent-workflow 初始化当前项目

This sets up docs/agent/ with project config, request templates, and AGENTS.md.

Start a feature (heavy workflow)

用重任务流程处理：用户个人中心增加修改手机号功能

The Orchestrator will clarify requirements, write acceptance criteria, get your confirmation, then automatically delegate through the full stage flow.

Fix a bug

用重任务流程处理：支付回调偶发失败，大概一天出现几次

The Orchestrator investigates with you first, then delegates root cause analysis and fix to the implementation subagent.

Beautify a frontend page

用重任务流程美化 src/pages/landing/index.tsx 页面

Automatically uses the frontend implementer with design constraints, and adds a UI review stage.

Run spec compliance review only

帮我审查 src/services/auth.service.ts 是否符合 docs/requirements.md 中的需求

Run code quality review only

帮我做代码质量审查：src/services/order.service.ts

Included Skills

Two skills, zero config:

Skill	Purpose
`agent-workflow-init`	Project-level bootstrapper. Creates `docs/agent/` structure, AGENTS.md, project config.
`agent-workflow-start`	Request-level entry point. Creates request workspace, drives the full workflow from clarification to delivery.

Subagent Prompt Templates

Each role has a dedicated prompt template in skills/agent-workflow-start/references/:

Template	Role	Key Feature
`implementer-prompt.md`	Backend implementation	SubagentContextPacket, 4-status return
`frontend-implementer-prompt.md`	Frontend implementation	Design constraints (typography, color, layout, motion)
`spec-reviewer-prompt.md`	Spec compliance review	"Do Not Trust the Report" — reads actual code
`code-quality-reviewer-prompt.md`	Code quality review	Structure, correctness, maintainability
`ui-reviewer-prompt.md`	UI/visual review	AI Slop Score, responsive check, accessibility
`verification-prompt.md`	Test/lint/build	Runs project test suite
`qa-prompt.md`	Acceptance criteria	Verifies every criterion against code

Example Output

After a successful workflow run, you get:

docs/agent/requests/REQ-20260609-001/
├── requirements.md          # What we're building
├── acceptance.md            # How we verify it
├── plan.md                  # Task breakdown
├── state.json               # Machine-readable state
├── handoff.md               # Checkpoint for resume
├── implementation.md        # What was built, files changed
├── review.md                # Spec + code quality findings
├── verification.md          # Test results, lint output
└── qa.md                    # Acceptance criteria check

Every claim is backed by evidence. No "theoretically it should work."

Comparison with Similar Tools

Agent Workflow is inspired by Aegis and Superpowers. Here's how they differ:

	Agent Workflow	Aegis	Superpowers
Philosophy	Process discipline via Orchestrator separation	Baseline-first, evidence-driven method pack	Composable auto-triggered skills
Main thread	Never touches code	Coordinator with baseline-read phase	Skills auto-trigger
UI/Frontend	Built-in design constraints + UI reviewer with AI Slop Score	Not included	Not included
Setup	Clone + symlink, zero config	Guided prompt + doctor script	Per-host plugin install
Best for	Frontend-heavy projects, Orchestrator discipline	Complex enterprise codebases, risk-adaptive TDD	TDD-first teams, strict process

For a detailed comparison, see docs/comparison.md.

When to use Agent Workflow

You care about frontend quality and want to eliminate AI slop
You want the main thread to stay focused on coordination, not coding
You want a simple setup with minimal configuration
You want explicit status handling (DONE / BLOCKED / NEEDS_CONTEXT) instead of assumptions

When to use something else

Complex enterprise codebase needing baseline reads before every change → Aegis
TDD-first team wanting strict red-green-refactor as non-negotiable discipline → Superpowers

Project Structure

.
├── skills/
│   ├── agent-workflow-init/
│   │   ├── SKILL.md
│   │   ├── references/agent-workflow-guide.md
│   │   ├── assets/templates/
│   │   │   ├── AGENTS.md.template
│   │   │   └── change-request-template.md
│   │   └── scripts/
│   │       ├── init_agent_workflow.py
│   │       └── install_symlinks.sh
│   └── agent-workflow-start/
│       ├── SKILL.md
│       ├── references/
│       │   ├── start-guide.md
│       │   ├── implementer-prompt.md
│       │   ├── frontend-implementer-prompt.md
│       │   ├── spec-reviewer-prompt.md
│       │   ├── code-quality-reviewer-prompt.md
│       │   ├── ui-reviewer-prompt.md
│       │   ├── verification-prompt.md
│       │   └── qa-prompt.md
│       └── scripts/
│           └── start_agent_workflow.py
├── .claude-plugin/plugin.json
├── .codex-plugin/plugin.json
├── LICENSE
└── README.md

Inspired By

Aegis — baseline-first, evidence-driven method pack for AI coding agents
Superpowers — composable agent skills by Jesse Vincent

License

MIT License. See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.claude-plugin		.claude-plugin
.codex-plugin		.codex-plugin
docs		docs
skills		skills
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README_zh-CN.md		README_zh-CN.md

Folders and files

Latest commit

History

Repository files navigation

Agent Workflow

Why This Exists

Before / After

Without Agent Workflow

With Agent Workflow

How It Works

Why Subagents?

Workflow Stages

The UI Reviewer: Killing AI Slop

Key Features

Quick Start

AI-Assisted Install (Recommended)

Manual Install

Usage

Initialize a project

Start a feature (heavy workflow)

Fix a bug

Beautify a frontend page

Run spec compliance review only

Run code quality review only

Included Skills

Subagent Prompt Templates

Example Output

Comparison with Similar Tools

When to use Agent Workflow

When to use something else

Project Structure

Inspired By

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages