Spec-First — Spec-Driven AI Development Workflow Engine

Bring structure, traceability, and quality gates to AI-assisted software delivery.

The Problem

AI coding assistants (Claude Code, Codex, Cursor) are powerful — but stateless. Every new session loses the context of previous decisions. Code ships without validation. Requirement changes become untraceable. Team members use AI inconsistently, making reviews unreliable.

Spec-First solves this at the process level, not the prompt level:

Symptom	Root Cause	Spec-First Solution
AI generates code inconsistent with earlier decisions	No persistent semantic context between sessions	`specs/<featureId>/` directory as the single source of truth across all sessions
Unvalidated AI output reaches production	No enforcement layer between generation and commit	Stage-gated state machine — each stage requires explicit gate passage before advancing
"Why was this written this way?" is unanswerable	No artifact linkage from requirements to implementation	14-type traceability ID system covering the full FR → DS → TASK → TC chain
Every developer prompts AI differently	No shared execution protocol	20 Skills with a unified 6-phase execution model (P0–P5)

How It Works

Spec-First wraps your AI workflow in a structured state machine. A feature begins at 00_init and can only advance when each stage's quality gate passes.

[Idea] → 00_init → 01_specify → 02_design → 03_plan → 04_implement → 05_verify → 06_wrap_up → 07_release → 08_done
           ↓            ↓            ↓           ↓            ↓             ↓             ↓             ↓
                                          09_cancelled  ←  (cancellable from any active stage)

At each stage, a Skill guides the AI through a deterministic 6-phase protocol:

P0  LOCATE       — Resolve the active feature and validate the current stage
P1  CONTEXT      — Load the spec directory, prior artifacts, and run history
P2  GENERATE     — AI inference produces a structured draft artifact
P3  CONFIRM      — User reviews, iterates, or rejects (multi-round supported)
P4  WRITE        — Finalized artifact is written and traceability IDs are registered
P5  SIDE EFFECTS — Sync tracking matrix, trigger gate evaluation, update runtime state

This means every AI action is locatable, context-aware, confirmable, and auditable.

Quick Start

Prerequisites

Node.js ≥ 20.0.0
npm or pnpm
Git
Claude Code or Codex (optional — required for Skill integration)

Install

npm install -g spec-first@latest
spec-first doctor          # Verify installation and host integration

Initialize a Feature

cd /path/to/your-project

# --mode N  = New feature  |  --mode I = Incremental improvement
# --size S  = Small        |  --size M = Medium  |  --size L = Large
spec-first init --feat AUTH --mode N --size M --platforms web,node

Run the Full Development Cycle (Claude Code / Codex)

/spec-first:onboarding    # First time? Start here
/spec-first:init          # Initialize feature workspace
/spec-first:spec          # Author requirements spec (FR + acceptance criteria)
/spec-first:design        # Technical design (DS + API contracts)
/spec-first:task          # Break design into tracked tasks
/spec-first:code          # Implement tasks with spec-linked commits
/spec-first:verify        # Run stage verification against acceptance criteria
/spec-first:archive       # Retrospective and close-out

Day-to-Day CLI

spec-first feature current          # Which feature am I on?
spec-first stage current            # Which stage is active?
spec-first gate                     # Are all gate conditions passing?
spec-first metrics report           # Coverage and health score
spec-first golive check <featureId> # Pre-release readiness check

Core Features

Stage State Machine (8 Active + 2 Terminal)

Every feature advances through eight active stages — each with blocking gate conditions — and terminates in one of two terminal states. Terminal states are irreversible by design.

Stage	Deliverable	Entry via
`00_init`	Feature workspace, config	`spec-first init`
`01_specify`	Requirements spec (FR + AC)	`/spec-first:spec`
`02_design`	Technical design (DS + API)	`/spec-first:design`
`03_plan`	Task list with traceability IDs	`/spec-first:task`
`04_implement`	Spec-linked code commits	`/spec-first:code`
`05_verify`	Test cases and coverage evidence	`/spec-first:verify`
`06_wrap_up`	Retrospective document	`/spec-first:archive`
`07_release`	Smoke test report + release note	`spec-first golive check`
`08_done`	(terminal)	`spec-first done`
`09_cancelled`	(terminal)	`spec-first stage cancel`

Quality Gates

Each stage defines blocking conditions that must pass before the stage can advance. Gate evaluation is deterministic and CI-compatible.

spec-first gate                              # Evaluate current stage
spec-first gate --stage 04_implement         # Evaluate a specific stage
spec-first golive check <featureId>          # Full pre-release gate (07_release)
spec-first metrics coverage --threshold 0.8  # Enforce coverage threshold

Full-Lifecycle Traceability

Every artifact is tagged with a typed traceability ID, forming a navigable chain from business requirements to deployed code:

FR · DS · TASK · TC · RFC                  ← primary delivery chain
REQ · SYS · ARCH · MOD                    ← requirements & architecture
ATP · STP · ITP · UTP                     ← test planning
Feature                                    ← feature-level tracking

14 ID types in total. Every ID is registered, searchable, and validated.

spec-first id generate FR        # Generate a new requirement ID
spec-first id verify FR-001      # Confirm ID is registered and linked
spec-first matrix sync           # Rebuild the traceability coverage matrix

20 Built-in Skills

Skills are the AI-facing interface. Each Skill executes the deterministic P0–P5 protocol, ensuring every AI interaction is context-loaded, confirmable, and side-effect-tracked.

Category	Skills
Onboarding	`onboarding`, `first`
Core Stages	`init`, `spec`, `design`, `research`, `task`, `code`, `review`, `archive`, `catchup`
Orchestration	`plan`, `verify`, `orchestrate`, `status`, `sync`, `feature`, `doctor`
Quality	`spec-review`, `analyze`

Host Integration and Automation

spec-first update          # Refresh baseline Skills/MCP for stable hosts (Claude Code + Codex)
spec-first update --host gemini   # Opt in to Gemini baseline (experimental)
spec-first update --host cursor   # Opt in to Cursor baseline (experimental)
spec-first hooks status    # Inspect Git hook integration
spec-first viewer start    # Launch the Stage Viewer dashboard
spec-first commit          # Structured commit with auto-linked traceability ID

Architecture

Spec-First is organized in three layers. The boundary between layers is strict: Skills never access the runtime directly; CLI commands never call Skills.

┌────────────────────────────────────────────────────┐
│  Skill Layer  — Claude Code / Codex integration    │
│  20 Skills · P0–P5 execution protocol              │
├────────────────────────────────────────────────────┤
│  CLI Layer  — spec-first <command>                 │
│  28 deterministic command groups                   │
├────────────────────────────────────────────────────┤
│  Runtime Layer                                     │
│  ┌─────────────────┬──────────────────────────┐   │
│  │ process-engine  │ Stage FSM, lifecycle ctrl │   │
│  │ gate-engine     │ Blocking condition eval   │   │
│  │ trace-engine    │ ID registry, coverage     │   │
│  │ skill-runtime   │ Skill dispatch, prompts   │   │
│  │ ai-orchestrator │ Auto-loop, context pack   │   │
│  │ metrics-engine  │ Health score, bottlenecks │   │
│  └─────────────────┴──────────────────────────┘   │
└────────────────────────────────────────────────────┘

Stage Viewer

Spec-First ships with a built-in visual dashboard. Launch it with spec-first viewer start to inspect feature health, stage progress, gate status, and time distribution at a glance.

Ecosystem Learning

Spec-First was built by studying the best ideas across the AI workflow ecosystem — not to replace these tools, but to synthesize their strongest patterns into one coherent process engine.

Project	Core Philosophy	What Spec-First Adopted
OpenSpec	Actions not Phases — artifact DAG over rigid stage gates	Delta Spec concept for requirement evolution tracking
Spec Kit	Spec-Driven Development — constitution as supreme authority	Specification-as-contract principle; consistency analysis patterns
Planning-Files	Context Engineering — files as persistent working memory	Cross-session context persistence via `specs/<featureId>/`; 5-Question Reboot for `catchup`
Trellis	Read Before Write — spec injection before every dev action	`before-dev` spec loading protocol; `break-loop` retrospective in `archive`
Superpowers	Discipline Over Convenience — TDD as a hard gate	P0–P5 deterministic execution model; verification-before-completion as a gate condition

Contributing

Bug reports and pull requests are welcome. Please open an issue first to discuss significant changes.

For local development:

npm install
npm run build
npm test
npm run lint

Name		Name	Last commit message	Last commit date
Latest commit History 162 Commits
.claude		.claude
.codex-backups/20260315-151157/skills/spec-first		.codex-backups/20260315-151157/skills/spec-first
.full-review		.full-review
.github/workflows		.github/workflows
.serena		.serena
.spec-first		.spec-first
docs		docs
packages/vscode-spec-first		packages/vscode-spec-first
scripts		scripts
skills/spec-first		skills/spec-first
specs		specs
src		src
tasks		tasks
templates		templates
tests		tests
.coverage		.coverage
.gitignore		.gitignore
.prettierrc		.prettierrc
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CLAUDE.md.bak		CLAUDE.md.bak
CLAUDE.md.bak2		CLAUDE.md.bak2
CLAUDE.md.bak3		CLAUDE.md.bak3
CLAUDE.md.bak4		CLAUDE.md.bak4
LICENSE		LICENSE
PROJECT-INTRODUCTION.md		PROJECT-INTRODUCTION.md
README-CN.md		README-CN.md
README.md		README.md
coverage.json		coverage.json
eslint.config.js		eslint.config.js
image.png		image.png
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
tsconfig.json		tsconfig.json
tsup.config.ts		tsup.config.ts
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spec-First — Spec-Driven AI Development Workflow Engine

The Problem

How It Works

Quick Start

Prerequisites

Install

Initialize a Feature

Run the Full Development Cycle (Claude Code / Codex)

Day-to-Day CLI

Core Features

Stage State Machine (8 Active + 2 Terminal)

Quality Gates

Full-Lifecycle Traceability

20 Built-in Skills

Host Integration and Automation

Architecture

Stage Viewer

Ecosystem Learning

Contributing

Repository

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Spec-First — Spec-Driven AI Development Workflow Engine

The Problem

How It Works

Quick Start

Prerequisites

Install

Initialize a Feature

Run the Full Development Cycle (Claude Code / Codex)

Day-to-Day CLI

Core Features

Stage State Machine (8 Active + 2 Terminal)

Quality Gates

Full-Lifecycle Traceability

20 Built-in Skills

Host Integration and Automation

Architecture

Stage Viewer

Ecosystem Learning

Contributing

Repository

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages