AI Product Operating System

A simulated, end-to-end product development organization where specialized AI agents collaborate to take an idea from raw hypothesis to deployed, instrumented product — following the same rigor as a real product team.

Who this is for: Product Managers, indie founders, and ICPs who want to ship AI-assisted products faster without skipping the parts that matter — research, architecture review, QA, metrics, and learning.

Built and operated with Claude Code. You need Claude Code to run the slash commands.

Quick Navigation

What you're looking for	Where to find it
Active project status, stage, blockers	`project-state.md`
Product ideas and issue definitions	`experiments/ideas/issue-NNN.md`
Market research and problem exploration	`experiments/exploration/exploration-NNN.md`
PRDs, UX specs, architecture plans	`experiments/plans/plan-NNN.md`
QA, code review, metrics, deploy results	`experiments/results/`
Demo scripts and presentations	`experiments/demos/`
Built app codebases	`apps/[project-name]/`
Pipeline command instructions	`commands/`
Agent role definitions	`agents/`
Engineering and product knowledge base	`knowledge/`
Quality gate rules and stage progression	`system-orchestrator.md`
Command execution framework	`command-protocol.md`

Projects Built

Issue	App	What It Does	Stack	Status
002	—	Gmail → WhatsApp daily digest summarizer	Next.js, Supabase, Gemini, Twilio	Archived
003	finance-advisor	AI personal finance advisor	Next.js, Supabase, Gemini	Complete
004	clarity	PM to-do list with AI task categorization	Next.js, Neon, Gemini	Complete
005	smb-bundler	Feature bundle + value-based pricing engine for B2B SaaS PMs	Next.js, Neon, Gemini	Complete
006	ozi-reorder	Reorder reminder experiment for dark-store baby essentials (50/50 test vs. control, 7 PostHog events)	Next.js, Neon, PostHog	Complete
007	ozi-insights	Synthetic Freshdesk support data for order reliability research (30 tickets, grounded in Play Store)	Data workspace	Explored

Each issue number maps directly across all folders: experiments/ideas/issue-NNN.md, experiments/exploration/exploration-NNN.md, experiments/plans/plan-NNN.md, and experiments/results/*-NNN.md.

The 12-Step Pipeline

The OS enforces a sequential pipeline with quality gates. A stage cannot start until the previous stage passes.

#	Command	Agent	Output
1	`/create-issue`	Research Agent	Structured opportunity brief
2	`/explore`	Research Agent	Market validation, recommendation
3	`/create-plan`	Product + Design + Backend/DB Architects	PRD, UX, architecture, DB schema
4	`/execute-plan`	Frontend + Backend Engineers	Working app codebase
5	`/deslop`	Deslop Agent	Clean, comment-free code
6	`/review`	Code Review Agent	Critical issues list (blocks until fixed)
7	`/peer-review`	Peer Review Agent	Adversarial architecture review
8	`/qa-test`	QA Agent	Reliability and edge-case test results
9	`/metric-plan`	Analytics Agent	North Star, funnels, ground-truth queries
10	`/deploy-check`	Deploy Agent	Production readiness sign-off
11	`/postmortem`	Learning Agent	Root cause analysis of pipeline failures
12	`/learning`	Learning Agent	Engineering rules extracted → knowledge base updated

Utility commands (run anytime):

/docs — Generate CODEBASE-CONTEXT.md for the active app
/explain — Deep-dive on a concept, pattern, or error

Knowledge Base

The system gets smarter with every cycle. After each /learning run, insights from postmortems are extracted into durable rules:

knowledge/engineering-lessons.md — Technical rules (e.g., fan-out cron, pagination bounds, telemetry resilience)
knowledge/product-lessons.md — Product patterns and anti-patterns
knowledge/prompt-library.md — Refined agent prompts extracted from what worked
knowledge/coding-standards.md — TypeScript, Next.js, Supabase/Neon standards
knowledge/architecture-guide.md — Default system architecture patterns
knowledge/analytics-framework.md — PostHog event schema and funnel design

Every agent reads the knowledge base before executing — preventing the same class of mistake from appearing twice.

Getting Started (Forking This Repo)

Check the current state — read project-state.md to see what stage the system is at and which issue is active
Pick an idea — browse experiments/ideas/ for context on past issues, or create a new one with /create-issue
Run commands sequentially — pass the command file from commands/ to Claude Code (e.g., paste commands/create-issue.md content and follow it)
Read the knowledge base first — every command in the pipeline reads all files in knowledge/ before generating output to avoid repeating past mistakes
Track gates, not just progress — check project-state.md after each command; blocked = do not proceed

Default tech stack (used across all apps):

Frontend: Next.js 16+ (App Router), TypeScript strict, Tailwind CSS 4+
Backend: Next.js API Routes, Neon DB (@neondatabase/serverless) or Supabase
AI: Google Gemini 2.5 Flash/Pro via @google/genai with structured outputs
Analytics: PostHog (posthog-js + posthog-node)
Hosting: Vercel

Environment setup per app:

cd apps/[project-name]
cp .env.local.example .env.local   # fill in your keys
npm install
npm run dev

Each app includes a schema.sql (idempotent) that must be applied in your database editor before first run.

The Human PM Role

Agents execute but do not replace judgment. The human PM is responsible for:

Deciding which ideas to pursue
Evaluating agent outputs at each stage
Overriding blocked quality gates when the tradeoff is justified
Making final product and architectural decisions
Approving releases

Repository Structure

/agents                    # Agent role definitions (one file per role)
/commands                  # Pipeline command instructions (one file per command)
/knowledge                 # Shared intelligence: standards, lessons, prompts
/experiments
  /ideas                   # Issue briefs (issue-NNN.md)
  /exploration             # Market validation outputs (exploration-NNN.md)
  /plans                   # PRDs, UX, architecture, DB schema (plan-NNN.md)
  /results                 # QA, reviews, metrics, deploy artifacts (*-NNN.md)
  /demos                   # Demo scripts and presentations
/apps                      # Built codebases (one folder per project)
  /[project-name]
    src/app/               # Next.js App Router pages and API routes
    src/components/        # UI components
    src/lib/               # Utilities, DB clients, AI helpers
    schema.sql             # Idempotent DB schema
    CODEBASE-CONTEXT.md    # Auto-generated docs (via /docs command)
    README.md              # Setup and run instructions
    .env.local.example     # Required env vars (no secrets)
project-state.md           # Live runtime memory — always check this first
system-orchestrator.md     # Quality gate rules and stage progression
command-protocol.md        # How commands load context and update state

Build faster. Learn systematically. Fail safely.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Product Operating System

Quick Navigation

Projects Built

The 12-Step Pipeline

Knowledge Base

Getting Started (Forking This Repo)

The Human PM Role

Repository Structure

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
agents		agents
apps		apps
commands		commands
experiments		experiments
knowledge		knowledge
postmortems		postmortems
src		src
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
README.md		README.md
command-protocol.md		command-protocol.md
project-state.md		project-state.md
system-orchestrator.md		system-orchestrator.md

Folders and files

Latest commit

History

Repository files navigation

AI Product Operating System

Quick Navigation

Projects Built

The 12-Step Pipeline

Knowledge Base

Getting Started (Forking This Repo)

The Human PM Role

Repository Structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages