Skip to content

shadowdevcode/ai-product-os

Repository files navigation

AI Product Operating System

A simulated, end-to-end product development organization where specialized AI agents collaborate to take an idea from raw hypothesis to deployed, instrumented product — following the same rigor as a real product team.

Who this is for: Product Managers, indie founders, and ICPs who want to ship AI-assisted products faster without skipping the parts that matter — research, architecture review, QA, metrics, and learning.

Built and operated with Claude Code. You need Claude Code to run the slash commands.


Quick Navigation

What you're looking for Where to find it
Active project status, stage, blockers project-state.md
Product ideas and issue definitions experiments/ideas/issue-NNN.md
Market research and problem exploration experiments/exploration/exploration-NNN.md
PRDs, UX specs, architecture plans experiments/plans/plan-NNN.md
QA, code review, metrics, deploy results experiments/results/
Demo scripts and presentations experiments/demos/
Built app codebases apps/[project-name]/
Pipeline command instructions commands/
Agent role definitions agents/
Engineering and product knowledge base knowledge/
Quality gate rules and stage progression system-orchestrator.md
Command execution framework command-protocol.md

Projects Built

Issue App What It Does Stack Status
002 Gmail → WhatsApp daily digest summarizer Next.js, Supabase, Gemini, Twilio Archived
003 finance-advisor AI personal finance advisor Next.js, Supabase, Gemini Complete
004 clarity PM to-do list with AI task categorization Next.js, Neon, Gemini Complete
005 smb-bundler Feature bundle + value-based pricing engine for B2B SaaS PMs Next.js, Neon, Gemini Complete
006 ozi-reorder Reorder reminder experiment for dark-store baby essentials (50/50 test vs. control, 7 PostHog events) Next.js, Neon, PostHog Complete
007 ozi-insights Synthetic Freshdesk support data for order reliability research (30 tickets, grounded in Play Store) Data workspace Explored

Each issue number maps directly across all folders: experiments/ideas/issue-NNN.md, experiments/exploration/exploration-NNN.md, experiments/plans/plan-NNN.md, and experiments/results/*-NNN.md.


The 12-Step Pipeline

The OS enforces a sequential pipeline with quality gates. A stage cannot start until the previous stage passes.

# Command Agent Output
1 /create-issue Research Agent Structured opportunity brief
2 /explore Research Agent Market validation, recommendation
3 /create-plan Product + Design + Backend/DB Architects PRD, UX, architecture, DB schema
4 /execute-plan Frontend + Backend Engineers Working app codebase
5 /deslop Deslop Agent Clean, comment-free code
6 /review Code Review Agent Critical issues list (blocks until fixed)
7 /peer-review Peer Review Agent Adversarial architecture review
8 /qa-test QA Agent Reliability and edge-case test results
9 /metric-plan Analytics Agent North Star, funnels, ground-truth queries
10 /deploy-check Deploy Agent Production readiness sign-off
11 /postmortem Learning Agent Root cause analysis of pipeline failures
12 /learning Learning Agent Engineering rules extracted → knowledge base updated

Utility commands (run anytime):

  • /docs — Generate CODEBASE-CONTEXT.md for the active app
  • /explain — Deep-dive on a concept, pattern, or error

Knowledge Base

The system gets smarter with every cycle. After each /learning run, insights from postmortems are extracted into durable rules:

Every agent reads the knowledge base before executing — preventing the same class of mistake from appearing twice.


Getting Started (Forking This Repo)

  1. Check the current state — read project-state.md to see what stage the system is at and which issue is active
  2. Pick an idea — browse experiments/ideas/ for context on past issues, or create a new one with /create-issue
  3. Run commands sequentially — pass the command file from commands/ to Claude Code (e.g., paste commands/create-issue.md content and follow it)
  4. Read the knowledge base first — every command in the pipeline reads all files in knowledge/ before generating output to avoid repeating past mistakes
  5. Track gates, not just progress — check project-state.md after each command; blocked = do not proceed

Default tech stack (used across all apps):

  • Frontend: Next.js 16+ (App Router), TypeScript strict, Tailwind CSS 4+
  • Backend: Next.js API Routes, Neon DB (@neondatabase/serverless) or Supabase
  • AI: Google Gemini 2.5 Flash/Pro via @google/genai with structured outputs
  • Analytics: PostHog (posthog-js + posthog-node)
  • Hosting: Vercel

Environment setup per app:

cd apps/[project-name]
cp .env.local.example .env.local   # fill in your keys
npm install
npm run dev

Each app includes a schema.sql (idempotent) that must be applied in your database editor before first run.


The Human PM Role

Agents execute but do not replace judgment. The human PM is responsible for:

  • Deciding which ideas to pursue
  • Evaluating agent outputs at each stage
  • Overriding blocked quality gates when the tradeoff is justified
  • Making final product and architectural decisions
  • Approving releases

Repository Structure

/agents                    # Agent role definitions (one file per role)
/commands                  # Pipeline command instructions (one file per command)
/knowledge                 # Shared intelligence: standards, lessons, prompts
/experiments
  /ideas                   # Issue briefs (issue-NNN.md)
  /exploration             # Market validation outputs (exploration-NNN.md)
  /plans                   # PRDs, UX, architecture, DB schema (plan-NNN.md)
  /results                 # QA, reviews, metrics, deploy artifacts (*-NNN.md)
  /demos                   # Demo scripts and presentations
/apps                      # Built codebases (one folder per project)
  /[project-name]
    src/app/               # Next.js App Router pages and API routes
    src/components/        # UI components
    src/lib/               # Utilities, DB clients, AI helpers
    schema.sql             # Idempotent DB schema
    CODEBASE-CONTEXT.md    # Auto-generated docs (via /docs command)
    README.md              # Setup and run instructions
    .env.local.example     # Required env vars (no secrets)
project-state.md           # Live runtime memory — always check this first
system-orchestrator.md     # Quality gate rules and stage progression
command-protocol.md        # How commands load context and update state

Build faster. Learn systematically. Fail safely.

About

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors