reactlens

The first E2E testing tool that understands your React code, not just your DOM.

Every existing E2E tool — Playwright, Cypress, QA Wolf — treats your app as a black box. They see HTML, accessibility trees, pixels. They never see what your code actually does: a tree of <Component>s with props, state, hooks, and a known relationship to your source files.

reactlens runs alongside Playwright and captures the React component tree, the accessibility tree, and axe-core findings at every test — and persists the whole run timeline so you can replay it offline. That changes what you can do:

Generate tests that exercise every render branch — loading, error, empty, success — by reading your source via AST, not guessing from one DOM snapshot. Each component gets a <Component>.contract.md next to its spec, documenting the visual-state set reactlens enumerated.
Diagnose failures by reading the actual prop that was wrong, the hook that returned bad data, the recent commit that touched the relevant file. Each diagnosis is classified real-bug | test-bug | flaky | env-issue with calibrated confidence (16/16 on the in-tree eval set in the with-snapshot condition; the without-snapshot baseline that would prove the moat is the headline metric is pending per ADR-0001, and the eval set is being grown via dogfooding and corpus harvest per ADR-0004).
Inspect any test step in a live dashboard — DOM preview alongside the React tree at that moment, with all props and hook values visible. The active step highlights the exact fiber Playwright is interacting with, resolved via the probe-built testid → fiber map.
Replay past runs offline. Every run writes its events + per-step frames to .reactlens/runs/<id>/; the dashboard picks them up from a dropdown and scrubs through with a timeline slider — no re-run needed.
Semantic diff (reactlens diff <runA> <runB>): structural changes in the component tree, not pixel-level noise — catches regressions pixel diffing misses. Accessibility-tree diff is bundled for free but is currently classified gray-zone pending the ablation in ADR-0001.

Built-in conveniences (not differentiators)

These ship with reactlens because you're already using Playwright. They are not the moat — any team can wire equivalents in an afternoon. See ADR-0002.

Watch mode (--watch): editing a file under src/ or e2e/ triggers an automatic re-run, debounced so two rapid saves yield one run.
axe-core a11y: runs against every test; violations land in the run timeline so the dashboard (and your CI) can surface them.

Status

reactlens is at v0.2 — 7 of 9 v0.2 phases shipped (P8 → P13). The component-tree capture works end-to-end on four fixture stacks: Vite + React Router (React 18 and 19), Next.js 14 App Router, and TanStack Router. Time-travel replay, watch mode, behavior contracts, semantic diff (component + a11y), and per-test axe-core are all live. AI-driven generation and diagnosis are wired to the Anthropic API (or your local claude CLI via --use-claude-code) and ready when you provide a key or sign-in. 137/137 unit + 8/8 integration green; 16/16 diagnostic-eval cases pass in the with-snapshot condition — the without-snapshot baseline that proves the moat actually contributes to that number is the next mandatory measurement (see ADR-0001). v0.3 priorities are committed: Component-Object Pattern (ADR-0006) and apply-fix loop closure (ADR-0007).

Remaining v0.2 work: P14 (CI Auto-PR mode, deferred), P15/P16 stretch (multi-user flows, opt-in telemetry).

Install

pnpm add -D reactlens
npx reactlens init           # one-time setup: copies templates, installs Playwright + MSW
npx reactlens generate       # AI writes tests, reading your component tree + DOM
npx reactlens run            # opens the dashboard, runs tests, diagnoses failures

reactlens supports React 18 and React 19 projects on Vite + React Router, Next.js 14 App Router, and TanStack Router — full templates + integration coverage on all four stacks.

Commands

Command	What it does
`reactlens init`	Detects your stack, copies `playwright.config.ts`, `reactlens/fixtures.ts`, `reactlens/global-setup.ts`, `reactlens/streaming-reporter.ts`, `reactlens.config.ts` into your project. Installs `@playwright/test` and `msw`.
`reactlens generate`	For each component matching `componentGlobs`, AST-analyzes its visual states, then asks Claude to write a Page Object + spec covering all of them. Writes `<ComponentName>.contract.md` alongside each spec. Runs `tsc --noEmit` on output. Requires `ANTHROPIC_API_KEY`.
`reactlens run`	Starts the dashboard at `localhost:7777`, runs Playwright with the streaming reporter, captures component snapshots + CDP screencast frames + accessibility tree + axe-core violations, persists everything to `.reactlens/runs/<id>/`, and (when `ANTHROPIC_API_KEY` is set) diagnoses each failure with classification + confidence + suggested patch. Add `--watch` to re-run on file changes.
`reactlens diff <runIdA> <runIdB>`	Semantic diff between two persisted runs — component-tree changes (props, additions, removals) plus accessibility-tree changes (role/name/attr) per test. Exit code mirrors `git diff` (0 identical, 1 differences). `--json` for tooling.
`reactlens regen`	Hashes each component source, regenerates only those whose hash changed. Cache lives at `.reactlens/cache.json`.
`reactlens analyze <report>`	Reads a Playwright JSON report and writes a Markdown diagnostic summary for failed tests. Useful for CI artifacts.

Useful flags

reactlens run --watch — after the initial run, re-run on changes under <cwd>/src and <cwd>/e2e. Debounced; Ctrl-C to exit.
reactlens run --ci — no dashboard, no auto-open, writes diagnoses to reactlens-diagnoses.json.
reactlens run --no-analyze — skip Claude diagnoses (and avoid API costs).
reactlens run --no-open — keep the dashboard server alive but don't open a browser.
reactlens run --reporter json — emit raw JSONL events on stdout (for piping).
reactlens run --max-cost <usd> — hard cap on aggregate diagnosis spend; aborts at the next message boundary once exceeded. Same flag on generate/analyze/regen.
reactlens generate --pages 'src/pages/Login.tsx' — limit generation to one component.
reactlens init --dry-run — list what init would do without writing.
reactlens generate --use-claude-code (and the same flag on run/regen/analyze) — part of the sovereignty story (ADR-0003): route through your local claude CLI binary instead of the API so reactlens uses your existing Claude.ai/Max session rather than double-billing you on per-token API charges. Local development only — Anthropic's TOS prohibits this for distributed tools. See docs/troubleshooting.md.

How the moat works

A small probe is bundled as an IIFE (dist/probe/probe.global.js, ~19 KB) and injected into every page during a test. It uses bippy to subscribe to React fiber commits, walks the tree on each render, sanitizes props/hooks, attributes every data-testid to its owning component fiber, and ships component:snapshot events over WebSocket to the dashboard server.

The Playwright fixture opens a CDP session at end-of-test to walk the accessibility tree (via Accessibility.getRootAXNode + recursive getChildAXNodes) and runs axe-core against the rendered DOM. Both feed the same event bus, so every persisted run carries: per-step component trees + DOM frames, an end-of-test a11y tree, and one event per axe violation. Time-travel replay reads it back from .reactlens/runs/<id>/events.jsonl without re-running Playwright.

Failure diagnosis pre-loads the latest snapshot for the failing test and hands it to the agent alongside the spec, the component source, and recent git history. The agent's classification is grounded in evidence visible in the snapshot — that's how it can tell cvv: '12' violates min(3) rather than guessing from the rendered DOM.

For full architecture, see CLAUDE.md. For phase-by-phase progress and architecture diagrams, see the project portal.

Visual overview

For a navigable bird's-eye view of the project — philosophy, architecture diagrams, current state across all phases (flowchart / kanban / timeline), animated dashboard mockup, usage scenarios — open the project portal in your browser:

open docs/portal/index.html        # macOS
xdg-open docs/portal/index.html    # Linux

The portal is a static HTML doc with diagrams via Mermaid. No build step.

Configuration

Create reactlens.config.ts (auto-generated by init):

import { defineConfig } from 'reactlens/config';

export default defineConfig({
  componentGlobs: ['src/pages/**/*.tsx', 'src/components/**/*.tsx'],
  output: { pages: 'e2e/pages', specs: 'e2e/specs' },
  msw: { handlers: 'src/mocks/handlers.ts' },
  dashboard: { port: 7777, open: true },
});

Goals and non-goals

Goals

React-only, deeply. The component-tree integration is the substrate of the moat.
The moat is defined by serving diagnosis: every capability is evaluated by whether it makes diagnosis measurably better under the ablation methodology of ADR-0001. See ADR-0008.
Sovereignty-first. No SaaS, no telemetry, no cloud sync (hard invariants). Offline operation is preserved for non-LLM commands; LLM-backed commands acknowledge they require network. See ADR-0003.
Two questions answered for every failure: is this a test bug or a real bug? Where exactly?
POM is the default test pattern; Component-Object Pattern is opt-in for teams committed to reactlens. See ADR-0006.
Zero-config for common React stacks.

Non-goals

We do NOT support Vue, Svelte, or Angular.
We do NOT support Cypress, WebdriverIO, or Selenium. Playwright only.
We do NOT host anything in the cloud.
We do NOT auto-apply fixes without explicit user confirmation.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 147 Commits
.claude		.claude
.github		.github
.impeccable		.impeccable
bin		bin
docs		docs
scripts		scripts
src		src
templates		templates
tests		tests
.gitignore		.gitignore
.npmrc		.npmrc
.nvmrc		.nvmrc
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
CONTEXT.md		CONTEXT.md
DESIGN.md		DESIGN.md
EXECUTION_PLAN.md		EXECUTION_PLAN.md
PRODUCT.md		PRODUCT.md
README.md		README.md
harvest-corpus.json		harvest-corpus.json
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml
tsconfig.json		tsconfig.json
tsup.config.ts		tsup.config.ts
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

reactlens

Built-in conveniences (not differentiators)

Status

Install

Commands

Useful flags

How the moat works

Visual overview

Configuration

Goals and non-goals

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

reactlens

Built-in conveniences (not differentiators)

Status

Install

Commands

Useful flags

How the moat works

Visual overview

Configuration

Goals and non-goals

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages