e2e-skills

A Claude Code plugin with four complementary E2E testing skills designed to work together:

playwright-test-generator — generates Playwright E2E tests from scratch, from coverage gap analysis to passing, reviewed tests
e2e-reviewer — finds issues in your tests and suggests fixes (Playwright, Cypress, Puppeteer)
playwright-debugger — diagnoses failures from playwright-report/ after you apply fixes and re-run
cypress-debugger — diagnoses failures from Cypress report files after you apply fixes and re-run

The typical workflow:

Run playwright-test-generator → generate tests with user approval → auto-reviewed by e2e-reviewer
If generated tests fail → playwright-debugger is invoked automatically after 3 fix attempts
For existing tests: run e2e-reviewer → fix issues → re-run tests
If tests fail → run playwright-debugger or cypress-debugger → fix → re-run tests

AI-generated E2E tests tend toward the statistically likely result — visibility checks that always pass, loose assertions that accept anything, and convenience methods that nobody calls. These skills catch what CI misses: tests that pass but prove nothing, and failures that are hard to trace.

Installation

# npx skills (recommended)
npx skills install dididy/e2e-skills

# Claude Code plugin marketplace
/plugin marketplace add dididy/e2e-skills
/plugin install e2e-skills@dididy

# Clone directly
mkdir -p ~/.claude/skills
git clone https://github.com/dididy/e2e-skills.git ~/.claude/skills/e2e-skills

Skill 1: `playwright-test-generator` — Test Generation

Generates Playwright E2E tests from scratch for any project. Starts from coverage gap analysis, explores the live app via Playwright CLI, designs scenarios with your approval, and auto-reviews generated tests with e2e-reviewer.

When to Use

You have a page or feature with no E2E coverage
You want to bootstrap a test suite for an existing app
You need to quickly add tests before a release

Usage

Generate playwright tests
Generate playwright tests for the login page
Write e2e tests for the settings page
Add playwright coverage for checkout flow

Pipeline

Step 1: Detect environment (config, baseURL, test dir, POM structure)
Step 2: Coverage gap analysis → user picks target
Step 3: Live browser exploration via Playwright CLI
Step 4: Scenario design → Plan Mode → user approves
Step 5: Code generation (POM + spec or flat spec, auto-detected)
Step 6: YAGNI audit + e2e-reviewer quality gate
Step 7: TS compile + test run → playwright-debugger on failure

Key Behaviors

Structure-aware: detects POM pattern and matches project conventions
No hallucinated selectors: explores real DOM before writing any code
Approval gate: shows scenario plan and locator table before generating code
Quality loop: YAGNI audit removes unused locators; e2e-reviewer catches P0 issues before you ever run the tests
Self-healing: 3 auto-fix attempts on failure, then hands off to playwright-debugger

Skill 2: `e2e-reviewer` — Quality Review

Catches issues in E2E tests that pass CI but fail to catch real regressions.

When to Use

Your tests always pass but you suspect they don't catch real bugs
You want to audit test quality before a release
You're reviewing Playwright, Cypress, or Puppeteer specs
You need to justify test coverage in a code review

Usage

Review these E2E tests for quality
Audit the spec files in tests/
Are there any always-passing tests?
My tests pass CI but I think they miss regressions

10 Patterns Detected

Tier 1 — P0/P1 (always check)

#	Pattern	Before	After
1	Name-assertion mismatch	Name says "status" but only checks `toBeVisible()`; name implies UI toggle but test uses localStorage	Add assertion for status content, or rename to match actual mechanism
2	Missing Then	Cancel action, verify text restored — input still visible?	Verify both `text.toBeVisible()` and `input.toBeHidden()`
3	Error swallowing	`try/catch` in spec, `.catch(() => {})` in POM	Let errors fail; remove silent catch from POM methods
4	Always-passing assertion	`expect(count).toBeGreaterThanOrEqual(0)`, `toBeAttached()` with no comment, `expect(await el.isVisible()).toBe(true)`, `expect(await el.isDisabled()).toBe(true)` (no retry), `toBeDefined()` on nullable / `not.toBeNull()` on potentially-empty string	`expect(count).toBeGreaterThan(0)`; `toBeVisible()` / `toBeDisabled()` (web-first); `not.toBeNull()` when null is the only invalid case; `toBeTruthy()` when empty string is also invalid
5	Bypass patterns	`if (visible) { expect(...) }`; `page.click(sel, { force: true })` without comment	Always assert; move env checks to `beforeEach`; add `// JUSTIFIED:` to force:true
6	Raw DOM queries	`document.querySelector` in `evaluate()`	Use framework element API (`locator` / `cy.get` / `page.$`)
7	Focused test leak	`test.only(...)` or `it.only(...)` committed — CI runs one test, silently skips the rest	Delete `.only`; use `--grep` or `--spec` CLI flags for local focus

Tier 2 — P1/P2 (check when time permits)

#	Pattern	Before	After
8	Flaky test patterns	`items.nth(2)` without comment; `test.describe.serial()`	Use `data-testid` or attribute selectors; replace serial suites with self-contained tests
9	Hard-coded sleep	`waitForTimeout(2000)` / `cy.wait(2000)`	Rely on framework auto-wait; use condition-based waits
10	YAGNI + Zombie Specs	`clickEdit()` never called; empty wrapper class; single-use Util; entire spec file duplicated by another	Delete unused members; inline single-use Util methods; delete zombie spec files

References

Review Workflow

Three-phase review with P0/P1/P2 severity:

Phase 1: Automated grep — mechanically detects error swallowing, always-passing (including toBeAttached() and isVisible() boolean trap — scans all .ts files, not just specs), test.only / it.only leak, conditional bypass in specs (POM methods reviewed manually in Phase 2), raw DOM, explicit sleeps, force: true usage, and describe.serial ordering
Phase 2: LLM analysis — semantic checks for naming, missing assertions, flaky patterns, YAGNI + zombie specs
Phase 3: Coverage gaps — suggests missing error paths, edge cases, accessibility, and auth boundary tests

Skill 3: `playwright-debugger` — Playwright Failure Debugger

Diagnoses Playwright test failures from a playwright-report/ directory — whether failures happened locally or in CI. Classifies root causes and provides concrete fixes.

When to Use

You have a playwright-report/ directory (local or downloaded from CI) with failures to understand
Tests pass locally but fail in CI
You're dealing with flaky or intermittent test failures
You get TimeoutError or locator not found without a clear cause

Usage

Debug these failing tests
Why did these tests fail?
Tests pass locally but fail in CI

Note: Provide the report as a local path. Download CI artifacts manually from GitHub Actions and pass the directory path — automatic artifact fetching is not supported.

14 Root Cause Categories

#	Category	Signals
F1	Flaky / Timing	`TimeoutError`, passes on retry
F2	Selector Broken	`locator not found`, strict mode violation
F3	Network Dependency	`net::ERR_*`, unexpected API response
F4	Assertion Mismatch	`Expected X to equal Y`, subject-inversion
F5	Missing Then	Action completed but wrong state remains
F6	Condition Branch Missing	Element conditionally present, assertion always runs
F7	Test Isolation Failure	Passes alone, fails in suite
F8	Environment Mismatch	CI vs local only; viewport, OS, timezone
F9	Data Dependency	Missing seed data, hardcoded IDs
F10	Auth / Session	Session expired, role-based UI not rendered
F11	Async Order Assumption	`Promise.all` order, parallel race
F12	POM / Locator Drift	DOM structure changed, POM not updated
F13	Error Swallowing	`.catch(() => {})` hiding actual failure
F14	Animation Race	Element visible but content not yet rendered

Debug Workflow

Extract — parse results.json for failed tests, error messages, duration
Classify — map each failure to F1–F14 using error signals (most failures resolved here)
Trace — if still unclear, extract trace.zip and inspect step-by-step: failed actions, DOM snapshots, network errors, JS console errors
Fix — concrete code suggestion per failure, P0/P1/P2 priority

Skill 4: `cypress-debugger` — Cypress Failure Debugger

Diagnoses Cypress test failures from mochawesome or JUnit report files. Classifies root causes and provides concrete fixes.

When to Use

You have a cypress/reports/ directory (local or downloaded from CI) with failures to understand
Cypress tests pass locally but fail in CI
You're dealing with flaky or intermittent Cypress failures
You get Timed out retrying or Expected to find element without a clear cause

Usage

Debug these failing Cypress tests
Why did these Cypress tests fail?
Analyze cypress/reports/
Cypress tests pass locally but fail in CI

14 Root Cause Categories

#	Category	Signals
F1	Flaky / Timing	`Timed out retrying`, passes on retry
F2	Selector Broken	`Expected to find element`, `cy.get() failed`
F3	Network Dependency	`cy.intercept()` not matched, `XHR failed`
F4	Assertion Mismatch	`expected X to equal Y`, `AssertionError`
F5	Missing Then	Action completed but wrong state remains
F6	Condition Branch Missing	Element conditionally present, assertion always runs
F7	Test Isolation Failure	Passes alone, fails in suite
F8	Environment Mismatch	CI vs local only; baseUrl, viewport, OS
F9	Data Dependency	Missing seed data, `cy.fixture()` mismatch
F10	Auth / Session	`cy.session()` expired, role-based UI not rendered
F11	Async Order Assumption	`.then()` chain order, parallel `cy.request()` race
F12	Selector Drift	DOM changed, custom command or POM selector not updated
F13	Error Swallowing	`cy.on('uncaught:exception', () => false)` hiding failures
F14	Animation Race	Element visible but content not yet rendered

Debug Workflow

Extract — parse mochawesome.json or JUnit XML for failed tests, error messages, duration
Classify — map each failure to F1–F14 using error signals (most failures resolved here)
Screenshot/Video — if still unclear, inspect cypress/screenshots/ and cypress/videos/
Fix — concrete code suggestion per failure, P0/P1/P2 priority

Compatibility

playwright-test-generator — Playwright only. Generates tests for any project with a playwright.config.ts. Requires Playwright CLI or agent-browser tools for live exploration.

e2e-reviewer — Framework-agnostic. Covers Playwright, Cypress, and Puppeteer.

playwright-debugger — Playwright only. Parses results.json and trace.zip from playwright-report/.

cypress-debugger — Cypress only. Parses mochawesome.json or JUnit XML from cypress/reports/.

License

Apache-2.0 — same as anthropics/skills.

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
.claude-plugin		.claude-plugin
skills		skills
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE.txt		LICENSE.txt
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

e2e-skills

Installation

Skill 1: `playwright-test-generator` — Test Generation

When to Use

Usage

Pipeline

Key Behaviors

Skill 2: `e2e-reviewer` — Quality Review

When to Use

Usage

10 Patterns Detected

Tier 1 — P0/P1 (always check)

Tier 2 — P1/P2 (check when time permits)

References

Review Workflow

Skill 3: `playwright-debugger` — Playwright Failure Debugger

When to Use

Usage

14 Root Cause Categories

Debug Workflow

Skill 4: `cypress-debugger` — Cypress Failure Debugger

When to Use

Usage

14 Root Cause Categories

Debug Workflow

Compatibility

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

e2e-skills

Installation

Skill 1: playwright-test-generator — Test Generation

When to Use

Usage

Pipeline

Key Behaviors

Skill 2: e2e-reviewer — Quality Review

When to Use

Usage

10 Patterns Detected

Tier 1 — P0/P1 (always check)

Tier 2 — P1/P2 (check when time permits)

References

Review Workflow

Skill 3: playwright-debugger — Playwright Failure Debugger

When to Use

Usage

14 Root Cause Categories

Debug Workflow

Skill 4: cypress-debugger — Cypress Failure Debugger

When to Use

Usage

14 Root Cause Categories

Debug Workflow

Compatibility

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Skill 1: `playwright-test-generator` — Test Generation

Skill 2: `e2e-reviewer` — Quality Review

Skill 3: `playwright-debugger` — Playwright Failure Debugger

Skill 4: `cypress-debugger` — Cypress Failure Debugger

Packages