Skip to content

feat: Test Bootstrap + Regression Tests + Coverage Audit (v0.6.0)#136

Merged
garrytan merged 7 commits intomainfrom
garrytan/improve-test-coverage
Mar 17, 2026
Merged

feat: Test Bootstrap + Regression Tests + Coverage Audit (v0.6.0)#136
garrytan merged 7 commits intomainfrom
garrytan/improve-test-coverage

Conversation

@garrytan
Copy link
Owner

Summary

  • 100% test coverage = great vibe coding. gstack now bootstraps test frameworks from scratch, generates regression tests after QA fixes, audits coverage at ship time with quality stars, and tracks test health in retro.
  • New {{TEST_BOOTSTRAP}} resolver: detects runtime, researches frameworks, asks user, installs, writes first real tests, sets up CI/CD, creates TESTING.md + CLAUDE.md test culture
  • Phase 8e.5 regression test generation in /qa and /qa-design-review (CSS-aware variant)
  • Step 3.4 test coverage audit in /ship with ★★★/★★/★ quality scoring and ASCII diagram
  • Test health tracking in /retro (total tests, regression commits, period delta)
  • 2 new P3 TODOs: CI/CD non-GitHub providers, auto-upgrade weak tests

Test Coverage

  • 211 unit tests pass (26 new validation tests + 4 from main merge)
  • 2 new E2E evals: bootstrap flow + coverage audit (~$7.70 to run)
  • All SKILL.md files regenerate cleanly

Pre-Landing Review

No issues found.

TODOS

  • Added: CI/CD generation for non-GitHub providers (P3)
  • Added: Auto-upgrade weak tests ★→★★★ (P3)
  • No items completed in this PR

Test plan

  • bun run gen:skill-docs regenerates all 15 SKILL.md files
  • 211 validation + gen-skill-docs tests pass (0 failures)
  • E2E eval 1: /qa on zero-test project → bootstrap + regression test
  • E2E eval 2: /ship coverage audit → diagram + test generation

🤖 Generated with Claude Code

garrytan and others added 7 commits March 17, 2026 10:41
…alth

- Add {{TEST_BOOTSTRAP}} resolver to gen-skill-docs.ts
- Add Phase 8e.5 regression test generation to /qa and /qa-design-review
- Add Step 3.4 test coverage audit with quality scoring to /ship
- Add test health tracking to /retro
- Add 2 E2E evals (bootstrap + coverage audit)
- Add 26 validation tests
- Update ARCHITECTURE.md placeholder table
- Add 2 P3 TODOs (CI/CD non-GitHub, auto-upgrade weak tests)
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…terns

Step 3.4 now instructs Claude to read full files, trace data flow through
every branch, diagram the execution, and check each branch against tests.
Phase 8e.5 regression tests now trace the bug's codepath before writing
the test, catching adjacent edge cases.
Step 3.4 now covers the full picture: code branches AND user-facing behavior.
Maps user flows (complete journey through the feature), interaction edge cases
(double-click, back button, stale state, slow connection), error states
(what does the user actually see?), and boundary states (zero results,
10k results, max-length input). Coverage diagram splits into Code Path
Coverage and User Flow Coverage sections with separate percentages.
…erage

- Raise Step 3.4 test generation cap from 10 to 20 (code + user flow combined)
- Add 3 validation tests: codepath tracing, user flow mapping, diagram sections
Keep v0.6.0 (our version supersedes main's v0.5.4). Both changelog
entries preserved in chronological order.
@garrytan garrytan merged commit a2d756f into main Mar 17, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant