-
Notifications
You must be signed in to change notification settings - Fork 1
Pipeline Plan 20
Seth Ford edited this page Feb 13, 2026
·
4 revisions
Here's the implementation plan:
| File | Action | Purpose |
|---|---|---|
scripts/sw-integration-test.sh |
Create | Main integration test script (smoke + live tests) |
templates/pipelines/integration.json |
Create | Minimal pipeline template (intake → build → test, 3 iterations) |
.github/workflows/integration-test.yml |
Create | CI workflow: smoke on every PR, live tests with budget cap |
package.json |
Modify | Add test:integration and test:integration:live scripts |
.github/workflows/test.yml |
Modify | Add integration smoke tests as parallel job |
Minimal 3-stage template: intake → build → test. Model: sonnet. Max 3 iterations. No audit/quality gates/PR stage.
Uses the existing test harness pattern (PASS/FAIL counters, ERR trap, colored output). Mock binaries but runs real pipeline orchestration:
-
Dry-run smoke — verify
--dry-runexits 0, outputs "Dry run", creates.claude/dirs - Stage ordering — mock pipeline runs intake → build → test in order, state file updated
- State file format — validate required fields, stage progress, timestamps
- Budget enforcement — $0 budget causes clean exit, not crash
- README change — real Claude modifies a README in a temp repo; verify git diff shows changes, clean status
- PR creation — full fast pipeline creates a real PR; verify PR exists and branch is clean
- Budget cap — live run stays under $1.00; cost tracking confirms
-
smokejob: every PR, no secrets needed, runs smoke tests -
livejob: PRs withANTHROPIC_API_KEY, budget=$1.00, 15min timeout -
regressionjob: push to main, re-runs live tests post-merge - All jobs write per-stage results to
$GITHUB_STEP_SUMMARY
- Create
templates/pipelines/integration.json - Create
scripts/sw-integration-test.shwith Tier 1 smoke tests - Add Tier 2 live integration tests (gated behind env vars)
- Create
.github/workflows/integration-test.yml - Update
package.jsonwith new scripts - Update
.github/workflows/test.ymlwith smoke job - Add CI summary reporting (
$GITHUB_STEP_SUMMARY) - Run smoke tests locally — all pass
- Verify
npm run test:integrationworks - Update CLAUDE.md documentation
-
npm run test:integrationexits 0 (smoke tests, no API key needed) -
npm run test:integration:liveruns both tiers when API key present - CI runs smoke on every PR, live when secrets available, regression on merge
- Budget hard-capped at $1.00 per live run
- Per-stage pass/fail in CI summary
- Existing
npm teststill passes