CI Gates

CI gates reference

Every gate that runs on push to main and on every pull_request:, with the invariant each one proves and the script behind it. Three per-OS workflows run in parallel. Run the deterministic subset locally in one command before every commit:

bash scripts/check-all.sh

⚡ Quick Reference

Workflow	Runs on	Jobs
`[T] Linux Tests`	`ubuntu-latest`	install-smoke + adapter-parity + validate + check-references + check-wiki + unit tests + verify-v4 + syntax + dogfood-workflows
`[T] Mac Tests`	`macos-latest`	install-smoke + validate + unit tests + verify-v4 + syntax (both shells)
`[T] Windows Tests`	`windows-latest`	install-smoke (pwsh) + validate + pwsh syntax

What each gate proves

Gate	Invariant	Script
install-smoke	Fresh install succeeds; re-run is idempotent; `--update` refreshes managed files but preserves user edits to `wiki/` and `AGENTS.md`; test infra never propagates to scratch.	`scripts/smoke-install-bash.sh`, `scripts/smoke-install-pwsh.ps1`
post-install integrity	Hook-command paths resolve; every `.sh`/`.ps1` parses; bash installer produces bash commands, pwsh installer produces pwsh commands; `settings.json` has the expected schema; `.harness` state files are valid.	`scripts/check-integrity-bash.sh`, `scripts/check-integrity-pwsh.ps1`
adapter-parity	Every adapter ships the canonical set of phase-commands, sub-agents, and skills.	`scripts/check-parity.sh`
validate	Every TOML, YAML frontmatter, and JSON across `adapters/` and `templates/` parses and has required keys.	`scripts/validate-adapters.py`
check-references	Every `harness/<phases\|agents\|skills\|pipelines>/*.md` mentioned in an adapter file exists; phase-spec "dispatch the `<name>` sub-agent / invoke the `<name>` skill" lines point at a canonical spec; `settings-fragment-{bash,pwsh}.json` have matching schemas.	`scripts/check-references.py`
check-wiki	Diátaxis structural rules (a–k): mode purity, ADR append-only + `Status: accepted\|superseded\|rejected`, orphan-link detection, globally-unique filenames, no banned-headings-per-mode. Runs `--strict` (blocks PRs) when `wiki/.diataxis` is present; warn-only otherwise. Shipped in v0.9.0 as part of the Diátaxis rollout (ADR 0004).	`scripts/check-wiki.py`
syntax	`bash -n` on every `.sh`; PowerShell AST parse on every `.ps1` across repo root + `scripts/` + `templates/` + `adapters/`.	`scripts/check-syntax.sh`, `scripts/check-syntax.ps1`
unit tests	Every `scripts/test_*.py` (auto-discovered) passes — the memory-script logic in isolation.	`(cd scripts && python3 -m unittest discover -p 'test_*.py')`
check-lib-parity	`lib/install/` matches the committed checksums (byte-identical across agentm + crickets).	`scripts/check-lib-parity.sh`
check-no-pii	The regex PII scanner finds no personal info across the tree (this is a public repo).	`scripts/check-no-pii.sh`
verify-v4	The V4 #23 auto-orchestration push surface works end-to-end (briefing · idle · phase-dispatch · nudges · config/state) against a throwaway scratch vault — see below. Linux/Mac only.	`scripts/verify-v4.sh`
dogfood-workflows	Every workflow the harness ships as a template under `templates/.github/workflows/` is active at the repo root, byte-identical to the template.	Inline job in `tests-linux.yml`

`verify-v4.sh` — the push-surface integration check

End-to-end check of the V4 #23 auto-orchestration surface. Runs the real scripts via their CLIs against a throwaway scratch vault and asserts the deterministic outputs — the integration complement to the per-function unit suite.

Property	Detail
Isolation	A `mktemp -d` scratch vault + an exported `IDEAS_SURFACE_PATH` — never reads or writes a real vault.
No side effects	No network, no transcript mining, no sub-agent dispatch; self-cleans via a `trap`.
Coverage	config seed/parse · every briefing signal (inbox · HIGH watchlist · incubator · idea-ledger · staged-adapt · both nudges) · staged-adapt surfaces-and-clears · idle-chain dry-run (ordering + bounded `--max-batches`/`--limit`) · phase-dispatch plans + session-marker resolution (incl. the `ambiguous-session` concurrency guard) · emit gating (shifted-guard + cooldown) · atomic state write.
Out of scope	The real-boot integration (does a live session inject the briefing), the cross-surface read paths, and subjective fatigue calibration — those are the operator dogfood (vault-resident `projects/agentm/_harness/DOGFOOD-V4.md`).
Extend it	Add a `check_*` per new push-surface signal (drive the real script against `$SV`; assert with `assert_contains`/`assert_equals`/`assert_absent`), keeping each check scratch-isolated.

Reading red CI

gh run list --workflow "[T] Linux Tests"    --limit 3
gh run list --workflow "[T] Mac Tests"       --limit 3
gh run list --workflow "[T] Windows Tests"   --limit 3
gh run view <run-id> --log-failed             # drill into the failing step

Red-on-Windows but green-on-POSIX almost always indicates a path-separator or pwsh-host assumption regression. Red-on-all is usually a canonical-spec or adapter-parity drift — try bash scripts/check-parity.sh locally.

Running the gate set locally

One command runs the deterministic battery (unit tests + every check-* gate + verify-v4), prints a PASS/FAIL table, and exits non-zero on any failure:

bash scripts/check-all.sh

It deliberately omits the heavier smoke-install + gitleaks (slow / external tooling) that CI runs on every push — run those directly if you need them: bash scripts/smoke-install-bash.sh (POSIX) / pwsh -NoProfile -File scripts/smoke-install-pwsh.ps1 (Windows). check-all.sh is the maintained source of truth for the local battery — add a gate line as the project grows.

CI Gates

CI gates reference

⚡ Quick Reference

What each gate proves

verify-v4.sh — the push-surface integration check

Reading red CI

Running the gate set locally

Related

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

`verify-v4.sh` — the push-surface integration check