Skip to content

v0.4.5 — the audit workflow + findings-inbox v2 (the Journal-Clock Audit)

Choose a tag to compare

@emyann emyann released this 13 Jun 01:10
· 12 commits to main since this release

The drift class hashes can't see, now owned

Freshness proves the bytes a page cites haven't moved; nothing proved the claims were true. Wrongness behind unchanged bytes — the freshness model's openly-admitted blind spot — is live, not theoretical: this release's design tournament found a fresh-stamped page in repolore's own wiki claiming "five masters" where there were ten, surviving four re-stamps.

The architecture was chosen by a 23-agent adversarial design tournament (docs/RESEARCH-AUDIT.md): 5 biased designs, 3 attack lenses each (token-cost, soundness vs a seeded answer key, ADR contract), 2 independent judges, 1 synthesis. The minimalist prompt-contract design won on every axis; its one critical (hot-page starvation via the last_refreshed clock) is fixed by the defining graft — the audit clock is a journal line (ADR-010).

What shipped — zero new vendored scripts

  • references/audit.md + /repolore:audit (plugin) + the sixth umbrella routing row (standalone parity): K-page sessions oldest-audited-first; per-claim verdicts (CONFIRMED = entailment, never "consistent with"); the negative-space rule (always/never claims require grep-enumerating the deciding tokens — comments are never evidence); the byte-capped invariant sweep (catches uncited wrongness); hard budgets (evidence ≤3× page bytes; exhaustion never demotes).
  • The one record: ## date — audited <category/slug> (N claims: …) in log.md — committed at bless time; due-list and the check workflow's dust line are parsed from it per run, written nowhere (ADR-002's split). Hook, exit codes, CI: byte-for-byte untouched.
  • Findings-inbox v2: invisible per-item blob-SHA anchors (<!-- repolore:sha=… captured=… -->), unanchored absence findings, recording-commit backfill, four deleting triage exits. The findings-check script stays deferred behind ADR-009's own revisit gate — ratified explicitly in ADR-010.
  • Honest cost (measured): ~65–70K tokens for a full 15-page audit (~100K ceiling), ~45–55K/month steady-state at 50 pages, linear scaling.

Dogfooded before shipping

The tournament's dry-runs caught 3/3 seeded falsehoods with zero false positives and found two genuine defects in this repo's wiki (fixed in v0.4.4→v0.4.5 interim). This release ships with the first real audited page: decisions/adr-002-computed-status, 10/10 claims confirmed via the full protocol, one ±10 silent citation repair, journal line minted — and the dust line parses it.