What
Add cross-source inconsistency flagging as a first-class Ledger diagnostic — reported, never reconciled.
Ledger rightly refuses to reconcile sources. Flagging that two source-backed claims disagree is not reconciling, and Ledger is the only layer where both sources coexist with enough structure (semantic fact keys, universe constraints, periods, assertions) to check. Consumers then decide.
Motivating failures this would have caught
- The May-19 eCPS calibration break (policyengine-us-data#1107 / the Groundwork incident): in the $500k+ AGI bracket, the calibrated amount tripled while the count fell — a count-versus-amount consistency violation visible in the target data itself.
- SOI TY2022 itemized levels versus TPC's FY2026 projection (~$99B vs ~$130B on the RCC dashboard): a fact-versus-source-projection gap that is expected (aging) but should be measured and reported, not discovered downstream.
Shape
A ledger check-consistency pass over a bundle or consumer artifact that emits a report (never mutates facts):
- Count-versus-amount coherence within a series: for bracketed families with both count and amount measures, flag period-over-period moves where amount and count diverge beyond a threshold in opposite directions.
- Same-semantic-fact disagreement: rows sharing a
semantic_fact_key (or same concept/entity/geography/period universe) from different sources whose values disagree beyond a tolerance — e.g. SOI vs CBO income tax receipts for the same period.
- Observation-versus-source-projection gaps: for series with both (e.g. CBO projected vs Treasury actual receipts), report realized gaps — useful to Thesis directly as forecaster-evaluation data.
Output: a consistency_report.json in bundle/artifact builds with per-finding severity, the fact keys involved, and the comparison arithmetic — same spirit as coverage.json. No reconciliation, no imputation, no edits.
Refs: the facts-only ADR (docs/adr-ledger-facts-only.md), #71, policyengine-us-data#1107.
What
Add cross-source inconsistency flagging as a first-class Ledger diagnostic — reported, never reconciled.
Ledger rightly refuses to reconcile sources. Flagging that two source-backed claims disagree is not reconciling, and Ledger is the only layer where both sources coexist with enough structure (semantic fact keys, universe constraints, periods, assertions) to check. Consumers then decide.
Motivating failures this would have caught
Shape
A
ledger check-consistencypass over a bundle or consumer artifact that emits a report (never mutates facts):semantic_fact_key(or same concept/entity/geography/period universe) from different sources whose values disagree beyond a tolerance — e.g. SOI vs CBO income tax receipts for the same period.Output: a
consistency_report.jsonin bundle/artifact builds with per-finding severity, the fact keys involved, and the comparison arithmetic — same spirit ascoverage.json. No reconciliation, no imputation, no edits.Refs: the facts-only ADR (
docs/adr-ledger-facts-only.md), #71, policyengine-us-data#1107.