Fix mp-300k target support on HoH AGI, SSI, Medicare, and ACA benchmark families

Parent: #11

Current-main validation after the table-construction fixes shows the first small replacement candidate is no longer primarily failing because of tax-unit/SPM/family fragmentation. Entity structure is now close to eCPS on the measured matched-N diagnostics, but the sound eCPS replacement comparison still fails by a wide margin.

Evidence artifact:

`/Users/maxghenis/CosilicoAI/microplex-us/artifacts/small_asec_acs100k_family_relationship_units_20260529/sound_ecps_replacement_comparison/sound_ecps_replacement_comparison.json`

Manual per-target drilldown:

`/Users/maxghenis/CosilicoAI/microplex-us/artifacts/small_asec_acs100k_family_relationship_units_20260529/sound_ecps_replacement_comparison/target_diagnostics_refit_ecps_to_mp.json`

Current loss result:

- Microplex refit PE-native broad loss: `3.3933854903630642`
- eCPS refit PE-native broad loss: `0.16459032249699052`
- Microplex train/holdout: `2.88954251907915` / `0.5038429754602355`
- eCPS train/holdout: `0.1372832454307687` / `0.027307076904983546`
- target wins: eCPS `2293`, Microplex `496`, ties `29`

Entity structure after #79/#80/#83 is not the main gap anymore:

- Microplex matched tax units/HH: `1.3079`; eCPS: `1.3377`
- Microplex matched SPM units/HH: `1.0000`; eCPS: `1.0441`
- Microplex matched families/HH: `1.0928`; eCPS: `1.1188`
- Microplex family singleton share: `0.3787`; eCPS: `0.4148`
- cross-household family units: `0` for both

Largest target blockers after refit include:

- Head-of-Household IRS AGI/count bins, e.g. AGI `20k-25k`, `25k-30k`, `500k-1m`, and `1m-inf` taxable HoH cells.
- Medicare Part B premium targets by young ages, where eCPS is often zero and MP has large positive estimates.
- SSI recipients and SSI dollar totals, especially age 65+ and CBO SSI.
- State ACA spending and SNAP household families.

The current target diagnostics show examples like:

- `nation/irs/count/count/AGI in 20k-25k/taxable/Head of Household`: target `53,987`, eCPS `22,976`, MP `1,131,244`
- `nation/irs/count/count/AGI in 25k-30k/taxable/Head of Household`: target `131,400`, eCPS `200,375`, MP `2,633,483`
- `nation/irs/count/count/AGI in 500k-1m/taxable/Head of Household`: target `43,007`, eCPS `42,528`, MP `858,975`
- `nation/census/medicare_part_b_premiums/age_10_to_19`: target `11.2M`, eCPS `0`, MP `222.9M`
- `nation/ssa/ssi_recipients/65_plus`: target `2.38M`, eCPS `3.81M`, MP `32.13M`
- `nation/cbo/ssi`: target `$57.0B`, eCPS `$59.3B`, MP `$356.9B`

## Desired direction

Fix these with source/model/target-support improvements, not by relaxing the sound comparison gate or treating eCPS as ground truth. Use eCPS only as a benchmark/control. The model should explain why Microplex is placing weight on these surfaces and whether the support comes from source data, donor imputation, formula-derived outputs, or calibration overfit.

Start with:

1. Add sidecar diagnostics that tie worst target rows to source-support/role surfaces: filing status, AGI bin, SSI recipient/value source, Medicare Part B premium support, ACA state support.
2. Diagnose whether the HoH AGI-bin failures are from remaining tax-unit role support, filing-status formula/output mismatch, AGI construction, or optimizer weight concentration.
3. Diagnose whether SSI/Medicare/ACA failures are exported input issues, formula-derived benchmark targets, donor-source support issues, or calibration target omissions.
4. Rebuild or rematerialize the small ASEC+ACS100k candidate after each fix and rerun the sound matched-N symmetric eCPS comparison.

## Acceptance criteria

- The sound comparison artifact includes the target diagnostics sidecar added in #85.
- Top regressions no longer dominated by HoH AGI-bin count/AGI cells, SSI, Medicare Part B premiums, and ACA spending without a clear source-support explanation.
- Candidate improves on full and holdout loss versus the current-main `3.3934` / `0.5038` baseline under matched-N symmetric refit.
- Protected-family and core-family floors in the mp-300k gate remain active; no gate relaxation is used to pass.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix mp-300k target support on HoH AGI, SSI, Medicare, and ACA benchmark families #86

Desired direction

Acceptance criteria

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Fix mp-300k target support on HoH AGI, SSI, Medicare, and ACA benchmark families #86

Description

Desired direction

Acceptance criteria

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions