Skip to content

Fix mp-300k target support on HoH AGI, SSI, Medicare, and ACA benchmark families #86

@MaxGhenis

Description

@MaxGhenis

Parent: #11

Current-main validation after the table-construction fixes shows the first small replacement candidate is no longer primarily failing because of tax-unit/SPM/family fragmentation. Entity structure is now close to eCPS on the measured matched-N diagnostics, but the sound eCPS replacement comparison still fails by a wide margin.

Evidence artifact:

/Users/maxghenis/CosilicoAI/microplex-us/artifacts/small_asec_acs100k_family_relationship_units_20260529/sound_ecps_replacement_comparison/sound_ecps_replacement_comparison.json

Manual per-target drilldown:

/Users/maxghenis/CosilicoAI/microplex-us/artifacts/small_asec_acs100k_family_relationship_units_20260529/sound_ecps_replacement_comparison/target_diagnostics_refit_ecps_to_mp.json

Current loss result:

  • Microplex refit PE-native broad loss: 3.3933854903630642
  • eCPS refit PE-native broad loss: 0.16459032249699052
  • Microplex train/holdout: 2.88954251907915 / 0.5038429754602355
  • eCPS train/holdout: 0.1372832454307687 / 0.027307076904983546
  • target wins: eCPS 2293, Microplex 496, ties 29

Entity structure after #79/#80/#83 is not the main gap anymore:

  • Microplex matched tax units/HH: 1.3079; eCPS: 1.3377
  • Microplex matched SPM units/HH: 1.0000; eCPS: 1.0441
  • Microplex matched families/HH: 1.0928; eCPS: 1.1188
  • Microplex family singleton share: 0.3787; eCPS: 0.4148
  • cross-household family units: 0 for both

Largest target blockers after refit include:

  • Head-of-Household IRS AGI/count bins, e.g. AGI 20k-25k, 25k-30k, 500k-1m, and 1m-inf taxable HoH cells.
  • Medicare Part B premium targets by young ages, where eCPS is often zero and MP has large positive estimates.
  • SSI recipients and SSI dollar totals, especially age 65+ and CBO SSI.
  • State ACA spending and SNAP household families.

The current target diagnostics show examples like:

  • nation/irs/count/count/AGI in 20k-25k/taxable/Head of Household: target 53,987, eCPS 22,976, MP 1,131,244
  • nation/irs/count/count/AGI in 25k-30k/taxable/Head of Household: target 131,400, eCPS 200,375, MP 2,633,483
  • nation/irs/count/count/AGI in 500k-1m/taxable/Head of Household: target 43,007, eCPS 42,528, MP 858,975
  • nation/census/medicare_part_b_premiums/age_10_to_19: target 11.2M, eCPS 0, MP 222.9M
  • nation/ssa/ssi_recipients/65_plus: target 2.38M, eCPS 3.81M, MP 32.13M
  • nation/cbo/ssi: target $57.0B, eCPS $59.3B, MP $356.9B

Desired direction

Fix these with source/model/target-support improvements, not by relaxing the sound comparison gate or treating eCPS as ground truth. Use eCPS only as a benchmark/control. The model should explain why Microplex is placing weight on these surfaces and whether the support comes from source data, donor imputation, formula-derived outputs, or calibration overfit.

Start with:

  1. Add sidecar diagnostics that tie worst target rows to source-support/role surfaces: filing status, AGI bin, SSI recipient/value source, Medicare Part B premium support, ACA state support.
  2. Diagnose whether the HoH AGI-bin failures are from remaining tax-unit role support, filing-status formula/output mismatch, AGI construction, or optimizer weight concentration.
  3. Diagnose whether SSI/Medicare/ACA failures are exported input issues, formula-derived benchmark targets, donor-source support issues, or calibration target omissions.
  4. Rebuild or rematerialize the small ASEC+ACS100k candidate after each fix and rerun the sound matched-N symmetric eCPS comparison.

Acceptance criteria

  • The sound comparison artifact includes the target diagnostics sidecar added in Add target diagnostics to eCPS comparison #85.
  • Top regressions no longer dominated by HoH AGI-bin count/AGI cells, SSI, Medicare Part B premiums, and ACA spending without a clear source-support explanation.
  • Candidate improves on full and holdout loss versus the current-main 3.3934 / 0.5038 baseline under matched-N symmetric refit.
  • Protected-family and core-family floors in the mp-300k gate remain active; no gate relaxation is used to pass.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions