You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Current-main validation after the table-construction fixes shows the first small replacement candidate is no longer primarily failing because of tax-unit/SPM/family fragmentation. Entity structure is now close to eCPS on the measured matched-N diagnostics, but the sound eCPS replacement comparison still fails by a wide margin.
Fix these with source/model/target-support improvements, not by relaxing the sound comparison gate or treating eCPS as ground truth. Use eCPS only as a benchmark/control. The model should explain why Microplex is placing weight on these surfaces and whether the support comes from source data, donor imputation, formula-derived outputs, or calibration overfit.
Start with:
Add sidecar diagnostics that tie worst target rows to source-support/role surfaces: filing status, AGI bin, SSI recipient/value source, Medicare Part B premium support, ACA state support.
Diagnose whether the HoH AGI-bin failures are from remaining tax-unit role support, filing-status formula/output mismatch, AGI construction, or optimizer weight concentration.
Diagnose whether SSI/Medicare/ACA failures are exported input issues, formula-derived benchmark targets, donor-source support issues, or calibration target omissions.
Rebuild or rematerialize the small ASEC+ACS100k candidate after each fix and rerun the sound matched-N symmetric eCPS comparison.
Top regressions no longer dominated by HoH AGI-bin count/AGI cells, SSI, Medicare Part B premiums, and ACA spending without a clear source-support explanation.
Candidate improves on full and holdout loss versus the current-main 3.3934 / 0.5038 baseline under matched-N symmetric refit.
Protected-family and core-family floors in the mp-300k gate remain active; no gate relaxation is used to pass.
Parent: #11
Current-main validation after the table-construction fixes shows the first small replacement candidate is no longer primarily failing because of tax-unit/SPM/family fragmentation. Entity structure is now close to eCPS on the measured matched-N diagnostics, but the sound eCPS replacement comparison still fails by a wide margin.
Evidence artifact:
/Users/maxghenis/CosilicoAI/microplex-us/artifacts/small_asec_acs100k_family_relationship_units_20260529/sound_ecps_replacement_comparison/sound_ecps_replacement_comparison.jsonManual per-target drilldown:
/Users/maxghenis/CosilicoAI/microplex-us/artifacts/small_asec_acs100k_family_relationship_units_20260529/sound_ecps_replacement_comparison/target_diagnostics_refit_ecps_to_mp.jsonCurrent loss result:
3.39338549036306420.164590322496990522.88954251907915/0.50384297546023550.1372832454307687/0.0273070769049835462293, Microplex496, ties29Entity structure after #79/#80/#83 is not the main gap anymore:
1.3079; eCPS:1.33771.0000; eCPS:1.04411.0928; eCPS:1.11880.3787; eCPS:0.41480for bothLargest target blockers after refit include:
20k-25k,25k-30k,500k-1m, and1m-inftaxable HoH cells.The current target diagnostics show examples like:
nation/irs/count/count/AGI in 20k-25k/taxable/Head of Household: target53,987, eCPS22,976, MP1,131,244nation/irs/count/count/AGI in 25k-30k/taxable/Head of Household: target131,400, eCPS200,375, MP2,633,483nation/irs/count/count/AGI in 500k-1m/taxable/Head of Household: target43,007, eCPS42,528, MP858,975nation/census/medicare_part_b_premiums/age_10_to_19: target11.2M, eCPS0, MP222.9Mnation/ssa/ssi_recipients/65_plus: target2.38M, eCPS3.81M, MP32.13Mnation/cbo/ssi: target$57.0B, eCPS$59.3B, MP$356.9BDesired direction
Fix these with source/model/target-support improvements, not by relaxing the sound comparison gate or treating eCPS as ground truth. Use eCPS only as a benchmark/control. The model should explain why Microplex is placing weight on these surfaces and whether the support comes from source data, donor imputation, formula-derived outputs, or calibration overfit.
Start with:
Acceptance criteria
3.3934/0.5038baseline under matched-N symmetric refit.