Skip to content

Backfill IRS SOI source packages to 2015 for Microplex PUF uprating #51

@anth-volk

Description

@anth-volk

Context

microplex-us currently needs a historical PE-style SOI target surface for PUF uprating, with values back to 2015 and forward through the PE-SOI/factor boundary used by the rebuild pipeline.

In PolicyEngine/microplex-us#155, the short-term fix keeps using a pinned PolicyEngine/policyengine-us-data repo artifact: policyengine_us_data/storage/calibration_targets/soi_targets.csv. That avoids depending on a local PE-US-data checkout or the PE-US-data Python package, but it is still a construction-time dependency on PE-US-data-owned SOI data.

Longer term, Arch should own the source-backed IRS SOI facts and provenance, and microplex-us should adapt/export those facts into the simulator-specific PE-style SOI target surface used by PUF uprating.

Goal

Add full Arch support for the IRS SOI source data needed by Microplex PUF uprating back to 2015.

Scope

  • Backfill the relevant IRS SOI source packages for years 2015 onward.
  • Cover the SOI measures needed to reproduce the PE-style long target table currently consumed by microplex-us PUF uprating.
  • Preserve source-level provenance, table/line metadata, units, periods, and AGI-stub semantics in Arch facts.
  • Add tests that verify representative facts for each covered SOI package/year range.
  • Document any source-definition breaks across years, especially line/table renames or AGI-stub changes.

Non-goals

  • Arch should not choose active Microplex calibration targets or bake in PolicyEngine variable names.
  • Arch should not own PE-style aging/reconciliation policy.
  • The Microplex-specific export/adapter can live in microplex-us; this issue is about making the underlying historical SOI fact coverage available in Arch.

Acceptance criteria

  • Arch can build source-backed SOI facts for the required Microplex PUF-uprating horizon beginning in 2015.
  • The resulting facts contain enough information for microplex-us to replace its pinned PE-US-data soi_targets.csv dependency.
  • Tests cover at least one current year and one historical year for each newly extended SOI package.
  • Any missing or non-comparable historical SOI lines are explicitly documented with source notes.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions