Microplex path to replace eFRS
This is the operational plan for replacing PolicyEngine's Enhanced FRS (eFRS)
with Microplex-produced UK datasets.
The plan follows the US eCPS replacement framing but uses UK-specific source
and target facts. Dataset names should be record-count tiers, not use-case
labels:
- mp-30k-uk: routine UK microsimulation tier, roughly current eFRS scale.
- mp-300k-uk: larger candidate tier for stronger regional/local support.
- mp-3m-uk: future parent universe for sparse/distilled UK tiers if source access and compute make it useful.
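As a minimal sketch of how the record-count naming convention could be checked mechanically, the snippet below parses a tier name into a nominal record count and a country code. The helper name `parse_tier` and the regular expression are hypothetical and are not part of any Microplex or PolicyEngine API; the only thing taken from the plan is the `mp-<count>-<country>` pattern visible in the three tier names.

```python
import re

# Hypothetical pattern for tier names such as "mp-30k-uk" or "mp-3m-uk".
_TIER_RE = re.compile(r"^mp-(\d+)([km])-([a-z]{2})$")
_SCALE = {"k": 1_000, "m": 1_000_000}


def parse_tier(name: str) -> tuple[int, str]:
    """Return (nominal record count, country code) for a tier name."""
    match = _TIER_RE.match(name)
    if match is None:
        raise ValueError(f"not a record-count tier name: {name!r}")
    count, unit, country = match.groups()
    return int(count) * _SCALE[unit], country


# Order the tiers named in the plan from smallest to largest.
tiers = ["mp-3m-uk", "mp-30k-uk", "mp-300k-uk"]
ordered = sorted(tiers, key=lambda n: parse_tier(n)[0])
```

A use-case label would fail the pattern and raise `ValueError`, which is one cheap way to enforce the naming rule in a release script.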
The long-run construction path should be hierarchical: build the largest
defensible UK candidate universe, then use L0 sparsity and post-fit pruning to
derive smaller tiers. The first mp-30k-uk candidate may be a directly sampled
or directly fused build, but that should be treated as an initial benchmark
candidate rather than the permanent construction contract.
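To make the pruning idea concrete, here is a rough sketch of one very simple form of post-fit pruning: keep the largest weights, zero the rest, and rescale so the weighted population total is unchanged. This is plain magnitude pruning chosen for illustration, not the L0-regularised fit itself and not necessarily how Microplex derives tiers; the function name `prune_weights` is hypothetical, and a real pipeline would presumably recalibrate against targets after pruning.

```python
def prune_weights(weights: list[float], keep: int) -> list[float]:
    """Zero all but the `keep` largest weights, preserving the weight total.

    Illustrative magnitude pruning only (an assumption, not the plan's method).
    """
    if not 0 < keep <= len(weights):
        raise ValueError("keep must be between 1 and len(weights)")
    total = sum(weights)
    # Indices of the `keep` largest weights.
    top = sorted(range(len(weights)), key=lambda i: weights[i], reverse=True)[:keep]
    kept = set(top)
    kept_total = sum(weights[i] for i in kept)
    if kept_total <= 0:
        raise ValueError("retained weights must have a positive total")
    scale = total / kept_total
    return [w * scale if i in kept else 0.0 for i, w in enumerate(weights)]


parent = [5.0, 1.0, 3.0, 1.0]
child = prune_weights(parent, keep=2)
```

Here `child` keeps two records and still sums to the parent total of 10.0, so population-level counts survive even though the record count halves.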
Source architecture
The UK source analogs are:
microplex-uk already has source adapters and benchmark scaffolding for FRS,
SPI, and WAS. The replacement work should push that spike into a production
pipeline.
Replacement gates
mp-30k-uk can replace eFRS as the routine UK dataset when all of these are
true:
- policyengine-uk-data commit, policyengine-uk version, target source
  paths, and target source hashes.
- current policyengine-uk loader, including household, benefit-unit, person,
  and tax-unit IDs, joins, weights, periods, dtypes, and missing-value
  conventions.
- National Insurance, Universal Credit, child benefit, pension credit, housing
  support, disability benefits, household net income, wealth, rent, and
  council-tax-relevant variables.
- tax, National Insurance, Universal Credit, child benefit, pension credit,
  housing costs, disability benefits, and at least one Scotland/Wales/devolved
  variation.
- 1.25x eFRS for default release. A beta can be slower if explicitly labeled.
- deltas, record counts, nonzero weights, effective sample size, H5 size, and
  benchmark outputs.
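Several of the quantities named in the gates (record counts, nonzero weights, effective sample size) can be computed directly from a weight vector. The sketch below assumes "effective sample size" means the Kish approximation, (sum of weights)^2 / (sum of squared weights); the plan does not say which definition it uses, and the function name `weight_diagnostics` is hypothetical.

```python
def weight_diagnostics(weights: list[float]) -> dict[str, float]:
    """Summarise a weight vector for a release report.

    Effective sample size uses the Kish formula, which is an assumption here.
    """
    nonzero = [w for w in weights if w > 0]
    total = sum(nonzero)
    sum_sq = sum(w * w for w in nonzero)
    ess = (total * total) / sum_sq if sum_sq > 0 else 0.0
    return {
        "record_count": len(weights),
        "nonzero_weights": len(nonzero),
        "effective_sample_size": ess,
    }


equal = weight_diagnostics([1.0, 1.0, 1.0, 1.0])
skewed = weight_diagnostics([2.0, 2.0, 0.0, 4.0])
```

With equal weights the effective sample size equals the record count; skewed or sparse weights pull it below the nonzero count, which is why it is worth reporting alongside both.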
Workstreams
1. Build first mp-30k-uk candidate
Deliverables:
Exit:
mp-30k-uk versus pinned eFRS with target deltas.
2. Harden UK target registry
Deliverables:
specs
variables
Exit:
3. Add UK H5 compatibility checks
Deliverables:
policyengine-uk H5 loading
Exit:
4. UK microsimulation benchmark suite
Deliverables:
where applicable, and component deltas
Exit:
mp-30k-uk candidate has a comparable report against eFRS.
5. Scale to mp-300k-uk
Deliverables:
mp-30k-uk
Exit:
mp-300k-uk improves the frontier, or the dashboard clearly marks it as an
experimental tier.
6. Publish and switch
Deliverables:
policyengine-uk-data publication path for mp-30k-uk beta artifact
policyengine-uk default-switch and rollback plan
Exit:
mp-30k-uk through the normal UK dataset interface and
switch defaults only after gates pass.