Add TRACE case study writeup for AEA / TRACE grant team by MaxGhenis · Pull Request #315 · PolicyEngine/policyengine.py

MaxGhenis · 2026-04-21T15:39:00Z

Summary

New `docs/trace-case-study.md` working draft covering the PolicyEngine TRACE use case.
Structured around the reframe from the 2026-04-21 meeting with Lars Vilhuber (AEA Data Editor), Tara Watson, John Sabelhaus, Tim Clark, and Casper: TRACE wraps PolicyEngine infrastructure (us-data build pipeline, policyengine.org webapp runs) rather than being embedded in the end-user Python package.
Explicit link to the three implementation workstreams: us-data TROs (PR #746, shipped), policyengine-api#3485 (webapp-run TRO emission, in progress), policyengine-app#2830 ("Cite this result" UI, blocked on api).
Flags three open questions: per-household frame as default, durable URL / retention, signing and key rotation.

Why

Lars asked for this kind of writeup during the meeting: "expressing those things and seeing exactly what is the value you get from trace or using it in a particular way for your use case is gold." They are designing TRACE vocabulary and submitting a grant proposal on this topic, and a concrete case study is directly useful to that work.

Test plan

`changelog.d` fragment added
Reviewer QA on the content itself — framing, correctness, tone
After merge, share the file URL with Lars + TRACE team

🤖 Generated with Claude Code

Working draft describing the PolicyEngine use case for TRACE, prepared after the 2026-04-21 meeting with Lars Vilhuber, Tara Watson, John Sabelhaus, Tim Clark, and Casper. Structured around the reframe that emerged in the meeting: TRACE should wrap PolicyEngine infrastructure (the us-data build pipeline and policyengine.org webapp runs) rather than be embedded in the end-user Python package. Covers: - Which PolicyEngine surfaces warrant institutional certification - The precise claims a TRO lets us make (rules, data, reform, inputs, outputs including per-household frame, institutional attestation) - UK data as the strongest TRACE case for us - Three concrete implementation workstreams with linked issues - What TRACE gets from us as a case study (infrastructure-certifying vs author-certifying; microdata provenance; pe:* extension discipline) - Three open questions (per-household frame default, retention and durable addressing, signing and key rotation) Lars explicitly asked for this kind of writeup during the meeting to feed the TRACE grant proposal and vocabulary design work. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

…and adjacent work, clarify institution-backed self-attestation Codex review of the 2026-04-21 meeting transcript vs. this writeup flagged four issues: 1. UK was oversold as 'the strongest' or 'only' TRACE case. Transcript supports 'a strong case' but not 'the strongest' — and we are considering a recalibrated UK variant that would partly lift the restriction anyway. 2. Missing explicit non-scenario section. The meeting was emphatic that researcher-laptop TRO emission, transitive dep tracing, and plain version-identification are NOT TRACE's job for us. 3. Missing adjacent workstreams that came up but are not TRACE-solved: preservation-grade archiving (HuggingFace vs Zenodo), PolicyEngine- specific TRACE vocabulary contribution, and non-TRACE version- identification work (Casper's point). 4. 'Institutional certification' language oversold what PolicyEngine actually provides. An institution certifying its own runs 'carries technically no difference' from an author certifying their own runs; the value comes from institutional reputation and structured evidence, not from cryptographic equivalent of arms-length independence. Also: back off the per-household frame as 'the highest-value downstream artifact' claim the transcript doesn't support; flag it as open design question. Drop 'transitive Python deps' from the rules-bundle section per transcript explicitly saying TRACE has not built that in. Add three additional open-question items (retention + preservation, key trust model, production-runtime binding) surfaced by codex. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Codex flagged that the writeup slipped from 'claims we want to make' into 'claims we can make now' for service-account signatures, durable URLs, and per-household frames — three things the transcript does not actually settle. Changes: - Reframe section title from 'The precise claims a PolicyEngine TRO lets us make' to 'The claims a PolicyEngine TRO should let us make'. Every present-tense claim about what a TRO 'lets us' do is softened to what a TRO 'would let us' do, conditional on the design questions still being settled. - Per-household frame: drop the 'for US runs the TRO includes the full frame' assertion; replace with explicit open-design-question framing. Cite the transcript exchange for traceability. - Signing mechanism: remove the claim that a service-account signature is the answer. List service-account + DNS-keychain + Sigstore as options under consideration. - Institutional-attestation claim gains a caveat that the service- account signature is 'one implementation, not the only one.' - Workstream list for policyengine-api#3485 is rewritten from 'signed by a PolicyEngine service account, persisted to GCS with durable URL' (asserts design decisions that have not been made) to explicitly naming the strawman and the alternatives. - The two workstreams the writeup describes gain an explicit live / not-yet-live marker: us-data build TRO emission is live (us-data#746 shipped); webapp-run emission + Cite UI is not (api#3485, app#2830, api#3486). The open-questions section already handled this correctly; this change aligns the main body with that section so the writeup is internally consistent. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

MaxGhenis and others added 2 commits April 21, 2026 11:38

This was referenced Apr 21, 2026

Contribute PolicyEngine vocabulary patterns to TRACE / TROv #316

Open

Add preservation-mirror fields to DataReleaseManifest #317

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add TRACE case study writeup for AEA / TRACE grant team#315

Add TRACE case study writeup for AEA / TRACE grant team#315
MaxGhenis wants to merge 3 commits intomainfrom
trace-case-study-for-aea

MaxGhenis commented Apr 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

MaxGhenis commented Apr 21, 2026

Summary

Why

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant