Skip to content

Add blog post announcing Enhanced CPS full launch#2708

Merged
MaxGhenis merged 12 commits into
masterfrom
enhanced-cps-full-launch
Aug 8, 2025
Merged

Add blog post announcing Enhanced CPS full launch#2708
MaxGhenis merged 12 commits into
masterfrom
enhanced-cps-full-launch

Conversation

@MaxGhenis
Copy link
Copy Markdown
Contributor

Summary

Announces the full launch of the Enhanced Current Population Survey (Enhanced CPS), marking its graduation from beta status to PolicyEngine's default dataset.

Key features highlighted

New data enhancements

  • Tip income imputation: Using SIPP employer-reported data
  • Overtime premium calculations: Based on hours, occupations, and FLSA status
  • Auto loan interest: Imputed from Survey of Consumer Finances
  • Immigration status: ASEC Undocumented Algorithm implementation

Technical improvements

  • Adoption of new microimpute package for transparent imputation methodology
  • Over 100 calibration targets for validation
  • Enhanced validation dashboard

Upcoming developments

  • State/district calibration: Arnold Ventures-funded work to enable local area analysis
  • Microcalibrate package: Next-generation reweighting approach
  • Additional data: Plans to integrate SCF (assets), CEX (consumption), ACS (housing)

Impact

This launch enables more accurate modeling of recent policy proposals (tips/overtime exemptions) and upcoming local-area policy analysis capabilities.

🤖 Generated with Claude Code

This post announces the graduation from beta to full launch of the Enhanced CPS,
highlighting new features including:
- Tip income imputation from SIPP
- Overtime premium calculations
- Auto loan interest imputation from SCF
- Immigration status imputation using ASEC Undocumented Algorithm
- Adoption of new microimpute package

Also covers upcoming developments:
- State and congressional district calibration (Arnold Ventures funded)
- Migration to microcalibrate package
- Additional data integration plans

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
@vercel
Copy link
Copy Markdown

vercel Bot commented Jul 28, 2025

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name Status Preview Comments Updated (UTC)
policyengine-app ✅ Ready (Inspect) Visit Preview 💬 Add feedback Aug 8, 2025 5:20pm

- Correct calibration targets from "over 100" to "thousands" (9,168 lines)
- Fix microimpute description - it automates imputation, not replaces gradient descent
- Update data download info to mention Hugging Face automatic downloads
- Add links to Arnold Ventures and Nuffield Foundation funding
- Replace broken validation dashboard link with GitHub source code link
- Make Enhanced CPS the default dataset in the app (remove beta label)
- Update switch logic to default to Enhanced CPS when no dataset specified

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
- Change from "thousands" to the exact count of 9,168 administrative totals

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
MaxGhenis and others added 2 commits August 8, 2025 10:59
- Update launch date to January 8, 2025
- Remove timeline references that could make content seem dated
- Add comprehensive links to PolicyEngine US data documentation
- Clarify that SCF and ACS are already integrated for specific uses
- Remove dataset selector from UI as Enhanced CPS is now the default for nationwide analysis
- Note that Enhanced CPS will become default for state analysis once local calibration is complete

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
- Add two-stage methodology description with QRF imputation and PyTorch reweighting
- Include L0 penalty for sparsity in reweighting description
- Add link to methodology flowchart on documentation site
- Expand technical details about optimization approach
- Include data sources for calibration targets (IRS SOI, Census, CBO/Treasury, JCT)

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
- Add detailed description of the data processing pipeline
- Describe integration of five source datasets (CPS ASEC, IRS PUF, SIPP, SCF, ACS)
- Explain the two Enhanced CPS variants created through QRF imputation
- Link to full methodology documentation for visualization

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
- Embed the Enhanced CPS methodology flowchart showing the two-stage process
- Flowchart illustrates data integration from five sources through QRF imputation and reweighting

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
@MaxGhenis MaxGhenis marked this pull request as ready for review August 8, 2025 17:10
Copilot AI review requested due to automatic review settings August 8, 2025 17:10
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR announces the full launch of PolicyEngine's Enhanced Current Population Survey (Enhanced CPS), marking its transition from beta to the default dataset for US policy simulations.

  • Adds a comprehensive blog post detailing new data enhancements including tip income, overtime premiums, auto loan interest, and immigration status imputations
  • Removes the Enhanced CPS dataset selector from the UI since it's now the default dataset
  • Documents upcoming developments including state/district calibration and new technical packages

Reviewed Changes

Copilot reviewed 3 out of 5 changed files in this pull request and generated 2 comments.

File Description
src/posts/posts.json Adds metadata entry for the Enhanced CPS launch blog post
src/posts/articles/enhanced-cps-launch.md Creates comprehensive blog post announcing Enhanced CPS features and methodology
src/pages/policy/PolicyRightSidebar.jsx Removes DatasetSelector component and related UI elements since Enhanced CPS is now default


**Two-stage methodology**: Our approach combines sophisticated imputation with advanced reweighting techniques. First, we use Quantile Regression Forests (QRF) to impute missing variables from multiple data sources, preserving realistic variation and capturing conditional distribution tails. Second, we apply gradient-based optimization with PyTorch to reweight households, matching administrative targets while maintaining the survey's statistical properties.

![Enhanced CPS methodology flowchart showing the two-stage process of imputation and reweighting](/images/posts/enhanced-cps-launch-flowchart.png)
Copy link

Copilot AI Aug 8, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The referenced image /images/posts/enhanced-cps-launch-flowchart.png is not included in this PR. Ensure the image file exists in the correct location before publishing the blog post.

Copilot uses AI. Check for mistakes.
Comment thread src/posts/posts.json
"date": "2025-08-08",
"tags": ["us", "data", "featured"],
"filename": "enhanced-cps-launch.md",
"image": "enhanced-cps-launch.png",
Copy link

Copilot AI Aug 8, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The referenced image enhanced-cps-launch.png is not included in this PR. Verify the image file exists in the expected location for the blog post metadata.

Copilot uses AI. Check for mistakes.
MaxGhenis and others added 2 commits August 8, 2025 13:16
- Update PolicyRightSidebar tests to expect the enhanced_cps_switch element to not be present
- Remove tests that were checking for switch behavior since the selector has been removed

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
@anth-volk
Copy link
Copy Markdown
Collaborator

Just to note: we don't default to using the ECPS on API runs at the moment. We never migrated to that as it was not necessary to get the API running. Also, I believe we said it isn't an issue, but does it matter that if a user clicks an old article link that was run off the standard CPS, they'd now be getting ECPS numbers?

@MaxGhenis
Copy link
Copy Markdown
Contributor Author

That's OK nobody but us is using the microsim API. Old posts are outdated anyway but good flag.

@MaxGhenis MaxGhenis merged commit 5f25eb4 into master Aug 8, 2025
4 checks passed
@github-project-automation github-project-automation Bot moved this from Todo to Done in policyengine-app Aug 8, 2025
@MaxGhenis MaxGhenis deleted the enhanced-cps-full-launch branch August 8, 2025 17:50
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

3 participants