
About taxdata Repository

This repository prepares data used in the Tax-Calculator repository.

The data files produced here, all in CSV format, form two different sets of input files for Tax-Calculator:

  • A set based on a recent IRS-SOI Public Use File (PUF)

  • A set based on recent Census Current Population Survey (CPS) data

Because use of the PUF data is restricted, neither the IRS-SOI-supplied PUF file nor the puf.csv data file produced here is included in the taxdata or the Tax-Calculator repository.

Each of these two sets of data files contains four files:

  1. a sample data file containing variables for each tax filing unit;

  2. a factors file containing annual variable extrapolation factors;

  3. a weights file containing annual weights for each filing unit;

  4. a ratios file containing annual adjustment ratios for some variables.

Note that the factors file is the same in both sets of data files because the variable extrapolation factors are independent of the sample data being used. But the weights and ratios files do depend on the data file, so they are different in the two sets of data files.
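
The relationship between the two sets can be sketched in Python as follows. This is only an illustration of the structure described above, not code from this repository: the `DataFileSet` class and the `shared_files` helper are hypothetical, and every file name other than puf.csv is an assumed placeholder.

```python
from dataclasses import astuple, dataclass


@dataclass(frozen=True)
class DataFileSet:
    """The four CSV files that make up one set of Tax-Calculator input data."""
    sample: str   # variables for each tax filing unit
    factors: str  # annual variable extrapolation factors
    weights: str  # annual weights for each filing unit
    ratios: str   # annual adjustment ratios for some variables


# Assumed placeholder names; only puf.csv is named in this README.
FACTORS = "growfactors.csv"
PUF_SET = DataFileSet("puf.csv", FACTORS, "puf_weights.csv", "puf_ratios.csv")
CPS_SET = DataFileSet("cps.csv", FACTORS, "cps_weights.csv", "cps_ratios.csv")


def shared_files(a, b):
    """Return the sorted file names that appear in both sets."""
    return sorted(set(astuple(a)) & set(astuple(b)))
```

With these placeholder names, `shared_files(PUF_SET, CPS_SET)` contains only the factors file, which mirrors the point that the weights and ratios files are specific to each sample data file.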

Data-Preparation Documentation

IRS-SOI Public Use File (PUF) documentation is available here:

  1. PUF-based sample data;

  2. grow factors;

  3. PUF-based sample weights;

  4. PUF-based adjustment ratios.

Census Current Population Survey (CPS) documentation is available here:

  1. CPS-based sample data;

  2. grow factors;

  3. CPS-based sample weights;

  4. CPS-based adjustment ratios.

Work-Flow Documentation

The sequence of operations required to make the two sets of data files is contained in the csvmake bash script, which also automates the preparation work-flow (except on Windows).

The sequence of operations required to install the two sets of data files in the Tax-Calculator repository is contained in the csvcopy bash script, which also automates the installation work-flow (except on Windows).
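
For readers who want to drive both scripts from Python, the order of the two steps might be sketched as below. This is a minimal sketch under stated assumptions: it assumes each script is run with `bash` from the top-level directory of this repository and takes no arguments, which this README does not confirm. The `workflow_commands` and `run_workflow` functions are hypothetical helpers, not part of the repository.

```python
import platform
import subprocess


def workflow_commands(system=None):
    """Return the two workflow commands in order: make the files, then copy them.

    Assumption: each script is invoked as `bash <script>` with no arguments.
    """
    system = system or platform.system()
    if system == "Windows":
        # The README states that the automated workflow is not available on Windows.
        raise RuntimeError("csvmake and csvcopy are bash scripts; not supported on Windows")
    return [["bash", "csvmake"], ["bash", "csvcopy"]]


def run_workflow(runner=subprocess.run):
    """Run each step in order, stopping at the first failure (check=True)."""
    for command in workflow_commands():
        runner(command, check=True)
```

Calling `run_workflow()` from the repository's top-level directory would run csvmake first and csvcopy only if csvmake succeeds.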

Contributors

  • John O'Hare
  • Amy Xu
  • Anderson Frailey
  • Martin Holmer