This repository contains the source data for my first, second and third blog posts on college degrees in the US.
I derive data/observations.csv
from the 2016 ACS PUMS data by running code/observations.do
in Stata MP/15.
The data dictionary for data/observations.csv
is as follows.
Variable | Type | Description |
---|---|---|
age |
int | Age of respondent at survey date |
female |
int | Binary indicator for whether respondent is female |
level |
int | Educational attainment level ID as in data/attainment.csv |
field1 , field2 |
int | Degree field IDs as in data/fields.csv |
weight |
dbl | Aggregated sample weight |
I create data/attainment.csv
and data/fields.csv
manually based on the PUMS data dictionary.
All repository content is licensed under the MIT license.