# Census Field Descriptions
---
- To supplement the "Census Counties ..." datasets, we were given a table that describes each column name under each historical census standard. Fortunately, the 2019 and 2012 census data (the two years we're using) share the same standard: `acs_standard`
- This script does the following:
  - Filters the source dataframe to only include `acs_standard` column descriptions
  - Selects only the necessary columns (column name, description)
  - Renames the `apifieldname` column to `column`, relabels the first field as `county`, and removes fields we'll never use (`geonum`, `geojson`)
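
To see what each step does in isolation, here is a minimal sketch on a small in-memory frame. The rows, the `other_standard` type value, and the `geoname` field name are made up for illustration and are not taken from the real file. It also relabels by value rather than by position, which is a swapped-in alternative to the positional assignment used in the cell below.

```python
import pandas as pd

# Toy stand-in for the source table; every row here is hypothetical.
raw = pd.DataFrame({
    "apifieldname": ["geoname", "geonum", "pop", "pop"],
    "type": ["acs_standard", "acs_standard", "acs_standard", "other_standard"],
    "description": [
        "Geographic Area common name",
        "Numeric geography id",
        "Population Estimate",
        "Population (other standard)",
    ],
})

# 1. Keep only the rows that describe acs_standard fields.
toy = raw[raw["type"] == "acs_standard"]
# 2. Keep only the field name and description, and rename the name column.
toy = toy[["apifieldname", "description"]].rename(columns={"apifieldname": "column"})
# 3. Drop fields that are never used downstream.
toy = toy[~toy["column"].isin(["geonum", "geojson"])]
# 4. Relabel one field by its value (hypothetical 'geoname') instead of by position.
toy = toy.assign(column=toy["column"].replace({"geoname": "county"}))

print(toy["column"].tolist())
```

Relabeling by value keeps working if the row order of the source file changes, whereas a positional assignment assumes the field of interest is always first.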

In [1]:
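import pandas as pd

# Load the field-description table that accompanies the census datasets
desc = pd.read_csv('../input-data/census_field_desc__qten-sdpn.csv')
# Keep only the descriptions that belong to the acs_standard
desc = desc[desc.type == 'acs_standard']
# Keep only the field name and its description
desc = desc[['apifieldname', 'description']]
desc = desc.rename(columns={'apifieldname': 'column'})
# Drop fields we never use; .copy() makes the assignment below act on an independent frame
desc = desc[~desc.column.isin(['geonum', 'geojson'])].copy()
# Relabel the first remaining field (the geographic area name) as 'county'
desc.iat[0, 0] = 'county'
desc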

Unnamed: 0,column,description
0,county,Geographic Area common name
3,pop,Population Estimate for the given time range
4,hispanic,Estimate for the Hispanic Population
5,white_nh,"Estimate for the White, Non-Hispanic Population"
6,black_nh,"Estimate for the Black, Non-Hispanic Population"
...,...,...
153,emp,Estimated number of people in the civilian labor force that are employed
154,unemp,Estimated number of people in the civilian labor force that are not employed
155,armedfrcs,Estimated number of people in the armed forces
156,not_lf,Estimated number of people not in the labor force


In [2]:
desc.to_csv("../working-data/census_field_descriptions.csv", index=False)
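
One way the saved descriptions might be used later is as a lookup from column name to description. The sketch below assumes that use case and reads a few rows copied from the output above through an in-memory buffer in place of the real file. The `lookup` name is hypothetical.

```python
import io
import pandas as pd

# A few rows copied from the output above, standing in for the saved CSV.
sample = io.StringIO(
    "column,description\n"
    "county,Geographic Area common name\n"
    "pop,Population Estimate for the given time range\n"
    'white_nh,"Estimate for the White, Non-Hispanic Population"\n'
)

saved = pd.read_csv(sample)
# Build a plain dict: column name -> human-readable description.
lookup = saved.set_index("column")["description"].to_dict()

print(lookup["white_nh"])
```

Descriptions that contain commas are quoted in the CSV, so they survive the round trip as a single field.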