## Extraction.ipynb
This notebook extracts dataframes from three data sources:
1. `states.csv`: maps state names to state abbreviations
2. `crime_data_w_population_and_crime_rate.csv`: county-level crime rates, based on crime reports from 2013
3. `result.json`: yearly and monthly unemployment data by county in the USA

In [None]:
import pandas as pd
import json

### State-to-abbreviation map
The CSV loads directly with a single `pd.read_csv` call.

In [None]:
state_abb = pd.read_csv("states.csv")
state_abb.head()
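
A common next step with a map like this is to turn it into a plain lookup dictionary. The sketch below is only illustrative: it uses a tiny in-memory sample rather than `states.csv`, and the column names `State` and `Abbreviation` are assumptions that may not match the real file.

```python
import io
import pandas as pd

# Hypothetical sample standing in for states.csv; the real column names may differ.
sample_csv = io.StringIO("State,Abbreviation\nAlabama,AL\nAlaska,AK\n")
sample_states = pd.read_csv(sample_csv)

# Build a plain dict: full state name -> two-letter abbreviation.
name_to_abbr = dict(zip(sample_states["State"], sample_states["Abbreviation"]))

print(name_to_abbr["Alaska"])
```

With the real dataframe, the same `dict(zip(...))` pattern would apply once the actual column names are substituted.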

### Crime Rate Dataframe
This file also loads directly with `pd.read_csv`.

In [None]:
crime_rate_df = pd.read_csv("crime_data_w_population_and_crime_rate.csv")
crime_rate_df.head()

### Unemployment JSON
This one is a little trickier: we cannot load the file directly into a pandas dataframe with a single library function, so we walk the nested JSON and construct the dataframe by hand. Note that we extract only the annual data for 2013.

In [None]:
with open('result.json') as json_file:
    data = json.load(json_file)

# Collect one record per (state, county) pair, then build the dataframe once.
# DataFrame.append was removed in pandas 2.0, and appending row by row is slow.
records = []
for state, v in data['2013']['Annual'].items():
    for county, rate in v['Unemployment Rate'].items():
        records.append({'county': county, 'state': state, 'unemployment_rate': rate})

unemployment_df = pd.DataFrame(records, columns=['county', 'state', 'unemployment_rate'])

unemployment_df.head()
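
If other years or periods are needed later, the same traversal can be wrapped in a small helper. This is a minimal sketch, assuming every year and period follows the `year -> period -> state -> 'Unemployment Rate' -> county` layout used above. The function name `flatten_unemployment`, the numeric coercion, and the sample values are all illustrative and do not come from `result.json`.

```python
import pandas as pd

def flatten_unemployment(data, year, period="Annual"):
    """Flatten data[year][period] into one row per (state, county)."""
    records = [
        {"county": county, "state": state, "unemployment_rate": rate}
        for state, metrics in data[year][period].items()
        for county, rate in metrics["Unemployment Rate"].items()
    ]
    df = pd.DataFrame(records, columns=["county", "state", "unemployment_rate"])
    # Assumption: rates may be stored as strings, so coerce them to numbers;
    # anything unparseable becomes NaN instead of raising.
    df["unemployment_rate"] = pd.to_numeric(df["unemployment_rate"], errors="coerce")
    return df

# Made-up sample with the assumed layout (values are not real data).
sample = {
    "2013": {
        "Annual": {
            "Alabama": {
                "Unemployment Rate": {"Autauga County": "6.2", "Baldwin County": 6.6}
            }
        }
    }
}
sample_df = flatten_unemployment(sample, "2013")
print(sample_df)
```

Building the list of records first and constructing the dataframe once keeps the work linear in the number of counties.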