# Distribution of Federal Dollars by State

A question has arisen regarding the relationship between the state and local tax deduction and the distribution of federal expenditure by state.  This notebook explores that question using the [Government Finance Statistics](http://www.census.gov/govs/local/) published by the Census Bureau.

In [1]:
import numpy as np
import pandas as pd
from pandas import Series, DataFrame
import seaborn as sb

## Data Input

First we need to load both the GFS data and the population data required to compute per capita dollars.  The former comes from a conveniently [aggregated state table](http://factfinder.census.gov/faces/tableservices/jsf/pages/productview.xhtml?src=bkmk) housed on American FactFinder.  The latter are 3-year ACS estimates, also taken from American FactFinder.  Both data sets are from 2013.

In [9]:
#Establish working directory
workdir='O:/Analyst/Marvin/BudgetOptions/GFS2013/'

#Read in mapping of variable names for both sets
var_map_gfs_df=pd.read_csv(workdir+'SLF_2013_SLF003_metadata.csv',names=['var','desc'])
var_map_gfs=dict(zip(var_map_gfs_df['var'],var_map_gfs_df['desc']))
var_map_pop_df=pd.read_csv(workdir+'ACS_13_3YR_B01003_metadata.csv',names=['var','desc'])
var_map_pop=dict(zip(var_map_pop_df['var'],var_map_pop_df['desc']))

#Read in data
gfs=pd.read_csv(workdir+'SLF_2013_SLF003.csv')
pop=pd.read_csv(workdir+'ACS_13_3YR_B01003.csv')

# print() with a single argument behaves the same under Python 2 and Python 3
print(pop.head())
print(gfs.head())

        GEO.id  GEO.id2 GEO.display-label  HD01_VD01  HD02_VD01
0  0400000US01        1           Alabama    4817624        NaN
1  0400000US02        2            Alaska     729603        NaN
2  0400000US04        4           Arizona    6548856        NaN
3  0400000US05        5          Arkansas    2949238        NaN
4  0400000US06        6        California   38000360        NaN
        GEO.id  GEO.id2 GEO.display-label    Amt1_REV  CV1_REV      A2_REV  \
0    0100000US      NaN     United States  3419868195     0.07  2193443291   
1  0400000US01        1           Alabama    44644352     0.53    29092950   
2  0400000US02        2            Alaska    17371758     0.16    14020580   
3  0400000US04        4           Arizona    57247325     0.33    36947686   
4  0400000US05        5          Arkansas    26683523     0.36    21541707   

     Amt2_REV  CV2_REV  Amt1_GENREV  CV1_GENREV   ...     Amt1_BONDFUN  \
0  1709268981     0.16   2690426735        0.08   ...        183698004   

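As the output shows, the `GEO.id` and `GEO.id2` columns line up across the two tables for the states, so we have a good merge key.  Note that the GFS table also carries a national total row (`0100000US`).  We clearly do not need all of these variables, so let's cut down to a few key concepts: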

+ State and Local Own Source Revenue
+ Federal Transfers to the State
+ Federal Taxes Paid
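
Before narrowing the variables, here is a minimal sketch of how the merge and a per capita calculation could look. It uses small stand-in frames built from rows in the printed output above, not the full files. The names `pop_demo`, `gfs_demo`, `merged_demo`, and `rev_per_capita` are hypothetical, and using `Amt1_REV` as the numerator is only an illustration; it is not a claim about which GFS variable corresponds to any of the concepts listed.

```python
import pandas as pd

# Stand-in tables using values copied from the printed output above (assumption:
# only these few rows and columns, for illustration).
pop_demo = pd.DataFrame({
    "GEO.id": ["0400000US01", "0400000US02"],
    "GEO.display-label": ["Alabama", "Alaska"],
    "HD01_VD01": [4817624, 729603],
})
gfs_demo = pd.DataFrame({
    "GEO.id": ["0100000US", "0400000US01", "0400000US02"],
    "GEO.display-label": ["United States", "Alabama", "Alaska"],
    "Amt1_REV": [3419868195, 44644352, 17371758],
})

# Check how well the keys overlap before merging.
unmatched = set(gfs_demo["GEO.id"]) - set(pop_demo["GEO.id"])
print("GFS keys with no population match:", unmatched)

# An inner join on GEO.id keeps only rows present in both tables, so the
# national total row drops out here. validate= raises if a key repeats.
merged_demo = gfs_demo.merge(
    pop_demo[["GEO.id", "HD01_VD01"]],
    on="GEO.id",
    how="inner",
    validate="one_to_one",
)

# Illustrative ratio only: amount divided by the population estimate.
merged_demo["rev_per_capita"] = merged_demo["Amt1_REV"] / merged_demo["HD01_VD01"]
print(merged_demo)
```

Merging on `GEO.id` rather than `GEO.id2` sidesteps the missing `GEO.id2` value on the national row, and the inner join makes the dropped rows explicit rather than leaving missing population values to propagate into the ratio.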