## Importing Libraries

In [1]:
import pandas as pd

### Importing DataFrames

In [2]:
df_main = pd.read_csv('./cleaned_data/incidents2.csv')
df_names = pd.read_csv('./cleaned_data/names2.csv')
df_census = pd.read_csv('./cleaned_data/counties2.csv')
df_states = pd.read_csv('./data/state_abbrv.csv')

pd.set_option('display.max_columns', 300)
pd.set_option('display.max_rows', 300)

### Examining our main dataframe

In [3]:
df_main.head()

Unnamed: 0.1,Unnamed: 0,Unique ID,Name,Age,Gender,Race,Race with imputations,Imputation probability,Date of injury resulting in death (month/day/year),State,Location of death (county),Agency or agencies involved,Highest level of force,Brief description,"Dispositions/Exclusions INTERNAL USE, NOT FOR ANALYSIS",Intended use of force (Developing),"Foreknowledge of mental illness? INTERNAL USE, NOT FOR ANALYSIS",Date,year,month,week_of_year,day_of_month,day_of_week,day_of_year,Agency
0,0.0,25747.0,Mark A. Horton,21,Male,African-American/Black,African-American/Black,Not imputed,01/01/2000,MI,Wayne,,Vehicle,Two Detroit men killed when their car crashed ...,Unreported,Pursuit,No,2000-01-01,2000.0,1.0,0.0,1.0,5,1.0,
1,1.0,25748.0,Phillip A. Blurbridge,19,Male,African-American/Black,African-American/Black,Not imputed,01/01/2000,MI,Wayne,,Vehicle,Two Detroit men killed when their car crashed ...,Unreported,Pursuit,No,2000-01-01,2000.0,1.0,0.0,1.0,5,1.0,
2,2.0,25746.0,Samuel H. Knapp,17,Male,European-American/White,European-American/White,Not imputed,01/01/2000,CA,Mendocino,mendocino county sheriff's office,Vehicle,Samuel Knapp was allegedly driving a stolen ve...,Unreported,Pursuit,No,2000-01-01,2000.0,1.0,0.0,1.0,5,1.0,sheriff
3,3.0,25749.0,Mark Ortiz,23,Male,Hispanic/Latino,Hispanic/Latino,Not imputed,01/01/2000,NM,Eddy,eddy county sheriff's office,Vehicle,A motorcycle was allegedly being driven errati...,Unreported,Pursuit,No,2000-01-01,2000.0,1.0,0.0,1.0,5,1.0,sheriff
4,4.0,1.0,LaTanya Janelle McCoy,24,Female,African-American/Black,African-American/Black,Not imputed,01/02/2000,CA,Sacramento,sacramento police department,Vehicle,LaTanya Janelle McCoy's car was struck from be...,Unknown,Pursuit,No,2000-01-02,2000.0,1.0,1.0,2.0,6,2.0,police


Because of the uninspired nature of local governments across the nation, county names are often repeated from state to state. To avoid ambiguity when aggregating by county down the line, we create a new location feature that combines the county name with the state it belongs to. This feature also acts as our foreign key to the census table, which is aggregated by county.

In [4]:
df_main['county_state'] = df_main['Location of death (county)'] + ', ' + df_main['State']
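
As a minimal sketch of why the composite key matters, the toy frame below (made-up rows, not taken from the real incidents file) has two different counties that share the name "Wayne". It also shows that a row with a missing county ends up with a missing key, which an inner merge would later drop.

```python
import pandas as pd

# Toy incidents data; these rows are illustrative assumptions, not real records
toy = pd.DataFrame({
    "Location of death (county)": ["Wayne", "Wayne", None],
    "State": ["MI", "OH", "CA"],
})

# Same construction as in the cell above: "<county>, <state abbreviation>"
toy["county_state"] = toy["Location of death (county)"] + ", " + toy["State"]

# The county name alone is ambiguous; the composite key is not
print(toy["Location of death (county)"].nunique())  # distinct county names
print(toy["county_state"].nunique())                # distinct county/state keys
print(toy["county_state"].isna().sum())             # keys lost to a missing county
```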

To build the primary key in the census dataframe in the same format, we start by removing the word "County" that follows every county name. The function below does this by dropping the last word of a phrase.

In [5]:
def decountyifier(x):
    # Split the phrase into words, drop the last word, and rejoin the rest
    words = x.split()
    return " ".join(words[:-1])

### Mapping function to every row in the 'name' column of the census dataframe.

In [6]:
df_census['name'] = df_census['name'].map(decountyifier)
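
The sketch below illustrates what this mapping does on a few sample strings. The strings are assumptions chosen for illustration and may not all occur in the census file. Note that the function removes whatever the last word is, so a name ending in "Parish" loses that word too. A vectorized equivalent using `str.rsplit` is shown for comparison; it agrees with the function for multi-word names.

```python
import pandas as pd

def decountyifier(x):
    """Drop the last whitespace-separated word from a phrase."""
    return " ".join(x.split()[:-1])

# Illustrative names only (assumed, not read from counties2.csv)
names = pd.Series(["Wayne County", "St. Louis County", "Orleans Parish"])

# Row-by-row mapping, as in the notebook
mapped = names.map(decountyifier)

# Vectorized alternative: split once from the right and keep the left part
vectorized = names.str.rsplit(n=1).str[0]

print(mapped.tolist())
print(vectorized.tolist())
```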

Another problem presents itself: the incidents dataframe identifies states by two-letter abbreviation, whereas the census dataframe uses the full state name. To convert the full names into abbreviations, we use a fourth table that contains only the abbreviated and full state names, and join it with the census data on the state column. To simplify the merge, we first rename the state column so that it has the same name in both dataframes. Finally, we concatenate the county name and the state abbreviation, separated by a comma and a space.

### Renaming state column

In [7]:
df_states.rename(columns={'State':'state'}, inplace=True)

### Merging state abbreviations to census dataframes

In [8]:
df_census = pd.merge(df_census, df_states, how="inner", on=['state'])

### Creating county, state column

In [9]:
df_census['county_state'] = df_census['name'] + ', ' + df_census['Code']
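
The three steps above can be sketched end to end on toy data. The column names `State`, `Abbrev`, and `Code` follow the merged output shown later in this notebook; the two-row frames themselves are illustrative assumptions.

```python
import pandas as pd

# Toy census rows, already stripped of the trailing "County"
census = pd.DataFrame({
    "name": ["Wayne", "Cook"],
    "state": ["Michigan", "Illinois"],
})

# Toy lookup table of full state names and abbreviations
states = pd.DataFrame({
    "State": ["Michigan", "Illinois"],
    "Abbrev": ["Mich.", "Ill."],
    "Code": ["MI", "IL"],
})

# 1. Align the join column name
states = states.rename(columns={"State": "state"})

# 2. Attach the two-letter code to every census row
census = pd.merge(census, states, how="inner", on="state")

# 3. Build the key in the same "<county>, <code>" format as the incidents table
census["county_state"] = census["name"] + ", " + census["Code"]

print(census["county_state"].tolist())
```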

Similarly, to merge the incidents dataframe with the last names dataframe, the data must have the same format in both. This means extracting the last name from the name column of the incidents dataframe, lowercasing it, and removing suffixes. The function below splits a name into terms (including any suffix) and returns the last term if it is not a suffix, or the second-to-last term if it is.

In [10]:
def last_name(x):
    # Generational suffixes that should not be treated as a last name
    suffixes = {'Jr.', 'Sr.', 'II', 'III', 'IV', 'V', 'VI', 'VII'}
    x1 = x.split()
    x2 = x1[-1]
    # If the final term is a suffix, the last name is the term before it
    if x2 in suffixes:
        return x1[-2]
    return x2
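
A quick check of this logic on a few names may help. The sketch below uses a compact equivalent of the function; "Mark A. Horton" comes from the data shown above, while the "John Smith" variants are made up. The last case shows a limitation: a suffix written without a period ("Jr") is not in the suffix list, so it is returned as if it were the last name.

```python
# Compact equivalent of the function above, using a set of known suffixes
SUFFIXES = {"Jr.", "Sr.", "II", "III", "IV", "V", "VI", "VII"}

def last_name(x):
    parts = x.split()
    # Skip the final term only when it is a recognized suffix
    return parts[-2] if parts[-1] in SUFFIXES else parts[-1]

examples = ["Mark A. Horton", "John Smith Jr.", "John Smith III", "John Smith Jr"]
results = [last_name(name) for name in examples]

for name, result in zip(examples, results):
    print(f"{name!r} -> {result!r}")
```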

We map this function over the Name column and store the result in a new last_name column.

In [11]:
df_main['last_name'] = df_main['Name'].map(lambda x: last_name(x))

Afterwards, we apply a blanket string case-lowering method to the entire column.

In [12]:
df_main['last_name'] = df_main['last_name'].str.lower()

Finally, we rename the name column in the names dataframe to last_name so that it matches the column in the incidents dataframe.

In [13]:
df_names.rename(columns = {'name':'last_name'}, inplace = True)

Again, we apply a string case-lowering method, this time to the last name column from the names dataframe.

In [14]:
df_names['last_name'] = df_names['last_name'].str.lower()

### Merging dataframes

Merging incidents dataframe to census dataframe

In [15]:
df_main = pd.merge(df_main, df_census, how='inner', on = ['county_state'])

Merging incidents dataframe to last names dataframe

In [16]:
df_main = pd.merge(df_main, df_names, how='inner', on = ['last_name'])
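
Because both merges use `how='inner'`, any incident whose key has no match in the other table is silently dropped. One way to see how many rows that affects is sketched below on toy data: a left merge with `indicator=True` labels each row by where it matched. The `"Nowhere, ZZ"` row is a made-up unmatched key, and `validate="many_to_one"` is an optional extra check that the right-hand table has unique keys.

```python
import pandas as pd

# Toy incidents: the third key is an assumption with no census counterpart
incidents = pd.DataFrame({
    "Unique ID": [1, 2, 3],
    "county_state": ["Wayne, MI", "Cook, IL", "Nowhere, ZZ"],
})

# Toy census table, one row per county
census = pd.DataFrame({
    "county_state": ["Wayne, MI", "Cook, IL"],
    "pop2010": [1820584, 5194675],
})

# The inner merge keeps only matching rows
inner = pd.merge(incidents, census, how="inner", on="county_state")

# The left merge keeps every incident and records the match status in "_merge"
checked = pd.merge(
    incidents, census,
    how="left", on="county_state",
    indicator=True, validate="many_to_one",
)
unmatched = checked[checked["_merge"] == "left_only"]

print(len(incidents), len(inner), len(unmatched))
print(unmatched["county_state"].tolist())
```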

### Displaying the full, newly created dataframe

In [17]:
df_main

Unnamed: 0.1,Unnamed: 0_x,Unique ID,Name,Age,Gender,Race,Race with imputations,Imputation probability,Date of injury resulting in death (month/day/year),State,Location of death (county),Agency or agencies involved,Highest level of force,Brief description,"Dispositions/Exclusions INTERNAL USE, NOT FOR ANALYSIS",Intended use of force (Developing),"Foreknowledge of mental illness? INTERNAL USE, NOT FOR ANALYSIS",Date,year,month,week_of_year,day_of_month,day_of_week,day_of_year,Agency,county_state,last_name,Unnamed: 0_y,name,pop2000,pop2010,pop2011,pop2012,pop2013,pop2014,pop2015,pop2016,pop2017,age_under_5_2010,age_under_5_2017,age_under_18_2010,age_over_65_2010,age_over_65_2017,median_age_2017,female_2010,white_2010,black_2010,black_2017,native_2010,native_2017,asian_2010,asian_2017,pac_isl_2017,other_single_race_2017,two_plus_races_2010,two_plus_races_2017,hispanic_2010,hispanic_2017,white_not_hispanic_2010,white_not_hispanic_2017,speak_english_only_2017,no_move_in_one_plus_year_2010,foreign_born_2010,foreign_spoken_at_home_2010,women_16_to_50_birth_rate_2017,hs_grad_2010,hs_grad_2016,hs_grad_2017,some_college_2016,some_college_2017,bachelors_2010,bachelors_2016,bachelors_2017,veterans_2010,veterans_2017,mean_work_travel_2010,mean_work_travel_2017,broadband_2017,computer_2017,housing_units_2010,homeownership_2010,housing_multi_unit_2010,median_val_owner_occupied_2010,households_2010,households_2017,persons_per_household_2010,persons_per_household_2017,per_capita_income_2010,per_capita_income_2017,median_household_income_2010,median_household_income_2016,median_household_income_2017,private_nonfarm_establishments_2009,private_nonfarm_employment_2009,percent_change_private_nonfarm_employment_2009,nonemployment_establishments_2009,sales_2007,sales_per_capita_2007,building_permits_2010,fed_spending_2009,area_2010,density_2010,poverty_2010,poverty_2016,poverty_2017,poverty_age_under_5_2017,poverty_age_under_18_2017,civilian_labor_force_2007,employed_2007,unemployed_2007,unemployment_rate_2007,civilian_labor_force_2008,employed_2008,unemployed_2008,unemployment_rate_2008,civilian_labor_force_2009,employed_2009,unemployed_2009,unemployment_rate_2009,civilian_labor_force_2010,employed_2010,unemployed_2010,unemployment_rate_2010,civilian_labor_force_2011,employed_2011,unemployed_2011,unemployment_rate_2011,civilian_labor_force_2012,employed_2012,unemployed_2012,unemployment_rate_2012,civilian_labor_force_2013,employed_2013,unemployed_2013,unemployment_rate_2013,civilian_labor_force_2014,employed_2014,unemployed_2014,unemployment_rate_2014,civilian_labor_force_2015,employed_2015,unemployed_2015,unemployment_rate_2015,civilian_labor_force_2016,employed_2016,unemployed_2016,unemployment_rate_2016,uninsured_2017,uninsured_age_under_19_2017,uninsured_age_over_74_2017,civilian_labor_force_2017,employed_2017,unemployed_2017,unemployment_rate_2017,age_over_18_2019,age_over_65_2019,age_over_85_2019,age_under_5_2019,asian_2019,avg_family_size_2019,bachelors_2019,black_2019,hispanic_2019,household_has_broadband_2019,household_has_computer_2019,household_has_smartphone_2019,households_2019,households_speak_asian_or_pac_isl_2019,households_speak_limited_english_2019,households_speak_other_2019,households_speak_other_indo_euro_lang_2019,households_speak_spanish_2019,housing_mobile_homes_2019,housing_one_unit_structures_2019,housing_two_unit_structures_2019,hs_grad_2019,mean_household_income_2019,median_age_2019,median_household_income_2019,median_individual_income_2019,median_individual_income_age_25plus_2019,native_2019,other_single_race_2019,pac_isl_2019,per_capita_income_2019,persons_per_household_2019,pop_2019,two_plus_races_2019,unemployment_rate_2019,uninsured_2019,uninsured_65_and_older_2019,uninsured_under_19_2019,uninsured_under_6_2019,veterans_2019,white_2019,white_not_hispanic_2019,metro_2013_1.0,metro_2013_Unknown,smoking_ban_2010_comprehensive,smoking_ban_2010_none,smoking_ban_2010_partial,state,Abbrev,Code,Unnamed: 0,rank,count,prop100k,cum_prop100k,pctwhite,pctblack,pctapi,pctaian,pct2prace,pcthispanic
0,0.0,25747.0,Mark A. Horton,21,Male,African-American/Black,African-American/Black,Not imputed,01/01/2000,MI,Wayne,,Vehicle,Two Detroit men killed when their car crashed ...,Unreported,Pursuit,No,2000-01-01,2000.0,1.0,0.0,1.0,5,1.0,,"Wayne, MI",horton,1312,Wayne,2061162.0,1820584,1802164.0,1794073.0,1777790.0,1769007.0,1762098.0,1756598.0,1753616.0,6.5,6.5,25.4,12.7,14.4,37.9,52.0,52.3,40.5,19.55,0.4,0.16,2.5,1.56,0.01,0.97,2.4,1.17,5.2,5.72,49.6,49.68,86.0,85.4,7.6,12.0,5.9,83.2,85.1,85.6,32.6,32.6,20.2,22.3,22.8,120005,6.4,24.6,25.2,68.9,83.6,821693,67.2,23.4,121100,690943,673143.0,2.67,2.59,22125,24209.27,42241,43570.0,43702.0,32960,593483,-21.7,117124.0,17275751.0,8720.0,735,19357718.0,612.08,2974.4,21.4,22.9,23.7,39.2,35.3,883501.0,810552.0,72949.0,8.26,863865.0,783771.0,80094.0,9.27,869996.0,728832.0,141164.0,16.23,802754.0,678426.0,124328.0,15.49,774443.0,674177.0,100266.0,12.95,771845.0,681453.0,90392.0,11.71,776560.0,687300.0,89260.0,11.49,768439.0,694244.0,74195.0,9.66,758684.0,706075.0,52609.0,6.93,778434.0,729654.0,48780.0,6.27,8.8,3.4,0.4,789088.0,746415.0,42673.0,5.41,76.2,15.1,2.0,6.6,3.4,3.33,23.9,38.7,5.9,75.5,69.2,77.1,682282,1.4,2.5,4.3,3.9,4.1,38.0,1.7,62.0,86.5,67907,37.9,47301,26659,36588,0.3,1.9,0.0,27282,2.54,1757299,2.5,8.7,6.3,0.7,2.8,1.9,5.9,53.1,49.5,1,0,0,1,0,Michigan,Mich.,MI,334,335,83523,30.96,28267.95,72.15,23.50,0.40,0.69,1.73,1.53
1,25492.0,25227.0,Andre Horton,42.0,Male,African-American/Black,African-American/Black,Not imputed,12/13/2018,TN,Shelby,memphis police department,Gunshot,Andre Horton was reportedly pointing a realist...,Justified by District Attorney,Deadly force,Unknown,2018-12-13,2018.0,12.0,49.0,13.0,3,347.0,police,"Shelby, TN",horton,2506,Shelby,897472.0,927644,933268.0,939425.0,938825.0,938434.0,937885.0,937130.0,936961.0,7.2,7.2,26.4,10.3,12.2,35.3,52.3,40.6,52.1,26.68,0.2,0.08,2.3,1.26,0.02,1.38,1.4,0.87,5.6,6.10,38.7,36.45,90.7,81.6,6.0,8.5,5.8,84.9,87.1,87.6,30.1,29.9,27.8,30.2,30.6,62382,7.5,22.4,22.9,70.5,82.4,398274,61.7,27.6,135300,340443,349207.0,2.65,2.64,25002,27419.32,44705,47639.0,48415.0,20262,428357,-10.3,70282.0,11932863.0,12971.0,1548,8657013.0,763.17,1215.5,19.7,20.8,20.8,39.5,33.7,446771.0,424197.0,22574.0,5.05,440054.0,409635.0,30419.0,6.91,438125.0,393849.0,44276.0,10.11,448829.0,405047.0,43782.0,9.75,451309.0,408345.0,42964.0,9.52,446126.0,407723.0,38403.0,8.61,438627.0,399995.0,38632.0,8.81,427667.0,395109.0,32558.0,7.61,429297.0,401766.0,27531.0,6.41,434261.0,411126.0,23135.0,5.33,12.8,5.9,0.4,439415.0,420439.0,18976.0,4.32,74.9,13.1,1.5,7.1,2.6,3.38,31.6,53.7,6.4,75.3,67.5,76.5,351194,1.7,1.8,1.0,1.8,4.9,44.9,0.9,55.1,88.4,76734,35.6,51657,28181,36699,0.2,2.7,0.0,30104,2.62,936374,1.7,6.8,11.5,1.0,5.6,4.7,7.2,39.1,35.8,1,0,1,0,0,Tennessee,Tenn.,TN,334,335,83523,30.96,28267.95,72.15,23.50,0.40,0.69,1.73,1.53
2,16483.0,13379.0,Marlon Horton,28.0,Male,African-American/Black,African-American/Black,Not imputed,09/07/2013,IL,Cook,chicago police department,Gunshot,A sleeping Marlon Horton was asked to leave th...,Pending investigation,Deadly force,Unknown,2013-09-07,2013.0,9.0,35.0,7.0,5,250.0,police,"Cook, IL",horton,610,Cook,5376741.0,5194675,5217049.0,5237174.0,5250498.0,5253756.0,5245831.0,5231356.0,5211263.0,6.6,6.3,23.7,11.9,13.5,36.4,51.6,55.4,24.8,11.86,0.4,0.13,6.2,3.49,0.02,4.94,2.5,1.25,24.0,25.05,43.9,42.68,64.9,85.8,21.0,33.7,5.1,83.2,85.9,86.2,25.7,25.5,33.2,36.5,37.2,244059,4.3,31.8,32.9,76.7,85.9,2180359,60.4,54.0,265800,1936481,1956561.0,2.63,2.63,29335,33031.18,53942,60025.0,59426.0,127868,2245334,-12.1,408366.0,60585557.0,11571.0,2734,44219617.0,945.33,5495.1,15.3,15.0,15.9,23.3,22.8,2615750.0,2478215.0,137535.0,5.26,2615675.0,2447178.0,168497.0,6.44,2604692.0,2330033.0,274659.0,10.54,2643833.0,2356472.0,287361.0,10.87,2636434.0,2360934.0,275500.0,10.45,2653326.0,2397792.0,255534.0,9.63,2671904.0,2414722.0,257182.0,9.63,2653342.0,2455149.0,198193.0,7.47,2648861.0,2484494.0,164367.0,6.21,2669585.0,2507250.0,162335.0,6.08,11.1,3.7,1.1,2653153.0,2514113.0,139040.0,5.24,78.0,14.3,1.9,6.2,7.3,3.39,38.8,23.4,25.3,81.4,77.1,80.2,1972108,4.5,7.7,2.0,9.9,17.6,43.1,0.7,56.9,87.1,95677,36.8,64660,32955,44142,0.3,9.6,0.0,37552,2.59,5198275,2.7,6.6,8.8,1.3,3.4,2.3,3.9,56.7,42.3,1,0,1,0,0,Illinois,Ill.,IL,334,335,83523,30.96,28267.95,72.15,23.50,0.40,0.69,1.73,1.53
3,3948.0,3153.0,Donnie Clay Coleman Jr. aka Leon Eric Horton,27,Male,African-American/Black,African-American/Black,Not imputed,02/09/2004,CA,Kern,kern county sheriff's office,Drug overdose,Donnie Clay Coleman Jr. aka Leon Eric Horton d...,Unreported,No,Unknown,2004-02-09,2004.0,2.0,6.0,9.0,0,40.0,sheriff,"Kern, CA",horton,200,Kern,661645.0,839631,848767.0,855237.0,864014.0,871895.0,879607.0,885086.0,893119.0,8.7,8.2,30.3,9.0,10.2,31.3,48.4,59.5,5.8,2.74,1.5,0.55,4.2,2.35,0.09,5.03,4.5,1.68,49.2,52.22,38.6,35.35,55.9,79.4,20.5,41.0,6.6,71.1,73.6,73.8,30.7,30.3,14.7,15.7,15.8,47365,6.1,23.2,23.4,73.9,83.0,284367,61.4,18.1,217100,248057,264993.0,3.14,3.20,20100,21636.25,47089,49812.0,50826.0,12111,179606,19.5,41628.0,7876043.0,10037.0,1656,5762188.0,8131.92,103.3,20.6,22.4,22.6,34.3,31.0,345187.0,316959.0,28228.0,8.18,359141.0,323808.0,35333.0,9.84,362714.0,311707.0,51007.0,14.06,371515.0,313361.0,58154.0,15.65,382367.0,325712.0,56655.0,14.82,391888.0,340355.0,51533.0,13.15,393440.0,347213.0,46227.0,11.75,393459.0,352504.0,40955.0,10.41,390873.0,350944.0,39929.0,10.22,388445.0,347996.0,40449.0,10.41,11.2,4.5,0.9,384944.0,349502.0,35442.0,9.21,70.9,10.7,1.2,7.9,4.7,3.69,16.4,5.5,53.3,79.7,70.0,79.3,270282,3.1,9.5,0.8,1.9,37.8,41.7,6.6,58.3,74.1,73478,31.6,53350,25013,32876,1.0,10.7,0.2,23326,3.17,887641,3.5,9.4,7.9,1.0,3.2,2.3,5.7,74.4,34.2,1,0,0,1,0,California,Calif.,CA,334,335,83523,30.96,28267.95,72.15,23.50,0.40,0.69,1.73,1.53
4,12134.0,9594.0,Kenneth Michael Horton,31,Male,African-American/Black,African-American/Black,Not imputed,11/04/2010,TX,Dallas,dallas police department,Gunshot,Horton parked near his ex-girlfriend's apartme...,Suicide,Suicide,No,2010-11-04,2010.0,11.0,44.0,4.0,3,308.0,police,"Dallas, TX",horton,2579,Dallas,2218899.0,2368139,2408133.0,2454781.0,2483807.0,2517417.0,2554233.0,2587462.0,2618148.0,8.1,7.6,27.6,8.8,10.0,33.3,50.6,53.5,22.3,11.22,0.7,0.16,5.0,3.00,0.03,3.58,2.8,1.35,38.3,39.63,33.1,30.22,57.4,80.9,23.0,38.8,5.9,76.5,78.0,78.3,25.8,25.6,28.0,29.7,30.1,112642,5.1,25.7,27.2,76.5,87.2,943257,54.7,38.9,129700,832360,906179.0,2.75,2.78,26185,29202.16,47974,54429.0,53626.0,61359,1253122,-15.2,185432.0,33177208.0,13929.0,5485,17621936.0,871.28,2718.0,17.6,16.3,17.7,27.6,26.9,1137194.0,1085427.0,51767.0,4.55,1140659.0,1080079.0,60580.0,5.31,1143197.0,1049081.0,94116.0,8.23,1194015.0,1091493.0,102522.0,8.59,1212288.0,1113577.0,98711.0,8.14,1223343.0,1136230.0,87113.0,7.12,1234933.0,1153569.0,81364.0,6.59,1250984.0,1183245.0,67739.0,5.41,1261721.0,1207803.0,53918.0,4.27,1296256.0,1244191.0,52065.0,4.02,22.1,13.5,1.5,1333933.0,1282785.0,51148.0,3.83,73.6,10.5,1.2,7.5,6.3,3.51,31.5,22.6,40.2,81.1,74.1,82.9,928341,4.1,10.8,2.1,3.4,29.2,50.0,1.5,50.0,79.3,88716,33.4,59607,31136,38500,0.4,6.8,0.0,32653,2.78,2606868,2.6,4.4,21.0,2.5,13.8,9.6,4.8,61.3,29.1,1,0,0,1,0,Texas,Tex.,TX,334,335,83523,30.96,28267.95,72.15,23.50,0.40,0.69,1.73,1.53
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
26002,30491.0,30480.0,Gary Lee Deering,74.0,Male,European-American/White,European-American/White,Not imputed,06/23/2021,MO,Dade,dade county sheriff's office,Gunshot,Gary Deering was shot and killed by a Dade Cou...,Pending investigation,Deadly force,Unknown,2021-06-23,2021.0,6.0,25.0,23.0,2,174.0,sheriff,"Dade, MO",deering,1511,Dade,7923.0,7883,7769.0,7561.0,7536.0,7600.0,7575.0,7606.0,7588.0,5.2,4.6,22.6,20.5,22.6,46.7,49.9,96.0,0.4,0.20,0.9,0.35,0.3,0.06,0.04,0.12,2.2,1.41,1.5,1.99,94.8,93.97,97.1,84.3,0.6,2.9,5.2,82.0,86.4,87.6,28.1,28.6,9.1,14.9,15.1,953,12.9,25.8,24.1,62.9,77.8,3965,77.5,5.2,73000,3276,3107.0,2.42,2.39,16638,20507.95,32714,37884.0,38880.0,149,1254,2.0,506.0,46902.0,6222.0,2,72197.0,490.01,16.1,20.5,17.5,22.9,45.4,34.6,3745.0,3546.0,199.0,5.31,3570.0,3346.0,224.0,6.27,3564.0,3248.0,316.0,8.87,3402.0,3071.0,331.0,9.73,3433.0,3114.0,319.0,9.29,3368.0,3116.0,252.0,7.48,3450.0,3215.0,235.0,6.81,3511.0,3300.0,211.0,6.01,3571.0,3390.0,181.0,5.07,3589.0,3429.0,160.0,4.46,15.1,12.3,0.0,3544.0,3423.0,121.0,3.41,79.2,23.2,2.8,4.8,0.8,2.83,12.7,0.4,2.1,71.6,57.0,69.7,3068,1.9,0.3,0.0,1.7,1.0,23.2,15.3,76.8,87.2,55998,46.2,40399,20907,27455,0.9,0.3,0.0,23186,2.41,7578,2.1,8.5,13.6,1.2,9.3,18.4,11.2,95.5,93.7,0,0,0,1,0,Missouri,Mo.,MO,5527,5528,5779,2.14,61299.21,88.67,6.85,0.69,0.42,1.30,2.08
26003,30502.0,30491.0,David Ronald Bridgette,45.0,Male,European-American/White,European-American/White,Not imputed,06/24/2021,MI,Iron,"iron river police department, iron county sher...",Gunshot,An Iron County Sheriff's Office deputy and an ...,Pending investigation,Deadly force,Unknown,2021-06-24,2021.0,6.0,25.0,24.0,3,175.0,multiple_agencies,"Iron, MI",bridgette,1266,Iron,13138.0,11817,11761.0,11562.0,11495.0,11341.0,11315.0,11182.0,11124.0,4.2,3.8,17.1,26.3,28.9,53.8,50.7,97.1,0.1,0.39,0.9,0.61,0.3,0.23,0.00,0.24,1.4,0.60,1.4,1.82,96.2,94.96,97.2,91.3,1.2,3.5,4.4,88.3,91.7,91.9,28.9,28.3,15.1,18.1,18.5,1445,13.2,18.6,21.9,57.6,75.7,9197,84.9,6.8,75700,5386,5315.0,2.15,2.06,19986,23609.22,33734,38913.0,36773.0,400,2713,-12.8,765.0,101386.0,8496.0,43,131545.0,1166.15,10.1,12.7,15.4,14.2,33.0,22.9,5938.0,5496.0,442.0,7.44,5838.0,5357.0,481.0,8.24,5779.0,5081.0,698.0,12.08,5296.0,4661.0,635.0,11.99,5170.0,4634.0,536.0,10.37,5079.0,4594.0,485.0,9.55,5068.0,4553.0,515.0,10.16,5195.0,4743.0,452.0,8.70,5197.0,4831.0,366.0,7.04,5212.0,4876.0,336.0,6.45,8.2,6.3,0.0,5213.0,4873.0,340.0,6.52,84.0,30.0,4.7,3.8,0.6,2.69,18.5,0.5,2.1,68.2,66.9,59.3,5225,0.8,0.1,0.6,1.4,0.9,19.2,3.7,80.8,91.1,53501,54.3,41599,24450,29816,1.7,0.2,0.0,25944,2.06,11152,1.2,4.3,6.7,0.0,6.6,8.7,11.7,95.8,94.4,0,0,0,1,0,Michigan,Mich.,MI,111805,111740,146,0.05,87974.97,46.58,47.26,0.00,0.00,0.00,4.11
26004,30507.0,30496.0,William Dean Hewett,54.0,Male,European-American/White,European-American/White,Not imputed,06/24/2021,NC,Columbus,u.s. marshals service,Gunshot,U.S. deputy marshals were attempting to arrest...,Pending investigation,Deadly force,No,2021-06-24,2021.0,6.0,25.0,24.0,3,175.0,other,"Columbus, NC",hewett,1913,Columbus,54749.0,58098,57693.0,57534.0,57088.0,56902.0,56686.0,56335.0,55936.0,6.0,5.5,23.3,15.2,18.5,41.9,50.5,61.5,30.5,15.44,3.2,1.85,0.3,0.22,0.01,1.23,1.5,0.49,4.6,5.11,60.4,59.53,94.0,90.1,2.8,5.0,5.3,77.5,80.7,80.6,36.0,35.0,12.0,12.0,12.5,4265,7.9,26.8,26.4,57.6,71.2,26042,72.0,5.8,89000,21779,22462.0,2.53,2.39,18784,22483.26,35421,35290.0,36261.0,1080,12116,-15.2,3237.0,527600.0,9769.0,28,588643.0,937.29,62.0,21.4,24.6,23.6,46.3,34.6,24348.0,23037.0,1311.0,5.38,24150.0,22294.0,1856.0,7.69,24433.0,21287.0,3146.0,12.88,24184.0,20943.0,3241.0,13.40,23939.0,20734.0,3205.0,13.39,23183.0,20341.0,2842.0,12.26,24084.0,21604.0,2480.0,10.30,23030.0,21066.0,1964.0,8.53,22610.0,20907.0,1703.0,7.53,22572.0,21125.0,1447.0,6.41,15.1,5.9,0.0,22324.0,21079.0,1245.0,5.58,78.7,19.5,1.8,5.1,0.4,3.07,13.8,30.5,5.3,66.6,59.8,65.5,21580,0.6,1.3,0.1,0.4,5.4,27.5,30.6,72.5,83.0,55616,41.9,37628,22271,30646,3.7,1.7,0.0,22608,2.46,56068,1.6,6.5,12.6,0.3,4.2,2.4,8.3,62.0,59.3,0,0,0,1,0,North Carolina,N.C.,NC,5210,5211,6159,2.28,60597.99,88.57,7.36,0.47,0.65,1.40,1.56
26005,30615.0,30605.0,Kristopher M. Burden,34.0,Male,European-American/White,European-American/White,Not imputed,07/15/2021,KY,Edmonson,edmonson county sheriff's office,Vehicle,An Edmonson County deputy saw an allegedly sto...,Pending investigation,Pursuit,No,2021-07-15,2021.0,7.0,28.0,15.0,3,196.0,sheriff,"Edmonson, KY",burden,1023,Edmonson,11644.0,12161,12213.0,12077.0,12042.0,11993.0,11986.0,12068.0,12226.0,5.5,4.4,21.8,16.6,19.3,43.1,50.1,96.9,1.4,0.78,0.3,0.19,0.2,0.07,0.00,0.00,0.9,1.02,0.8,0.87,96.5,95.47,98.0,83.9,1.0,2.7,5.4,76.3,76.9,79.2,24.8,26.3,7.2,11.9,11.1,906,8.2,32.4,29.8,71.9,78.7,6467,75.2,3.8,84600,4752,4880.0,2.43,2.39,18959,22569.87,35808,38395.0,41114.0,130,791,4.2,877.0,32536.0,2723.0,0,115136.0,302.88,40.2,18.8,22.3,20.0,28.8,25.0,5322.0,4985.0,337.0,6.33,5316.0,4915.0,401.0,7.54,5274.0,4555.0,719.0,13.63,5122.0,4440.0,682.0,13.32,5122.0,4523.0,599.0,11.69,5033.0,4532.0,501.0,9.95,4996.0,4496.0,500.0,10.01,4776.0,4377.0,399.0,8.35,4640.0,4312.0,328.0,7.07,4755.0,4455.0,300.0,6.31,5.4,2.8,0.0,4884.0,4592.0,292.0,5.98,81.4,20.2,2.1,4.6,0.1,2.89,10.3,1.5,0.7,77.9,65.0,69.2,4885,0.3,0.0,0.3,0.9,0.9,19.6,22.6,80.4,78.1,52578,44.3,43401,21545,31956,0.4,0.0,0.0,21802,2.41,12138,2.5,7.9,6.3,0.0,3.2,3.0,7.5,95.5,95.1,1,0,0,1,0,Kentucky,Ky.,KY,2693,2694,12299,4.56,52597.36,69.48,26.55,0.39,0.76,1.43,1.39


### Checking the full dataframe for null values

In [18]:
def null_sort(df):
    # Count nulls per column, sort in descending order, and keep only columns with at least one null
    return df.isnull().sum().sort_values(ascending=False).loc[lambda series: series > 0]

In [19]:
new = null_sort(df_main)

In [20]:
new = pd.DataFrame(new)

In [21]:
new

Unnamed: 0,0
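
The empty result above indicates that no column in the merged dataframe contains nulls. For contrast, here is a small sketch of what `null_sort` returns when nulls are present; the toy frame and its column names are assumptions for illustration.

```python
import numpy as np
import pandas as pd

def null_sort(df):
    # Null count per column, largest first, dropping columns with no nulls
    return df.isnull().sum().sort_values(ascending=False).loc[lambda s: s > 0]

# Toy frame: "a" has two nulls, "b" has none, "c" has one
toy = pd.DataFrame({
    "a": [1, np.nan, np.nan],
    "b": [1, 2, 3],
    "c": [np.nan, 1, 2],
})

result = null_sort(toy)
print(result)
```

Column `b` is absent from the result because it has no nulls, which is why a fully populated dataframe yields an empty output.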


### Renaming dataframe

In [22]:
df = df_main
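
Note that plain assignment gives the same dataframe a second name rather than creating a copy, so later changes made through `df` are also visible through `df_main`. The sketch below demonstrates this on a toy frame (the values are illustrative); `.copy()` is the usual way to get an independent dataframe if one is wanted.

```python
import pandas as pd

# Toy stand-in for the merged dataframe
df_main = pd.DataFrame({"Age": ["21", "42"]})

# Plain assignment: both names refer to the same object
df = df_main
df["Age"] = df["Age"].astype("float")

same_object = df is df_main
print(same_object)
print(df_main["Age"].dtype)

# An explicit copy is independent of the original
df_copy = df_main.copy()
df_copy["Age"] = df_copy["Age"] + 1
print(df_main["Age"].tolist(), df_copy["Age"].tolist())
```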

### Casting day of week and age columns as float types

In [23]:
df['day_of_week'] = df['day_of_week'].astype('float')

In [24]:
df['Age'] = df['Age'].astype('float')

### Splitting our categorical and numerical variables

In [25]:
df_cat = df.select_dtypes(include='object')
df_num = df.select_dtypes(exclude='object')

### Defining a function which returns the number of unique values in each column of a dataframe

In [26]:
def unq_cnt(df):
    # Return (column name, number of distinct values) pairs; NaN counts as a distinct value
    return [(col, len(df[col].unique())) for col in df.columns]
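
As a hedged aside, pandas offers a built-in that produces the same counts: `DataFrame.nunique`. Because `len(df[col].unique())` counts a missing value as one of the distinct values, the matching call is `nunique(dropna=False)`; the default `dropna=True` would report one fewer for any column containing NaN. The toy frame below is an assumption for illustration.

```python
import numpy as np
import pandas as pd

# Toy categorical data with one missing Gender value
toy = pd.DataFrame({
    "Gender": ["Male", "Female", np.nan, "Male"],
    "State": ["MI", "OH", "CA", "TX"],
})

# Matches len(df[col].unique()): NaN is counted as a distinct value
counts = toy.nunique(dropna=False).sort_values(ascending=False)

# Default behaviour ignores NaN
counts_no_nan = toy.nunique()

print(counts)
print(counts_no_nan)
```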

### Creating a dataframe based on the column unique value count of cat. variables

In [27]:
df_unq_cat = pd.DataFrame(unq_cnt(df_cat))

In [28]:
df_unq_cat.sort_values(by = 1, ascending = False)

Unnamed: 0,0,1
0,Name,25556
10,Brief description,24603
17,last_name,9516
5,Date of injury resulting in death (month/day/...,7427
14,Date,7427
4,Imputation probability,6216
8,Agency or agencies involved,6040
16,county_state,2277
18,name,1420
7,Location of death (county),1420


### Dropping unnecessary categorical columns

In [29]:
df_cat = df_cat.drop(columns = ['Name', 
                       'last_name', 
                       ' Date of injury resulting in death (month/day/year)',
                       'Date',
                       'Imputation probability', 
                       'Agency or agencies involved',
                       'county_state',
                       'name',
                       'Location of death (county)',
                       'Race with imputations', 
                       'Dispositions/Exclusions INTERNAL USE, NOT FOR ANALYSIS',
                       'state',
                       'Race',
                       'Abbrev',
                       'Code', 'Brief description'])

### Checking unique values after dropping columns

In [30]:
df_unq_cat = pd.DataFrame(unq_cnt(df_cat))

In [31]:
df_unq_cat.sort_values(by = 1, ascending = False)

Unnamed: 0,0,1
1,State,50
2,Highest level of force,17
3,Intended use of force (Developing),8
5,Agency,5
4,"Foreknowledge of mental illness? INTERNAL USE,...",4
0,Gender,3


### Dummifying remaining categorical variables

In [32]:
df_cat = pd.get_dummies(df_cat, drop_first=True)
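
To make the effect of `drop_first=True` concrete, the sketch below dummifies a toy frame. The category values are assumptions for illustration. For each column, the first category in sorted order is dropped and acts as the baseline, so a column with k categories yields k - 1 indicator columns.

```python
import pandas as pd

# Toy categorical data; the values are illustrative
toy = pd.DataFrame({
    "Gender": ["Male", "Female", "Transgender"],
    "Agency": ["police", "sheriff", "police"],
})

dummies = pd.get_dummies(toy, drop_first=True)

# "Gender_Female" and "Agency_police" are the dropped baselines
print(list(dummies.columns))
print(dummies.astype(int))
```

A row of all zeros in the `Gender_*` columns therefore means the baseline category, here "Female".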

### Creating a dataframe based on the column unique value count of num. variables

In [33]:
df_unq_num = pd.DataFrame(unq_cnt(df_num))

In [34]:
df_unq_num.sort_values(by = 1, ascending = False)

Unnamed: 0,0,1
0,Unnamed: 0_x,25750
1,Unique ID,25750
186,cum_prop100k,9516
182,Unnamed: 0,9516
184,count,6548
183,rank,6548
187,pctwhite,4472
188,pctblack,3148
9,Unnamed: 0_y,2283
69,per_capita_income_2017,2282


### Dropping unnecessary numeric columns 

In [35]:
df_num = df_num.drop(columns = ['Unnamed: 0_x', 'Unnamed: 0', 'Unnamed: 0_y'])

### Examining numeric variable dataframe 

In [36]:
df_num.head()

Unnamed: 0,Unique ID,Age,year,month,week_of_year,day_of_month,day_of_week,day_of_year,pop2000,pop2010,pop2011,pop2012,pop2013,pop2014,pop2015,pop2016,pop2017,age_under_5_2010,age_under_5_2017,age_under_18_2010,age_over_65_2010,age_over_65_2017,median_age_2017,female_2010,white_2010,black_2010,black_2017,native_2010,native_2017,asian_2010,asian_2017,pac_isl_2017,other_single_race_2017,two_plus_races_2010,two_plus_races_2017,hispanic_2010,hispanic_2017,white_not_hispanic_2010,white_not_hispanic_2017,speak_english_only_2017,no_move_in_one_plus_year_2010,foreign_born_2010,foreign_spoken_at_home_2010,women_16_to_50_birth_rate_2017,hs_grad_2010,hs_grad_2016,hs_grad_2017,some_college_2016,some_college_2017,bachelors_2010,bachelors_2016,bachelors_2017,veterans_2010,veterans_2017,mean_work_travel_2010,mean_work_travel_2017,broadband_2017,computer_2017,housing_units_2010,homeownership_2010,housing_multi_unit_2010,median_val_owner_occupied_2010,households_2010,households_2017,persons_per_household_2010,persons_per_household_2017,per_capita_income_2010,per_capita_income_2017,median_household_income_2010,median_household_income_2016,median_household_income_2017,private_nonfarm_establishments_2009,private_nonfarm_employment_2009,percent_change_private_nonfarm_employment_2009,nonemployment_establishments_2009,sales_2007,sales_per_capita_2007,building_permits_2010,fed_spending_2009,area_2010,density_2010,poverty_2010,poverty_2016,poverty_2017,poverty_age_under_5_2017,poverty_age_under_18_2017,civilian_labor_force_2007,employed_2007,unemployed_2007,unemployment_rate_2007,civilian_labor_force_2008,employed_2008,unemployed_2008,unemployment_rate_2008,civilian_labor_force_2009,employed_2009,unemployed_2009,unemployment_rate_2009,civilian_labor_force_2010,employed_2010,unemployed_2010,unemployment_rate_2010,civilian_labor_force_2011,employed_2011,unemployed_2011,unemployment_rate_2011,civilian_labor_force_2012,employed_2012,unemployed_2012,unemployment_rate_2012,civilian_labor_force_2013,employed_2013,unemployed_2013,unemployment_rate_2013,civilian_labor_force_2014,employed_2014,unemployed_2014,unemployment_rate_2014,civilian_labor_force_2015,employed_2015,unemployed_2015,unemployment_rate_2015,civilian_labor_force_2016,employed_2016,unemployed_2016,unemployment_rate_2016,uninsured_2017,uninsured_age_under_19_2017,uninsured_age_over_74_2017,civilian_labor_force_2017,employed_2017,unemployed_2017,unemployment_rate_2017,age_over_18_2019,age_over_65_2019,age_over_85_2019,age_under_5_2019,asian_2019,avg_family_size_2019,bachelors_2019,black_2019,hispanic_2019,household_has_broadband_2019,household_has_computer_2019,household_has_smartphone_2019,households_2019,households_speak_asian_or_pac_isl_2019,households_speak_limited_english_2019,households_speak_other_2019,households_speak_other_indo_euro_lang_2019,households_speak_spanish_2019,housing_mobile_homes_2019,housing_one_unit_structures_2019,housing_two_unit_structures_2019,hs_grad_2019,mean_household_income_2019,median_age_2019,median_household_income_2019,median_individual_income_2019,median_individual_income_age_25plus_2019,native_2019,other_single_race_2019,pac_isl_2019,per_capita_income_2019,persons_per_household_2019,pop_2019,two_plus_races_2019,unemployment_rate_2019,uninsured_2019,uninsured_65_and_older_2019,uninsured_under_19_2019,uninsured_under_6_2019,veterans_2019,white_2019,white_not_hispanic_2019,metro_2013_1.0,metro_2013_Unknown,smoking_ban_2010_comprehensive,smoking_ban_2010_none,smoking_ban_2010_partial,rank,count,prop100k,cum_prop100k,pctwhite,pctblack,pctapi,pctaian,pct2prace,pcthispanic
0,25747.0,21.0,2000.0,1.0,0.0,1.0,5.0,1.0,2061162.0,1820584,1802164.0,1794073.0,1777790.0,1769007.0,1762098.0,1756598.0,1753616.0,6.5,6.5,25.4,12.7,14.4,37.9,52.0,52.3,40.5,19.55,0.4,0.16,2.5,1.56,0.01,0.97,2.4,1.17,5.2,5.72,49.6,49.68,86.0,85.4,7.6,12.0,5.9,83.2,85.1,85.6,32.6,32.6,20.2,22.3,22.8,120005,6.4,24.6,25.2,68.9,83.6,821693,67.2,23.4,121100,690943,673143.0,2.67,2.59,22125,24209.27,42241,43570.0,43702.0,32960,593483,-21.7,117124.0,17275751.0,8720.0,735,19357718.0,612.08,2974.4,21.4,22.9,23.7,39.2,35.3,883501.0,810552.0,72949.0,8.26,863865.0,783771.0,80094.0,9.27,869996.0,728832.0,141164.0,16.23,802754.0,678426.0,124328.0,15.49,774443.0,674177.0,100266.0,12.95,771845.0,681453.0,90392.0,11.71,776560.0,687300.0,89260.0,11.49,768439.0,694244.0,74195.0,9.66,758684.0,706075.0,52609.0,6.93,778434.0,729654.0,48780.0,6.27,8.8,3.4,0.4,789088.0,746415.0,42673.0,5.41,76.2,15.1,2.0,6.6,3.4,3.33,23.9,38.7,5.9,75.5,69.2,77.1,682282,1.4,2.5,4.3,3.9,4.1,38.0,1.7,62.0,86.5,67907,37.9,47301,26659,36588,0.3,1.9,0.0,27282,2.54,1757299,2.5,8.7,6.3,0.7,2.8,1.9,5.9,53.1,49.5,1,0,0,1,0,335,83523,30.96,28267.95,72.15,23.5,0.4,0.69,1.73,1.53
1,25227.0,42.0,2018.0,12.0,49.0,13.0,3.0,347.0,897472.0,927644,933268.0,939425.0,938825.0,938434.0,937885.0,937130.0,936961.0,7.2,7.2,26.4,10.3,12.2,35.3,52.3,40.6,52.1,26.68,0.2,0.08,2.3,1.26,0.02,1.38,1.4,0.87,5.6,6.1,38.7,36.45,90.7,81.6,6.0,8.5,5.8,84.9,87.1,87.6,30.1,29.9,27.8,30.2,30.6,62382,7.5,22.4,22.9,70.5,82.4,398274,61.7,27.6,135300,340443,349207.0,2.65,2.64,25002,27419.32,44705,47639.0,48415.0,20262,428357,-10.3,70282.0,11932863.0,12971.0,1548,8657013.0,763.17,1215.5,19.7,20.8,20.8,39.5,33.7,446771.0,424197.0,22574.0,5.05,440054.0,409635.0,30419.0,6.91,438125.0,393849.0,44276.0,10.11,448829.0,405047.0,43782.0,9.75,451309.0,408345.0,42964.0,9.52,446126.0,407723.0,38403.0,8.61,438627.0,399995.0,38632.0,8.81,427667.0,395109.0,32558.0,7.61,429297.0,401766.0,27531.0,6.41,434261.0,411126.0,23135.0,5.33,12.8,5.9,0.4,439415.0,420439.0,18976.0,4.32,74.9,13.1,1.5,7.1,2.6,3.38,31.6,53.7,6.4,75.3,67.5,76.5,351194,1.7,1.8,1.0,1.8,4.9,44.9,0.9,55.1,88.4,76734,35.6,51657,28181,36699,0.2,2.7,0.0,30104,2.62,936374,1.7,6.8,11.5,1.0,5.6,4.7,7.2,39.1,35.8,1,0,1,0,0,335,83523,30.96,28267.95,72.15,23.5,0.4,0.69,1.73,1.53
2,13379.0,28.0,2013.0,9.0,35.0,7.0,5.0,250.0,5376741.0,5194675,5217049.0,5237174.0,5250498.0,5253756.0,5245831.0,5231356.0,5211263.0,6.6,6.3,23.7,11.9,13.5,36.4,51.6,55.4,24.8,11.86,0.4,0.13,6.2,3.49,0.02,4.94,2.5,1.25,24.0,25.05,43.9,42.68,64.9,85.8,21.0,33.7,5.1,83.2,85.9,86.2,25.7,25.5,33.2,36.5,37.2,244059,4.3,31.8,32.9,76.7,85.9,2180359,60.4,54.0,265800,1936481,1956561.0,2.63,2.63,29335,33031.18,53942,60025.0,59426.0,127868,2245334,-12.1,408366.0,60585557.0,11571.0,2734,44219617.0,945.33,5495.1,15.3,15.0,15.9,23.3,22.8,2615750.0,2478215.0,137535.0,5.26,2615675.0,2447178.0,168497.0,6.44,2604692.0,2330033.0,274659.0,10.54,2643833.0,2356472.0,287361.0,10.87,2636434.0,2360934.0,275500.0,10.45,2653326.0,2397792.0,255534.0,9.63,2671904.0,2414722.0,257182.0,9.63,2653342.0,2455149.0,198193.0,7.47,2648861.0,2484494.0,164367.0,6.21,2669585.0,2507250.0,162335.0,6.08,11.1,3.7,1.1,2653153.0,2514113.0,139040.0,5.24,78.0,14.3,1.9,6.2,7.3,3.39,38.8,23.4,25.3,81.4,77.1,80.2,1972108,4.5,7.7,2.0,9.9,17.6,43.1,0.7,56.9,87.1,95677,36.8,64660,32955,44142,0.3,9.6,0.0,37552,2.59,5198275,2.7,6.6,8.8,1.3,3.4,2.3,3.9,56.7,42.3,1,0,1,0,0,335,83523,30.96,28267.95,72.15,23.5,0.4,0.69,1.73,1.53
3,3153.0,27.0,2004.0,2.0,6.0,9.0,0.0,40.0,661645.0,839631,848767.0,855237.0,864014.0,871895.0,879607.0,885086.0,893119.0,8.7,8.2,30.3,9.0,10.2,31.3,48.4,59.5,5.8,2.74,1.5,0.55,4.2,2.35,0.09,5.03,4.5,1.68,49.2,52.22,38.6,35.35,55.9,79.4,20.5,41.0,6.6,71.1,73.6,73.8,30.7,30.3,14.7,15.7,15.8,47365,6.1,23.2,23.4,73.9,83.0,284367,61.4,18.1,217100,248057,264993.0,3.14,3.2,20100,21636.25,47089,49812.0,50826.0,12111,179606,19.5,41628.0,7876043.0,10037.0,1656,5762188.0,8131.92,103.3,20.6,22.4,22.6,34.3,31.0,345187.0,316959.0,28228.0,8.18,359141.0,323808.0,35333.0,9.84,362714.0,311707.0,51007.0,14.06,371515.0,313361.0,58154.0,15.65,382367.0,325712.0,56655.0,14.82,391888.0,340355.0,51533.0,13.15,393440.0,347213.0,46227.0,11.75,393459.0,352504.0,40955.0,10.41,390873.0,350944.0,39929.0,10.22,388445.0,347996.0,40449.0,10.41,11.2,4.5,0.9,384944.0,349502.0,35442.0,9.21,70.9,10.7,1.2,7.9,4.7,3.69,16.4,5.5,53.3,79.7,70.0,79.3,270282,3.1,9.5,0.8,1.9,37.8,41.7,6.6,58.3,74.1,73478,31.6,53350,25013,32876,1.0,10.7,0.2,23326,3.17,887641,3.5,9.4,7.9,1.0,3.2,2.3,5.7,74.4,34.2,1,0,0,1,0,335,83523,30.96,28267.95,72.15,23.5,0.4,0.69,1.73,1.53
4,9594.0,31.0,2010.0,11.0,44.0,4.0,3.0,308.0,2218899.0,2368139,2408133.0,2454781.0,2483807.0,2517417.0,2554233.0,2587462.0,2618148.0,8.1,7.6,27.6,8.8,10.0,33.3,50.6,53.5,22.3,11.22,0.7,0.16,5.0,3.0,0.03,3.58,2.8,1.35,38.3,39.63,33.1,30.22,57.4,80.9,23.0,38.8,5.9,76.5,78.0,78.3,25.8,25.6,28.0,29.7,30.1,112642,5.1,25.7,27.2,76.5,87.2,943257,54.7,38.9,129700,832360,906179.0,2.75,2.78,26185,29202.16,47974,54429.0,53626.0,61359,1253122,-15.2,185432.0,33177208.0,13929.0,5485,17621936.0,871.28,2718.0,17.6,16.3,17.7,27.6,26.9,1137194.0,1085427.0,51767.0,4.55,1140659.0,1080079.0,60580.0,5.31,1143197.0,1049081.0,94116.0,8.23,1194015.0,1091493.0,102522.0,8.59,1212288.0,1113577.0,98711.0,8.14,1223343.0,1136230.0,87113.0,7.12,1234933.0,1153569.0,81364.0,6.59,1250984.0,1183245.0,67739.0,5.41,1261721.0,1207803.0,53918.0,4.27,1296256.0,1244191.0,52065.0,4.02,22.1,13.5,1.5,1333933.0,1282785.0,51148.0,3.83,73.6,10.5,1.2,7.5,6.3,3.51,31.5,22.6,40.2,81.1,74.1,82.9,928341,4.1,10.8,2.1,3.4,29.2,50.0,1.5,50.0,79.3,88716,33.4,59607,31136,38500,0.4,6.8,0.0,32653,2.78,2606868,2.6,4.4,21.0,2.5,13.8,9.6,4.8,61.3,29.1,1,0,0,1,0,335,83523,30.96,28267.95,72.15,23.5,0.4,0.69,1.73,1.53


### Concatenating the numeric and categorical dataframes

In [37]:
df_new = pd.concat([df_num, df_cat], axis=1)
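
A column-wise `pd.concat` matches rows by index label rather than by position, so it is worth confirming that both frames share the same index before combining them. The sketch below uses toy frames (`num_demo` and `cat_demo` are hypothetical stand-ins for `df_num` and `df_cat`) to show the aligned case and what happens when the indexes drift apart.

```python
import pandas as pd

# Toy stand-ins for df_num and df_cat (hypothetical data)
num_demo = pd.DataFrame({"Age": [21, 42, 28]})
cat_demo = pd.DataFrame({"Gender_Male": [1, 1, 0]})

# axis=1 places the frames side by side, matching rows by index label
combined = pd.concat([num_demo, cat_demo], axis=1)

# Simulate a frame whose index no longer lines up with the other one
shifted = cat_demo.copy()
shifted.index = [1, 2, 3]
misaligned = pd.concat([num_demo, shifted], axis=1)

print(combined.shape)                  # rows are matched one to one
print(misaligned.shape)                # an extra row appears
print(misaligned.isna().sum().sum())   # unmatched cells are filled with NaN
```

If the row count of the result differs from the inputs, resetting or realigning the indexes before concatenating is one way to resolve it.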

### Examining newly created full dataframe, with both numeric and categorical variables

In [38]:
df_new.head()

Unnamed: 0,Unique ID,Age,year,month,week_of_year,day_of_month,day_of_week,day_of_year,pop2000,pop2010,pop2011,pop2012,pop2013,pop2014,pop2015,pop2016,pop2017,age_under_5_2010,age_under_5_2017,age_under_18_2010,age_over_65_2010,age_over_65_2017,median_age_2017,female_2010,white_2010,black_2010,black_2017,native_2010,native_2017,asian_2010,asian_2017,pac_isl_2017,other_single_race_2017,two_plus_races_2010,two_plus_races_2017,hispanic_2010,hispanic_2017,white_not_hispanic_2010,white_not_hispanic_2017,speak_english_only_2017,no_move_in_one_plus_year_2010,foreign_born_2010,foreign_spoken_at_home_2010,women_16_to_50_birth_rate_2017,hs_grad_2010,hs_grad_2016,hs_grad_2017,some_college_2016,some_college_2017,bachelors_2010,bachelors_2016,bachelors_2017,veterans_2010,veterans_2017,mean_work_travel_2010,mean_work_travel_2017,broadband_2017,computer_2017,housing_units_2010,homeownership_2010,housing_multi_unit_2010,median_val_owner_occupied_2010,households_2010,households_2017,persons_per_household_2010,persons_per_household_2017,per_capita_income_2010,per_capita_income_2017,median_household_income_2010,median_household_income_2016,median_household_income_2017,private_nonfarm_establishments_2009,private_nonfarm_employment_2009,percent_change_private_nonfarm_employment_2009,nonemployment_establishments_2009,sales_2007,sales_per_capita_2007,building_permits_2010,fed_spending_2009,area_2010,density_2010,poverty_2010,poverty_2016,poverty_2017,poverty_age_under_5_2017,poverty_age_under_18_2017,civilian_labor_force_2007,employed_2007,unemployed_2007,unemployment_rate_2007,civilian_labor_force_2008,employed_2008,unemployed_2008,unemployment_rate_2008,civilian_labor_force_2009,employed_2009,unemployed_2009,unemployment_rate_2009,civilian_labor_force_2010,employed_2010,unemployed_2010,unemployment_rate_2010,civilian_labor_force_2011,employed_2011,unemployed_2011,unemployment_rate_2011,civilian_labor_force_2012,employed_2012,unemployed_2012,unemployment_rate_2012,civilian_labor_force_201
3,employed_2013,unemployed_2013,unemployment_rate_2013,civilian_labor_force_2014,employed_2014,unemployed_2014,unemployment_rate_2014,civilian_labor_force_2015,employed_2015,unemployed_2015,unemployment_rate_2015,civilian_labor_force_2016,employed_2016,unemployed_2016,unemployment_rate_2016,uninsured_2017,uninsured_age_under_19_2017,uninsured_age_over_74_2017,civilian_labor_force_2017,employed_2017,unemployed_2017,unemployment_rate_2017,age_over_18_2019,age_over_65_2019,age_over_85_2019,age_under_5_2019,asian_2019,avg_family_size_2019,bachelors_2019,black_2019,hispanic_2019,household_has_broadband_2019,household_has_computer_2019,household_has_smartphone_2019,households_2019,households_speak_asian_or_pac_isl_2019,households_speak_limited_english_2019,households_speak_other_2019,households_speak_other_indo_euro_lang_2019,households_speak_spanish_2019,housing_mobile_homes_2019,housing_one_unit_structures_2019,housing_two_unit_structures_2019,hs_grad_2019,mean_household_income_2019,median_age_2019,median_household_income_2019,median_individual_income_2019,median_individual_income_age_25plus_2019,native_2019,other_single_race_2019,pac_isl_2019,per_capita_income_2019,persons_per_household_2019,pop_2019,two_plus_races_2019,unemployment_rate_2019,uninsured_2019,uninsured_65_and_older_2019,uninsured_under_19_2019,uninsured_under_6_2019,veterans_2019,white_2019,white_not_hispanic_2019,metro_2013_1.0,metro_2013_Unknown,smoking_ban_2010_comprehensive,smoking_ban_2010_none,smoking_ban_2010_partial,rank,count,prop100k,cum_prop100k,pctwhite,pctblack,pctapi,pctaian,pct2prace,pcthispanic,Gender_Male,Gender_Transgender,State_AL,State_AR,State_AZ,State_CA,State_CO,State_CT,State_DE,State_FL,State_GA,State_HI,State_IA,State_ID,State_IL,State_IN,State_KS,State_KY,State_LA,State_MA,State_MD,State_ME,State_MI,State_MN,State_MO,State_MS,State_MT,State_NC,State_ND,State_NE,State_NH,State_NJ,State_NM,State_NV,State_NY,State_OH,State_OK,State_OR,State_PA,State_RI,State_SC,State_SD,State_TN,S
tate_TX,State_UT,State_VA,State_VT,State_WA,State_WI,State_WV,State_WY,Highest level of force_Asphyxiation/Restrain,Highest level of force_Asphyxiation/Restrained,Highest level of force_Beaten/Bludgeoned with instrument,Highest level of force_Burned/Smoke inhalation,Highest level of force_Chemical agent/Pepper spray,Highest level of force_Drowned,Highest level of force_Drug overdose,Highest level of force_Fell from a height,Highest level of force_Gunshot,Highest level of force_Medical emergency,Highest level of force_Other,Highest level of force_Restrain/Asphyxiation,Highest level of force_Stabbed,Highest level of force_Tasered,Highest level of force_Undetermined,Highest level of force_Vehicle,Intended use of force (Developing)_Less-than-lethal force,Intended use of force (Developing)_No,Intended use of force (Developing)_Pursuit,Intended use of force (Developing)_Suicide,Intended use of force (Developing)_Undetermined,Intended use of force (Developing)_Vehic/Purs,Intended use of force (Developing)_Vehicle,"Foreknowledge of mental illness? INTERNAL USE, NOT FOR ANALYSIS_No","Foreknowledge of mental illness? INTERNAL USE, NOT FOR ANALYSIS_Unknown","Foreknowledge of mental illness? INTERNAL USE, NOT FOR ANALYSIS_Yes",Agency_multiple_agencies,Agency_other,Agency_police,Agency_sheriff
0,25747.0,21.0,2000.0,1.0,0.0,1.0,5.0,1.0,2061162.0,1820584,1802164.0,1794073.0,1777790.0,1769007.0,1762098.0,1756598.0,1753616.0,6.5,6.5,25.4,12.7,14.4,37.9,52.0,52.3,40.5,19.55,0.4,0.16,2.5,1.56,0.01,0.97,2.4,1.17,5.2,5.72,49.6,49.68,86.0,85.4,7.6,12.0,5.9,83.2,85.1,85.6,32.6,32.6,20.2,22.3,22.8,120005,6.4,24.6,25.2,68.9,83.6,821693,67.2,23.4,121100,690943,673143.0,2.67,2.59,22125,24209.27,42241,43570.0,43702.0,32960,593483,-21.7,117124.0,17275751.0,8720.0,735,19357718.0,612.08,2974.4,21.4,22.9,23.7,39.2,35.3,883501.0,810552.0,72949.0,8.26,863865.0,783771.0,80094.0,9.27,869996.0,728832.0,141164.0,16.23,802754.0,678426.0,124328.0,15.49,774443.0,674177.0,100266.0,12.95,771845.0,681453.0,90392.0,11.71,776560.0,687300.0,89260.0,11.49,768439.0,694244.0,74195.0,9.66,758684.0,706075.0,52609.0,6.93,778434.0,729654.0,48780.0,6.27,8.8,3.4,0.4,789088.0,746415.0,42673.0,5.41,76.2,15.1,2.0,6.6,3.4,3.33,23.9,38.7,5.9,75.5,69.2,77.1,682282,1.4,2.5,4.3,3.9,4.1,38.0,1.7,62.0,86.5,67907,37.9,47301,26659,36588,0.3,1.9,0.0,27282,2.54,1757299,2.5,8.7,6.3,0.7,2.8,1.9,5.9,53.1,49.5,1,0,0,1,0,335,83523,30.96,28267.95,72.15,23.5,0.4,0.69,1.73,1.53,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,1,0,0,0,0,0,0
1,25227.0,42.0,2018.0,12.0,49.0,13.0,3.0,347.0,897472.0,927644,933268.0,939425.0,938825.0,938434.0,937885.0,937130.0,936961.0,7.2,7.2,26.4,10.3,12.2,35.3,52.3,40.6,52.1,26.68,0.2,0.08,2.3,1.26,0.02,1.38,1.4,0.87,5.6,6.1,38.7,36.45,90.7,81.6,6.0,8.5,5.8,84.9,87.1,87.6,30.1,29.9,27.8,30.2,30.6,62382,7.5,22.4,22.9,70.5,82.4,398274,61.7,27.6,135300,340443,349207.0,2.65,2.64,25002,27419.32,44705,47639.0,48415.0,20262,428357,-10.3,70282.0,11932863.0,12971.0,1548,8657013.0,763.17,1215.5,19.7,20.8,20.8,39.5,33.7,446771.0,424197.0,22574.0,5.05,440054.0,409635.0,30419.0,6.91,438125.0,393849.0,44276.0,10.11,448829.0,405047.0,43782.0,9.75,451309.0,408345.0,42964.0,9.52,446126.0,407723.0,38403.0,8.61,438627.0,399995.0,38632.0,8.81,427667.0,395109.0,32558.0,7.61,429297.0,401766.0,27531.0,6.41,434261.0,411126.0,23135.0,5.33,12.8,5.9,0.4,439415.0,420439.0,18976.0,4.32,74.9,13.1,1.5,7.1,2.6,3.38,31.6,53.7,6.4,75.3,67.5,76.5,351194,1.7,1.8,1.0,1.8,4.9,44.9,0.9,55.1,88.4,76734,35.6,51657,28181,36699,0.2,2.7,0.0,30104,2.62,936374,1.7,6.8,11.5,1.0,5.6,4.7,7.2,39.1,35.8,1,0,1,0,0,335,83523,30.96,28267.95,72.15,23.5,0.4,0.69,1.73,1.53,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,1,0
2,13379.0,28.0,2013.0,9.0,35.0,7.0,5.0,250.0,5376741.0,5194675,5217049.0,5237174.0,5250498.0,5253756.0,5245831.0,5231356.0,5211263.0,6.6,6.3,23.7,11.9,13.5,36.4,51.6,55.4,24.8,11.86,0.4,0.13,6.2,3.49,0.02,4.94,2.5,1.25,24.0,25.05,43.9,42.68,64.9,85.8,21.0,33.7,5.1,83.2,85.9,86.2,25.7,25.5,33.2,36.5,37.2,244059,4.3,31.8,32.9,76.7,85.9,2180359,60.4,54.0,265800,1936481,1956561.0,2.63,2.63,29335,33031.18,53942,60025.0,59426.0,127868,2245334,-12.1,408366.0,60585557.0,11571.0,2734,44219617.0,945.33,5495.1,15.3,15.0,15.9,23.3,22.8,2615750.0,2478215.0,137535.0,5.26,2615675.0,2447178.0,168497.0,6.44,2604692.0,2330033.0,274659.0,10.54,2643833.0,2356472.0,287361.0,10.87,2636434.0,2360934.0,275500.0,10.45,2653326.0,2397792.0,255534.0,9.63,2671904.0,2414722.0,257182.0,9.63,2653342.0,2455149.0,198193.0,7.47,2648861.0,2484494.0,164367.0,6.21,2669585.0,2507250.0,162335.0,6.08,11.1,3.7,1.1,2653153.0,2514113.0,139040.0,5.24,78.0,14.3,1.9,6.2,7.3,3.39,38.8,23.4,25.3,81.4,77.1,80.2,1972108,4.5,7.7,2.0,9.9,17.6,43.1,0.7,56.9,87.1,95677,36.8,64660,32955,44142,0.3,9.6,0.0,37552,2.59,5198275,2.7,6.6,8.8,1.3,3.4,2.3,3.9,56.7,42.3,1,0,1,0,0,335,83523,30.96,28267.95,72.15,23.5,0.4,0.69,1.73,1.53,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,1,0
3,3153.0,27.0,2004.0,2.0,6.0,9.0,0.0,40.0,661645.0,839631,848767.0,855237.0,864014.0,871895.0,879607.0,885086.0,893119.0,8.7,8.2,30.3,9.0,10.2,31.3,48.4,59.5,5.8,2.74,1.5,0.55,4.2,2.35,0.09,5.03,4.5,1.68,49.2,52.22,38.6,35.35,55.9,79.4,20.5,41.0,6.6,71.1,73.6,73.8,30.7,30.3,14.7,15.7,15.8,47365,6.1,23.2,23.4,73.9,83.0,284367,61.4,18.1,217100,248057,264993.0,3.14,3.2,20100,21636.25,47089,49812.0,50826.0,12111,179606,19.5,41628.0,7876043.0,10037.0,1656,5762188.0,8131.92,103.3,20.6,22.4,22.6,34.3,31.0,345187.0,316959.0,28228.0,8.18,359141.0,323808.0,35333.0,9.84,362714.0,311707.0,51007.0,14.06,371515.0,313361.0,58154.0,15.65,382367.0,325712.0,56655.0,14.82,391888.0,340355.0,51533.0,13.15,393440.0,347213.0,46227.0,11.75,393459.0,352504.0,40955.0,10.41,390873.0,350944.0,39929.0,10.22,388445.0,347996.0,40449.0,10.41,11.2,4.5,0.9,384944.0,349502.0,35442.0,9.21,70.9,10.7,1.2,7.9,4.7,3.69,16.4,5.5,53.3,79.7,70.0,79.3,270282,3.1,9.5,0.8,1.9,37.8,41.7,6.6,58.3,74.1,73478,31.6,53350,25013,32876,1.0,10.7,0.2,23326,3.17,887641,3.5,9.4,7.9,1.0,3.2,2.3,5.7,74.4,34.2,1,0,0,1,0,335,83523,30.96,28267.95,72.15,23.5,0.4,0.69,1.73,1.53,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,1,0,0,0,0,1
4,9594.0,31.0,2010.0,11.0,44.0,4.0,3.0,308.0,2218899.0,2368139,2408133.0,2454781.0,2483807.0,2517417.0,2554233.0,2587462.0,2618148.0,8.1,7.6,27.6,8.8,10.0,33.3,50.6,53.5,22.3,11.22,0.7,0.16,5.0,3.0,0.03,3.58,2.8,1.35,38.3,39.63,33.1,30.22,57.4,80.9,23.0,38.8,5.9,76.5,78.0,78.3,25.8,25.6,28.0,29.7,30.1,112642,5.1,25.7,27.2,76.5,87.2,943257,54.7,38.9,129700,832360,906179.0,2.75,2.78,26185,29202.16,47974,54429.0,53626.0,61359,1253122,-15.2,185432.0,33177208.0,13929.0,5485,17621936.0,871.28,2718.0,17.6,16.3,17.7,27.6,26.9,1137194.0,1085427.0,51767.0,4.55,1140659.0,1080079.0,60580.0,5.31,1143197.0,1049081.0,94116.0,8.23,1194015.0,1091493.0,102522.0,8.59,1212288.0,1113577.0,98711.0,8.14,1223343.0,1136230.0,87113.0,7.12,1234933.0,1153569.0,81364.0,6.59,1250984.0,1183245.0,67739.0,5.41,1261721.0,1207803.0,53918.0,4.27,1296256.0,1244191.0,52065.0,4.02,22.1,13.5,1.5,1333933.0,1282785.0,51148.0,3.83,73.6,10.5,1.2,7.5,6.3,3.51,31.5,22.6,40.2,81.1,74.1,82.9,928341,4.1,10.8,2.1,3.4,29.2,50.0,1.5,50.0,79.3,88716,33.4,59607,31136,38500,0.4,6.8,0.0,32653,2.78,2606868,2.6,4.4,21.0,2.5,13.8,9.6,4.8,61.3,29.1,1,0,0,1,0,335,83523,30.96,28267.95,72.15,23.5,0.4,0.69,1.73,1.53,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,1,0,0,0,1,0,0,0,0,1,0


### Reattaching the description and race columns

In [39]:
df_new['description'] = df['Brief description']
df_new['Race'] = df['Race']
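
Assigning a Series as a new column also aligns on index labels, so the text columns land on the correct rows only if `df` and `df_new` share the same index. A minimal sketch with hypothetical frames (`src` and `encoded` are illustrative names, not from the notebook):

```python
import pandas as pd

# Hypothetical source frame holding the text columns, like the original df
src = pd.DataFrame(
    {"Brief description": ["a", "b", "c"], "Race": ["x", "y", "z"]},
    index=[0, 1, 2],
)
# Hypothetical encoded frame whose rows are stored in a different order
encoded = pd.DataFrame({"Age": [28, 21, 42]}, index=[2, 0, 1])

# Column assignment matches on index labels, so row order does not matter
encoded["description"] = src["Brief description"]
encoded["Race"] = src["Race"]

print(encoded)
```

Labels present in `encoded` but missing from `src` would receive NaN, which makes a quick `isna()` check on the new columns a reasonable safeguard.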

In [40]:
df_new.head()

Unnamed: 0,Unique ID,Age,year,month,week_of_year,day_of_month,day_of_week,day_of_year,pop2000,pop2010,pop2011,pop2012,pop2013,pop2014,pop2015,pop2016,pop2017,age_under_5_2010,age_under_5_2017,age_under_18_2010,age_over_65_2010,age_over_65_2017,median_age_2017,female_2010,white_2010,black_2010,black_2017,native_2010,native_2017,asian_2010,asian_2017,pac_isl_2017,other_single_race_2017,two_plus_races_2010,two_plus_races_2017,hispanic_2010,hispanic_2017,white_not_hispanic_2010,white_not_hispanic_2017,speak_english_only_2017,no_move_in_one_plus_year_2010,foreign_born_2010,foreign_spoken_at_home_2010,women_16_to_50_birth_rate_2017,hs_grad_2010,hs_grad_2016,hs_grad_2017,some_college_2016,some_college_2017,bachelors_2010,bachelors_2016,bachelors_2017,veterans_2010,veterans_2017,mean_work_travel_2010,mean_work_travel_2017,broadband_2017,computer_2017,housing_units_2010,homeownership_2010,housing_multi_unit_2010,median_val_owner_occupied_2010,households_2010,households_2017,persons_per_household_2010,persons_per_household_2017,per_capita_income_2010,per_capita_income_2017,median_household_income_2010,median_household_income_2016,median_household_income_2017,private_nonfarm_establishments_2009,private_nonfarm_employment_2009,percent_change_private_nonfarm_employment_2009,nonemployment_establishments_2009,sales_2007,sales_per_capita_2007,building_permits_2010,fed_spending_2009,area_2010,density_2010,poverty_2010,poverty_2016,poverty_2017,poverty_age_under_5_2017,poverty_age_under_18_2017,civilian_labor_force_2007,employed_2007,unemployed_2007,unemployment_rate_2007,civilian_labor_force_2008,employed_2008,unemployed_2008,unemployment_rate_2008,civilian_labor_force_2009,employed_2009,unemployed_2009,unemployment_rate_2009,civilian_labor_force_2010,employed_2010,unemployed_2010,unemployment_rate_2010,civilian_labor_force_2011,employed_2011,unemployed_2011,unemployment_rate_2011,civilian_labor_force_2012,employed_2012,unemployed_2012,unemployment_rate_2012,civilian_labor_force_201
3,employed_2013,unemployed_2013,unemployment_rate_2013,civilian_labor_force_2014,employed_2014,unemployed_2014,unemployment_rate_2014,civilian_labor_force_2015,employed_2015,unemployed_2015,unemployment_rate_2015,civilian_labor_force_2016,employed_2016,unemployed_2016,unemployment_rate_2016,uninsured_2017,uninsured_age_under_19_2017,uninsured_age_over_74_2017,civilian_labor_force_2017,employed_2017,unemployed_2017,unemployment_rate_2017,age_over_18_2019,age_over_65_2019,age_over_85_2019,age_under_5_2019,asian_2019,avg_family_size_2019,bachelors_2019,black_2019,hispanic_2019,household_has_broadband_2019,household_has_computer_2019,household_has_smartphone_2019,households_2019,households_speak_asian_or_pac_isl_2019,households_speak_limited_english_2019,households_speak_other_2019,households_speak_other_indo_euro_lang_2019,households_speak_spanish_2019,housing_mobile_homes_2019,housing_one_unit_structures_2019,housing_two_unit_structures_2019,hs_grad_2019,mean_household_income_2019,median_age_2019,median_household_income_2019,median_individual_income_2019,median_individual_income_age_25plus_2019,native_2019,other_single_race_2019,pac_isl_2019,per_capita_income_2019,persons_per_household_2019,pop_2019,two_plus_races_2019,unemployment_rate_2019,uninsured_2019,uninsured_65_and_older_2019,uninsured_under_19_2019,uninsured_under_6_2019,veterans_2019,white_2019,white_not_hispanic_2019,metro_2013_1.0,metro_2013_Unknown,smoking_ban_2010_comprehensive,smoking_ban_2010_none,smoking_ban_2010_partial,rank,count,prop100k,cum_prop100k,pctwhite,pctblack,pctapi,pctaian,pct2prace,pcthispanic,Gender_Male,Gender_Transgender,State_AL,State_AR,State_AZ,State_CA,State_CO,State_CT,State_DE,State_FL,State_GA,State_HI,State_IA,State_ID,State_IL,State_IN,State_KS,State_KY,State_LA,State_MA,State_MD,State_ME,State_MI,State_MN,State_MO,State_MS,State_MT,State_NC,State_ND,State_NE,State_NH,State_NJ,State_NM,State_NV,State_NY,State_OH,State_OK,State_OR,State_PA,State_RI,State_SC,State_SD,State_TN,S
tate_TX,State_UT,State_VA,State_VT,State_WA,State_WI,State_WV,State_WY,Highest level of force_Asphyxiation/Restrain,Highest level of force_Asphyxiation/Restrained,Highest level of force_Beaten/Bludgeoned with instrument,Highest level of force_Burned/Smoke inhalation,Highest level of force_Chemical agent/Pepper spray,Highest level of force_Drowned,Highest level of force_Drug overdose,Highest level of force_Fell from a height,Highest level of force_Gunshot,Highest level of force_Medical emergency,Highest level of force_Other,Highest level of force_Restrain/Asphyxiation,Highest level of force_Stabbed,Highest level of force_Tasered,Highest level of force_Undetermined,Highest level of force_Vehicle,Intended use of force (Developing)_Less-than-lethal force,Intended use of force (Developing)_No,Intended use of force (Developing)_Pursuit,Intended use of force (Developing)_Suicide,Intended use of force (Developing)_Undetermined,Intended use of force (Developing)_Vehic/Purs,Intended use of force (Developing)_Vehicle,"Foreknowledge of mental illness? INTERNAL USE, NOT FOR ANALYSIS_No","Foreknowledge of mental illness? INTERNAL USE, NOT FOR ANALYSIS_Unknown","Foreknowledge of mental illness? INTERNAL USE, NOT FOR ANALYSIS_Yes",Agency_multiple_agencies,Agency_other,Agency_police,Agency_sheriff,description,Race
0,25747.0,21.0,2000.0,1.0,0.0,1.0,5.0,1.0,2061162.0,1820584,1802164.0,1794073.0,1777790.0,1769007.0,1762098.0,1756598.0,1753616.0,6.5,6.5,25.4,12.7,14.4,37.9,52.0,52.3,40.5,19.55,0.4,0.16,2.5,1.56,0.01,0.97,2.4,1.17,5.2,5.72,49.6,49.68,86.0,85.4,7.6,12.0,5.9,83.2,85.1,85.6,32.6,32.6,20.2,22.3,22.8,120005,6.4,24.6,25.2,68.9,83.6,821693,67.2,23.4,121100,690943,673143.0,2.67,2.59,22125,24209.27,42241,43570.0,43702.0,32960,593483,-21.7,117124.0,17275751.0,8720.0,735,19357718.0,612.08,2974.4,21.4,22.9,23.7,39.2,35.3,883501.0,810552.0,72949.0,8.26,863865.0,783771.0,80094.0,9.27,869996.0,728832.0,141164.0,16.23,802754.0,678426.0,124328.0,15.49,774443.0,674177.0,100266.0,12.95,771845.0,681453.0,90392.0,11.71,776560.0,687300.0,89260.0,11.49,768439.0,694244.0,74195.0,9.66,758684.0,706075.0,52609.0,6.93,778434.0,729654.0,48780.0,6.27,8.8,3.4,0.4,789088.0,746415.0,42673.0,5.41,76.2,15.1,2.0,6.6,3.4,3.33,23.9,38.7,5.9,75.5,69.2,77.1,682282,1.4,2.5,4.3,3.9,4.1,38.0,1.7,62.0,86.5,67907,37.9,47301,26659,36588,0.3,1.9,0.0,27282,2.54,1757299,2.5,8.7,6.3,0.7,2.8,1.9,5.9,53.1,49.5,1,0,0,1,0,335,83523,30.96,28267.95,72.15,23.5,0.4,0.69,1.73,1.53,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,1,0,0,0,0,0,0,Two Detroit men killed when their car crashed ...,African-American/Black
1,25227.0,42.0,2018.0,12.0,49.0,13.0,3.0,347.0,897472.0,927644,933268.0,939425.0,938825.0,938434.0,937885.0,937130.0,936961.0,7.2,7.2,26.4,10.3,12.2,35.3,52.3,40.6,52.1,26.68,0.2,0.08,2.3,1.26,0.02,1.38,1.4,0.87,5.6,6.1,38.7,36.45,90.7,81.6,6.0,8.5,5.8,84.9,87.1,87.6,30.1,29.9,27.8,30.2,30.6,62382,7.5,22.4,22.9,70.5,82.4,398274,61.7,27.6,135300,340443,349207.0,2.65,2.64,25002,27419.32,44705,47639.0,48415.0,20262,428357,-10.3,70282.0,11932863.0,12971.0,1548,8657013.0,763.17,1215.5,19.7,20.8,20.8,39.5,33.7,446771.0,424197.0,22574.0,5.05,440054.0,409635.0,30419.0,6.91,438125.0,393849.0,44276.0,10.11,448829.0,405047.0,43782.0,9.75,451309.0,408345.0,42964.0,9.52,446126.0,407723.0,38403.0,8.61,438627.0,399995.0,38632.0,8.81,427667.0,395109.0,32558.0,7.61,429297.0,401766.0,27531.0,6.41,434261.0,411126.0,23135.0,5.33,12.8,5.9,0.4,439415.0,420439.0,18976.0,4.32,74.9,13.1,1.5,7.1,2.6,3.38,31.6,53.7,6.4,75.3,67.5,76.5,351194,1.7,1.8,1.0,1.8,4.9,44.9,0.9,55.1,88.4,76734,35.6,51657,28181,36699,0.2,2.7,0.0,30104,2.62,936374,1.7,6.8,11.5,1.0,5.6,4.7,7.2,39.1,35.8,1,0,1,0,0,335,83523,30.96,28267.95,72.15,23.5,0.4,0.69,1.73,1.53,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,1,0,Andre Horton was reportedly pointing a realist...,African-American/Black
2,13379.0,28.0,2013.0,9.0,35.0,7.0,5.0,250.0,5376741.0,5194675,5217049.0,5237174.0,5250498.0,5253756.0,5245831.0,5231356.0,5211263.0,6.6,6.3,23.7,11.9,13.5,36.4,51.6,55.4,24.8,11.86,0.4,0.13,6.2,3.49,0.02,4.94,2.5,1.25,24.0,25.05,43.9,42.68,64.9,85.8,21.0,33.7,5.1,83.2,85.9,86.2,25.7,25.5,33.2,36.5,37.2,244059,4.3,31.8,32.9,76.7,85.9,2180359,60.4,54.0,265800,1936481,1956561.0,2.63,2.63,29335,33031.18,53942,60025.0,59426.0,127868,2245334,-12.1,408366.0,60585557.0,11571.0,2734,44219617.0,945.33,5495.1,15.3,15.0,15.9,23.3,22.8,2615750.0,2478215.0,137535.0,5.26,2615675.0,2447178.0,168497.0,6.44,2604692.0,2330033.0,274659.0,10.54,2643833.0,2356472.0,287361.0,10.87,2636434.0,2360934.0,275500.0,10.45,2653326.0,2397792.0,255534.0,9.63,2671904.0,2414722.0,257182.0,9.63,2653342.0,2455149.0,198193.0,7.47,2648861.0,2484494.0,164367.0,6.21,2669585.0,2507250.0,162335.0,6.08,11.1,3.7,1.1,2653153.0,2514113.0,139040.0,5.24,78.0,14.3,1.9,6.2,7.3,3.39,38.8,23.4,25.3,81.4,77.1,80.2,1972108,4.5,7.7,2.0,9.9,17.6,43.1,0.7,56.9,87.1,95677,36.8,64660,32955,44142,0.3,9.6,0.0,37552,2.59,5198275,2.7,6.6,8.8,1.3,3.4,2.3,3.9,56.7,42.3,1,0,1,0,0,335,83523,30.96,28267.95,72.15,23.5,0.4,0.69,1.73,1.53,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,1,0,A sleeping Marlon Horton was asked to leave th...,African-American/Black
3,3153.0,27.0,2004.0,2.0,6.0,9.0,0.0,40.0,661645.0,839631,848767.0,855237.0,864014.0,871895.0,879607.0,885086.0,893119.0,8.7,8.2,30.3,9.0,10.2,31.3,48.4,59.5,5.8,2.74,1.5,0.55,4.2,2.35,0.09,5.03,4.5,1.68,49.2,52.22,38.6,35.35,55.9,79.4,20.5,41.0,6.6,71.1,73.6,73.8,30.7,30.3,14.7,15.7,15.8,47365,6.1,23.2,23.4,73.9,83.0,284367,61.4,18.1,217100,248057,264993.0,3.14,3.2,20100,21636.25,47089,49812.0,50826.0,12111,179606,19.5,41628.0,7876043.0,10037.0,1656,5762188.0,8131.92,103.3,20.6,22.4,22.6,34.3,31.0,345187.0,316959.0,28228.0,8.18,359141.0,323808.0,35333.0,9.84,362714.0,311707.0,51007.0,14.06,371515.0,313361.0,58154.0,15.65,382367.0,325712.0,56655.0,14.82,391888.0,340355.0,51533.0,13.15,393440.0,347213.0,46227.0,11.75,393459.0,352504.0,40955.0,10.41,390873.0,350944.0,39929.0,10.22,388445.0,347996.0,40449.0,10.41,11.2,4.5,0.9,384944.0,349502.0,35442.0,9.21,70.9,10.7,1.2,7.9,4.7,3.69,16.4,5.5,53.3,79.7,70.0,79.3,270282,3.1,9.5,0.8,1.9,37.8,41.7,6.6,58.3,74.1,73478,31.6,53350,25013,32876,1.0,10.7,0.2,23326,3.17,887641,3.5,9.4,7.9,1.0,3.2,2.3,5.7,74.4,34.2,1,0,0,1,0,335,83523,30.96,28267.95,72.15,23.5,0.4,0.69,1.73,1.53,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,1,0,0,0,0,1,Donnie Clay Coleman Jr. aka Leon Eric Horton d...,African-American/Black
4,9594.0,31.0,2010.0,11.0,44.0,4.0,3.0,308.0,2218899.0,2368139,2408133.0,2454781.0,2483807.0,2517417.0,2554233.0,2587462.0,2618148.0,8.1,7.6,27.6,8.8,10.0,33.3,50.6,53.5,22.3,11.22,0.7,0.16,5.0,3.0,0.03,3.58,2.8,1.35,38.3,39.63,33.1,30.22,57.4,80.9,23.0,38.8,5.9,76.5,78.0,78.3,25.8,25.6,28.0,29.7,30.1,112642,5.1,25.7,27.2,76.5,87.2,943257,54.7,38.9,129700,832360,906179.0,2.75,2.78,26185,29202.16,47974,54429.0,53626.0,61359,1253122,-15.2,185432.0,33177208.0,13929.0,5485,17621936.0,871.28,2718.0,17.6,16.3,17.7,27.6,26.9,1137194.0,1085427.0,51767.0,4.55,1140659.0,1080079.0,60580.0,5.31,1143197.0,1049081.0,94116.0,8.23,1194015.0,1091493.0,102522.0,8.59,1212288.0,1113577.0,98711.0,8.14,1223343.0,1136230.0,87113.0,7.12,1234933.0,1153569.0,81364.0,6.59,1250984.0,1183245.0,67739.0,5.41,1261721.0,1207803.0,53918.0,4.27,1296256.0,1244191.0,52065.0,4.02,22.1,13.5,1.5,1333933.0,1282785.0,51148.0,3.83,73.6,10.5,1.2,7.5,6.3,3.51,31.5,22.6,40.2,81.1,74.1,82.9,928341,4.1,10.8,2.1,3.4,29.2,50.0,1.5,50.0,79.3,88716,33.4,59607,31136,38500,0.4,6.8,0.0,32653,2.78,2606868,2.6,4.4,21.0,2.5,13.8,9.6,4.8,61.3,29.1,1,0,0,1,0,335,83523,30.96,28267.95,72.15,23.5,0.4,0.69,1.73,1.53,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,1,0,0,0,1,0,0,0,0,1,0,Horton parked near his ex-girlfriend's apartme...,African-American/Black


### Setting the index equal to the Unique ID number

In [41]:
df_new.set_index('Unique ID', inplace=True)

### Sorting index

In [42]:
df_new.sort_index(inplace=True)
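
The two indexing steps can also be chained without `inplace`, and it may be useful to verify that the identifier is unique before relying on it for lookups. A small sketch on toy data (`demo` is a hypothetical frame, and the uniqueness check is an added suggestion rather than part of the original workflow):

```python
import pandas as pd

# Toy frame with an identifier column (hypothetical values)
demo = pd.DataFrame({"Unique ID": [25747.0, 1.0, 13379.0], "Age": [21, 24, 28]})

# The non-inplace form returns a new frame, which allows chaining
indexed = demo.set_index("Unique ID").sort_index()

# A row identifier only works well as an index if it has no duplicates
print(indexed.index.is_unique)
print(indexed.index.is_monotonic_increasing)
print(indexed.index.tolist())
```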

### Exporting full dataset to CSV file

In [43]:
df_new.to_csv('./cleaned_data/full.csv')
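
Because `to_csv` writes the index as the first column by default, the file should be read back with `index_col` to restore `Unique ID` as the index. The sketch below round-trips a toy frame through an in-memory buffer instead of a file (`demo` and `buffer` are hypothetical names).

```python
import io

import pandas as pd

# Toy frame indexed by a named identifier (hypothetical values)
demo = pd.DataFrame(
    {"Age": [24, 21]},
    index=pd.Index([1.0, 25747.0], name="Unique ID"),
)

# to_csv writes the index as the first column by default
buffer = io.StringIO()
demo.to_csv(buffer)

# Reading back without index_col turns the index into an ordinary column
buffer.seek(0)
as_column = pd.read_csv(buffer)

# Reading back with index_col restores it as the index
buffer.seek(0)
restored = pd.read_csv(buffer, index_col="Unique ID")

print(as_column.columns.tolist())
print(restored.index.name)
```

When the saved index has no name, pandas labels the resulting column `Unnamed: 0` on read, which is a likely origin of the `Unnamed: 0` columns visible in the outputs above.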

### Creating training (labeled) data by removing rows where race was unspecified

In [44]:
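# .copy() makes train_set independent of df_new, so later edits do not trigger SettingWithCopyWarning
train_set = df_new[df_new['Race'] != 'Race unspecified'].copy()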
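
The same boolean mask can produce both the labeled rows and the unlabeled remainder in one pass. This is a sketch on toy data (`demo`, `has_label`, `labeled`, and `unlabeled` are hypothetical names); keeping the unlabeled rows is an assumption about what a later step might need, not something the notebook states here.

```python
import pandas as pd

# Toy frame with one unlabeled row (hypothetical values)
demo = pd.DataFrame(
    {
        "Age": [21, 42, 35],
        "Race": ["African-American/Black", "Race unspecified", "Hispanic/Latino"],
    }
)

# Boolean mask marking rows that carry a usable label
has_label = demo["Race"] != "Race unspecified"

# .copy() yields independent frames rather than views of demo
labeled = demo[has_label].copy()
unlabeled = demo[~has_label].copy()

print(len(labeled), len(unlabeled))
```

Note that a missing value (NaN) in `Race` compares as not equal to `'Race unspecified'`, so such rows would end up in the labeled set unless they are handled separately.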

### Examining training data

In [45]:
train_set.head()

Unnamed: 0_level_0,Age,year,month,week_of_year,day_of_month,day_of_week,day_of_year,pop2000,pop2010,pop2011,pop2012,pop2013,pop2014,pop2015,pop2016,pop2017,age_under_5_2010,age_under_5_2017,age_under_18_2010,age_over_65_2010,age_over_65_2017,median_age_2017,female_2010,white_2010,black_2010,black_2017,native_2010,native_2017,asian_2010,asian_2017,pac_isl_2017,other_single_race_2017,two_plus_races_2010,two_plus_races_2017,hispanic_2010,hispanic_2017,white_not_hispanic_2010,white_not_hispanic_2017,speak_english_only_2017,no_move_in_one_plus_year_2010,foreign_born_2010,foreign_spoken_at_home_2010,women_16_to_50_birth_rate_2017,hs_grad_2010,hs_grad_2016,hs_grad_2017,some_college_2016,some_college_2017,bachelors_2010,bachelors_2016,bachelors_2017,veterans_2010,veterans_2017,mean_work_travel_2010,mean_work_travel_2017,broadband_2017,computer_2017,housing_units_2010,homeownership_2010,housing_multi_unit_2010,median_val_owner_occupied_2010,households_2010,households_2017,persons_per_household_2010,persons_per_household_2017,per_capita_income_2010,per_capita_income_2017,median_household_income_2010,median_household_income_2016,median_household_income_2017,private_nonfarm_establishments_2009,private_nonfarm_employment_2009,percent_change_private_nonfarm_employment_2009,nonemployment_establishments_2009,sales_2007,sales_per_capita_2007,building_permits_2010,fed_spending_2009,area_2010,density_2010,poverty_2010,poverty_2016,poverty_2017,poverty_age_under_5_2017,poverty_age_under_18_2017,civilian_labor_force_2007,employed_2007,unemployed_2007,unemployment_rate_2007,civilian_labor_force_2008,employed_2008,unemployed_2008,unemployment_rate_2008,civilian_labor_force_2009,employed_2009,unemployed_2009,unemployment_rate_2009,civilian_labor_force_2010,employed_2010,unemployed_2010,unemployment_rate_2010,civilian_labor_force_2011,employed_2011,unemployed_2011,unemployment_rate_2011,civilian_labor_force_2012,employed_2012,unemployed_2012,unemployment_rate_2012,civilian_labor_force_2013,
employed_2013,unemployed_2013,unemployment_rate_2013,civilian_labor_force_2014,employed_2014,unemployed_2014,unemployment_rate_2014,civilian_labor_force_2015,employed_2015,unemployed_2015,unemployment_rate_2015,civilian_labor_force_2016,employed_2016,unemployed_2016,unemployment_rate_2016,uninsured_2017,uninsured_age_under_19_2017,uninsured_age_over_74_2017,civilian_labor_force_2017,employed_2017,unemployed_2017,unemployment_rate_2017,age_over_18_2019,age_over_65_2019,age_over_85_2019,age_under_5_2019,asian_2019,avg_family_size_2019,bachelors_2019,black_2019,hispanic_2019,household_has_broadband_2019,household_has_computer_2019,household_has_smartphone_2019,households_2019,households_speak_asian_or_pac_isl_2019,households_speak_limited_english_2019,households_speak_other_2019,households_speak_other_indo_euro_lang_2019,households_speak_spanish_2019,housing_mobile_homes_2019,housing_one_unit_structures_2019,housing_two_unit_structures_2019,hs_grad_2019,mean_household_income_2019,median_age_2019,median_household_income_2019,median_individual_income_2019,median_individual_income_age_25plus_2019,native_2019,other_single_race_2019,pac_isl_2019,per_capita_income_2019,persons_per_household_2019,pop_2019,two_plus_races_2019,unemployment_rate_2019,uninsured_2019,uninsured_65_and_older_2019,uninsured_under_19_2019,uninsured_under_6_2019,veterans_2019,white_2019,white_not_hispanic_2019,metro_2013_1.0,metro_2013_Unknown,smoking_ban_2010_comprehensive,smoking_ban_2010_none,smoking_ban_2010_partial,rank,count,prop100k,cum_prop100k,pctwhite,pctblack,pctapi,pctaian,pct2prace,pcthispanic,Gender_Male,Gender_Transgender,State_AL,State_AR,State_AZ,State_CA,State_CO,State_CT,State_DE,State_FL,State_GA,State_HI,State_IA,State_ID,State_IL,State_IN,State_KS,State_KY,State_LA,State_MA,State_MD,State_ME,State_MI,State_MN,State_MO,State_MS,State_MT,State_NC,State_ND,State_NE,State_NH,State_NJ,State_NM,State_NV,State_NY,State_OH,State_OK,State_OR,State_PA,State_RI,State_SC,State_SD,State_TN,Sta
te_TX,State_UT,State_VA,State_VT,State_WA,State_WI,State_WV,State_WY,Highest level of force_Asphyxiation/Restrain,Highest level of force_Asphyxiation/Restrained,Highest level of force_Beaten/Bludgeoned with instrument,Highest level of force_Burned/Smoke inhalation,Highest level of force_Chemical agent/Pepper spray,Highest level of force_Drowned,Highest level of force_Drug overdose,Highest level of force_Fell from a height,Highest level of force_Gunshot,Highest level of force_Medical emergency,Highest level of force_Other,Highest level of force_Restrain/Asphyxiation,Highest level of force_Stabbed,Highest level of force_Tasered,Highest level of force_Undetermined,Highest level of force_Vehicle,Intended use of force (Developing)_Less-than-lethal force,Intended use of force (Developing)_No,Intended use of force (Developing)_Pursuit,Intended use of force (Developing)_Suicide,Intended use of force (Developing)_Undetermined,Intended use of force (Developing)_Vehic/Purs,Intended use of force (Developing)_Vehicle,"Foreknowledge of mental illness? INTERNAL USE, NOT FOR ANALYSIS_No","Foreknowledge of mental illness? INTERNAL USE, NOT FOR ANALYSIS_Unknown","Foreknowledge of mental illness? INTERNAL USE, NOT FOR ANALYSIS_Yes",Agency_multiple_agencies,Agency_other,Agency_police,Agency_sheriff,description,Race
Unique ID,Unnamed: 1_level_1,Unnamed: 2_level_1,Unnamed: 3_level_1,Unnamed: 4_level_1,Unnamed: 5_level_1,Unnamed: 6_level_1,Unnamed: 7_level_1,Unnamed: 8_level_1,Unnamed: 9_level_1,Unnamed: 10_level_1,Unnamed: 11_level_1,Unnamed: 12_level_1,Unnamed: 13_level_1,Unnamed: 14_level_1,Unnamed: 15_level_1,Unnamed: 16_level_1,Unnamed: 17_level_1,Unnamed: 18_level_1,Unnamed: 19_level_1,Unnamed: 20_level_1,Unnamed: 21_level_1,Unnamed: 22_level_1,Unnamed: 23_level_1,Unnamed: 24_level_1,Unnamed: 25_level_1,Unnamed: 26_level_1,Unnamed: 27_level_1,Unnamed: 28_level_1,Unnamed: 29_level_1,Unnamed: 30_level_1,Unnamed: 31_level_1,Unnamed: 32_level_1,Unnamed: 33_level_1,Unnamed: 34_level_1,Unnamed: 35_level_1,Unnamed: 36_level_1,Unnamed: 37_level_1,Unnamed: 38_level_1,Unnamed: 39_level_1,Unnamed: 40_level_1,Unnamed: 41_level_1,Unnamed: 42_level_1,Unnamed: 43_level_1,Unnamed: 44_level_1,Unnamed: 45_level_1,Unnamed: 46_level_1,Unnamed: 47_level_1,Unnamed: 48_level_1,Unnamed: 49_level_1,Unnamed: 50_level_1,Unnamed: 51_level_1,Unnamed: 52_level_1,Unnamed: 53_level_1,Unnamed: 54_level_1,Unnamed: 55_level_1,Unnamed: 56_level_1,Unnamed: 57_level_1,Unnamed: 58_level_1,Unnamed: 59_level_1,Unnamed: 60_level_1,Unnamed: 61_level_1,Unnamed: 62_level_1,Unnamed: 63_level_1,Unnamed: 64_level_1,Unnamed: 65_level_1,Unnamed: 66_level_1,Unnamed: 67_level_1,Unnamed: 68_level_1,Unnamed: 69_level_1,Unnamed: 70_level_1,Unnamed: 71_level_1,Unnamed: 72_level_1,Unnamed: 73_level_1,Unnamed: 74_level_1,Unnamed: 75_level_1,Unnamed: 76_level_1,Unnamed: 77_level_1,Unnamed: 78_level_1,Unnamed: 79_level_1,Unnamed: 80_level_1,Unnamed: 81_level_1,Unnamed: 82_level_1,Unnamed: 83_level_1,Unnamed: 84_level_1,Unnamed: 85_level_1,Unnamed: 86_level_1,Unnamed: 87_level_1,Unnamed: 88_level_1,Unnamed: 89_level_1,Unnamed: 90_level_1,Unnamed: 91_level_1,Unnamed: 92_level_1,Unnamed: 93_level_1,Unnamed: 94_level_1,Unnamed: 95_level_1,Unnamed: 96_level_1,Unnamed: 97_level_1,Unnamed: 98_level_1,Unnamed: 99_level_1,Unnamed: 
100_level_1,Unnamed: 101_level_1,Unnamed: 102_level_1,Unnamed: 103_level_1,Unnamed: 104_level_1,Unnamed: 105_level_1,Unnamed: 106_level_1,Unnamed: 107_level_1,Unnamed: 108_level_1,Unnamed: 109_level_1,Unnamed: 110_level_1,Unnamed: 111_level_1,Unnamed: 112_level_1,Unnamed: 113_level_1,Unnamed: 114_level_1,Unnamed: 115_level_1,Unnamed: 116_level_1,Unnamed: 117_level_1,Unnamed: 118_level_1,Unnamed: 119_level_1,Unnamed: 120_level_1,Unnamed: 121_level_1,Unnamed: 122_level_1,Unnamed: 123_level_1,Unnamed: 124_level_1,Unnamed: 125_level_1,Unnamed: 126_level_1,Unnamed: 127_level_1,Unnamed: 128_level_1,Unnamed: 129_level_1,Unnamed: 130_level_1,Unnamed: 131_level_1,Unnamed: 132_level_1,Unnamed: 133_level_1,Unnamed: 134_level_1,Unnamed: 135_level_1,Unnamed: 136_level_1,Unnamed: 137_level_1,Unnamed: 138_level_1,Unnamed: 139_level_1,Unnamed: 140_level_1,Unnamed: 141_level_1,Unnamed: 142_level_1,Unnamed: 143_level_1,Unnamed: 144_level_1,Unnamed: 145_level_1,Unnamed: 146_level_1,Unnamed: 147_level_1,Unnamed: 148_level_1,Unnamed: 149_level_1,Unnamed: 150_level_1,Unnamed: 151_level_1,Unnamed: 152_level_1,Unnamed: 153_level_1,Unnamed: 154_level_1,Unnamed: 155_level_1,Unnamed: 156_level_1,Unnamed: 157_level_1,Unnamed: 158_level_1,Unnamed: 159_level_1,Unnamed: 160_level_1,Unnamed: 161_level_1,Unnamed: 162_level_1,Unnamed: 163_level_1,Unnamed: 164_level_1,Unnamed: 165_level_1,Unnamed: 166_level_1,Unnamed: 167_level_1,Unnamed: 168_level_1,Unnamed: 169_level_1,Unnamed: 170_level_1,Unnamed: 171_level_1,Unnamed: 172_level_1,Unnamed: 173_level_1,Unnamed: 174_level_1,Unnamed: 175_level_1,Unnamed: 176_level_1,Unnamed: 177_level_1,Unnamed: 178_level_1,Unnamed: 179_level_1,Unnamed: 180_level_1,Unnamed: 181_level_1,Unnamed: 182_level_1,Unnamed: 183_level_1,Unnamed: 184_level_1,Unnamed: 185_level_1,Unnamed: 186_level_1,Unnamed: 187_level_1,Unnamed: 188_level_1,Unnamed: 189_level_1,Unnamed: 190_level_1,Unnamed: 191_level_1,Unnamed: 192_level_1,Unnamed: 193_level_1,Unnamed: 194_level_1,Unnamed: 
195_level_1,Unnamed: 196_level_1,Unnamed: 197_level_1,Unnamed: 198_level_1,Unnamed: 199_level_1,Unnamed: 200_level_1,Unnamed: 201_level_1,Unnamed: 202_level_1,Unnamed: 203_level_1,Unnamed: 204_level_1,Unnamed: 205_level_1,Unnamed: 206_level_1,Unnamed: 207_level_1,Unnamed: 208_level_1,Unnamed: 209_level_1,Unnamed: 210_level_1,Unnamed: 211_level_1,Unnamed: 212_level_1,Unnamed: 213_level_1,Unnamed: 214_level_1,Unnamed: 215_level_1,Unnamed: 216_level_1,Unnamed: 217_level_1,Unnamed: 218_level_1,Unnamed: 219_level_1,Unnamed: 220_level_1,Unnamed: 221_level_1,Unnamed: 222_level_1,Unnamed: 223_level_1,Unnamed: 224_level_1,Unnamed: 225_level_1,Unnamed: 226_level_1,Unnamed: 227_level_1,Unnamed: 228_level_1,Unnamed: 229_level_1,Unnamed: 230_level_1,Unnamed: 231_level_1,Unnamed: 232_level_1,Unnamed: 233_level_1,Unnamed: 234_level_1,Unnamed: 235_level_1,Unnamed: 236_level_1,Unnamed: 237_level_1,Unnamed: 238_level_1,Unnamed: 239_level_1,Unnamed: 240_level_1,Unnamed: 241_level_1,Unnamed: 242_level_1,Unnamed: 243_level_1,Unnamed: 244_level_1,Unnamed: 245_level_1,Unnamed: 246_level_1,Unnamed: 247_level_1,Unnamed: 248_level_1,Unnamed: 249_level_1,Unnamed: 250_level_1,Unnamed: 251_level_1,Unnamed: 252_level_1,Unnamed: 253_level_1,Unnamed: 254_level_1,Unnamed: 255_level_1,Unnamed: 256_level_1,Unnamed: 257_level_1,Unnamed: 258_level_1,Unnamed: 259_level_1,Unnamed: 260_level_1,Unnamed: 261_level_1,Unnamed: 262_level_1,Unnamed: 263_level_1,Unnamed: 264_level_1,Unnamed: 265_level_1,Unnamed: 266_level_1,Unnamed: 267_level_1,Unnamed: 268_level_1,Unnamed: 269_level_1,Unnamed: 270_level_1,Unnamed: 271_level_1,Unnamed: 272_level_1
1.0,24.0,2000.0,1.0,1.0,2.0,6.0,2.0,1223499.0,1418788,1434506.0,1446585.0,1459474.0,1477522.0,1496130.0,1513260.0,1530615.0,7.1,6.6,25.6,11.2,13.0,35.9,51.0,57.5,10.4,4.93,1.0,0.35,14.3,7.67,0.54,3.65,6.6,3.52,21.6,22.78,48.4,45.75,67.9,79.6,19.6,30.2,5.5,85.1,86.8,87.0,35.1,34.6,27.8,29.3,29.9,101397,7.2,25.8,26.9,82.8,91.7,555932,59.5,26.9,324200,508499,532050.0,2.69,2.76,26953,29240.94,56439,59728.0,60239.0,27635,417036,-1.6,83967.0,15599967.0,11357.0,1162,27906779.0,964.64,1470.8,13.9,16.3,16.7,25.2,22.2,675800.0,639075.0,36725.0,5.43,679394.0,630283.0,49111.0,7.23,681094.0,606070.0,75024.0,11.02,683127.0,596992.0,86135.0,12.61,679995.0,597683.0,82312.0,12.1,681281.0,609691.0,71590.0,10.51,680241.0,619835.0,60406.0,8.88,680676.0,630973.0,49703.0,7.3,685987.0,644870.0,41117.0,5.99,695243.0,657592.0,37651.0,5.42,8.4,3.8,0.9,702049.0,669469.0,32580.0,4.64,76.2,13.7,1.8,6.5,15.7,3.38,30.9,9.8,23.2,87.9,83.6,85.3,543025,10.1,6.5,1.1,7.9,13.9,43.6,2.4,56.4,87.7,88883,36.2,67151,32275,42375,0.7,7.9,1.1,32751,2.76,1524553,7.5,6.1,5.5,0.7,2.5,1.6,6.7,57.3,44.7,1,0,0,1,0,258,106481,39.47,25598.87,69.0,26.39,0.38,0.81,1.73,1.69,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,1,0,0,0,0,1,0,LaTanya Janelle McCoy's car was struck from be...,African-American/Black
4.0,45.0,2000.0,1.0,1.0,5.0,2.0,5.0,88787.0,101547,102468.0,103362.0,103625.0,104148.0,104249.0,104173.0,104346.0,6.6,6.1,24.5,14.5,16.5,39.6,52.0,70.0,25.8,13.37,0.4,0.19,0.8,0.43,0.01,0.28,1.7,1.0,2.9,3.23,68.7,67.2,96.2,85.8,2.2,4.1,5.8,81.9,84.9,85.3,31.8,32.6,19.0,21.0,20.9,9168,10.5,19.7,21.3,67.6,78.2,45319,67.4,15.2,116300,38035,39560.0,2.57,2.61,22725,23573.18,41022,42910.0,42803.0,2823,43725,-5.4,6460.0,1909615.0,19608.0,261,749924.0,579.82,175.1,16.8,19.4,18.5,36.4,29.4,47179.0,45607.0,1572.0,3.33,46452.0,44180.0,2272.0,4.89,45903.0,41638.0,4265.0,9.29,46414.0,42170.0,4244.0,9.14,46778.0,42747.0,4031.0,8.62,46054.0,42578.0,3476.0,7.55,45711.0,42553.0,3158.0,6.91,45159.0,42097.0,3062.0,6.78,44550.0,41767.0,2783.0,6.25,44543.0,41959.0,2584.0,5.8,11.0,3.1,0.5,44411.0,42484.0,1927.0,4.34,76.7,17.2,1.8,6.1,0.9,3.3,21.7,26.9,3.3,74.9,65.9,69.6,39311,0.8,0.5,0.2,1.0,2.3,34.3,11.8,65.7,86.0,67855,40.1,47580,25191,32910,0.3,0.6,0.1,27097,2.64,104702,2.2,6.2,10.5,0.5,2.2,1.6,9.8,69.1,66.5,1,0,0,1,0,573,52689,19.53,34109.75,66.35,30.0,0.28,0.61,1.54,1.21,1,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,1,0,John Edward Pittman was shot and killed by off...,African-American/Black
5.0,20.0,2000.0,1.0,1.0,5.0,2.0,5.0,816006.0,920581,948809.0,974918.0,982930.0,994658.0,1008841.0,1024248.0,1041423.0,6.8,6.2,23.9,9.1,10.8,35.2,51.3,44.5,44.1,22.07,0.2,0.19,5.6,3.34,0.02,0.7,2.2,1.17,7.9,7.38,40.8,39.92,83.7,78.4,13.0,15.9,4.8,89.6,91.3,91.7,23.4,23.2,47.6,49.8,50.2,48118,5.6,27.1,28.1,80.9,90.3,437105,56.0,44.1,253100,357463,391850.0,2.39,2.49,37211,40402.41,56709,62824.0,61336.0,33026,714815,-7.0,86718.0,13239670.0,13363.0,1101,16956339.0,526.64,1748.0,15.3,16.0,16.0,25.4,23.4,488891.0,465409.0,23482.0,4.8,497388.0,465380.0,32008.0,6.44,486983.0,437746.0,49237.0,10.11,485002.0,434315.0,50687.0,10.45,498861.0,448034.0,50827.0,10.19,511185.0,464673.0,46512.0,9.1,508273.0,467197.0,41076.0,8.08,509436.0,473594.0,35842.0,7.04,514749.0,483972.0,30777.0,5.98,531690.0,503142.0,28548.0,5.37,12.2,5.9,0.6,548023.0,521549.0,26474.0,4.83,77.8,11.4,1.2,6.0,7.1,3.34,52.9,44.1,7.2,85.6,83.2,86.2,410576,3.8,2.8,1.6,5.2,5.8,48.4,0.7,51.6,92.6,113594,35.5,69673,37675,49102,0.4,1.5,0.0,47163,2.44,1036200,2.3,5.2,10.3,0.9,5.4,4.0,5.3,44.6,39.6,1,0,0,1,0,4,1380145,511.62,2649.58,60.71,34.54,0.41,0.83,1.86,1.64,1,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,1,0,"On Jan. 5 around midnight, 20-year-old John Fr...",African-American/Black
6.0,19.0,2000.0,1.0,1.0,6.0,3.0,6.0,14598.0,13665,13787.0,13841.0,13858.0,13924.0,13810.0,13761.0,13806.0,6.4,5.9,22.4,18.2,19.2,40.9,51.2,95.0,1.2,0.6,0.4,0.17,0.4,0.13,0.0,0.61,1.0,1.07,4.1,4.9,93.1,90.88,95.2,82.9,2.3,5.6,8.4,90.6,94.2,94.0,36.0,36.4,21.5,28.3,28.1,1074,7.9,14.0,15.5,77.0,86.4,6231,76.2,13.0,92300,5771,5623.0,2.27,2.29,25412,29799.13,47689,53960.0,59966.0,511,5888,-11.0,978.0,241319.0,16968.0,14,103394.0,572.51,23.9,7.0,10.5,9.7,8.7,12.6,7648.0,7440.0,208.0,2.72,7050.0,6789.0,261.0,3.7,7078.0,6733.0,345.0,4.87,7411.0,7067.0,344.0,4.64,7515.0,7189.0,326.0,4.34,7601.0,7320.0,281.0,3.7,7507.0,7237.0,270.0,3.6,7393.0,7158.0,235.0,3.18,7339.0,7126.0,213.0,2.9,7312.0,7113.0,199.0,2.72,7.7,6.5,0.0,7281.0,7094.0,187.0,2.57,77.0,19.7,3.1,6.2,0.4,2.81,28.4,1.6,5.2,84.4,77.2,74.3,5656,0.2,0.4,0.1,2.1,2.9,28.4,2.5,71.6,93.6,79062,39.9,60298,29530,38108,0.5,1.3,0.0,32975,2.25,13745,1.5,2.7,7.1,0.0,8.3,5.7,7.8,94.7,90.5,0,0,0,1,0,631,48833,18.1,35202.9,86.04,9.08,0.79,1.0,1.36,1.73,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,1,0,Kyle Dillon allegedly walked up to officer Jas...,European-American/White
14.0,26.0,2000.0,1.0,2.0,10.0,0.0,10.0,1517550.0,1526006,1539649.0,1551944.0,1558109.0,1564042.0,1570507.0,1574765.0,1580863.0,6.6,6.9,22.5,12.1,12.9,34.1,52.8,41.0,43.4,21.28,0.5,0.18,6.3,3.55,0.03,2.78,2.8,1.4,12.3,14.13,36.9,34.93,77.3,86.0,11.5,20.9,5.1,79.4,82.7,83.3,22.8,22.3,22.2,26.4,27.1,84251,5.0,31.5,32.9,69.6,81.3,670171,55.3,32.8,135200,574488,591280.0,2.53,2.57,21117,23547.47,36251,41514.0,40649.0,26787,579921,-4.4,70396.0,11167787.0,7299.0,984,22582167.0,134.1,11379.5,25.1,25.3,25.8,36.4,35.7,620518.0,582727.0,37791.0,6.09,630620.0,586024.0,44596.0,7.07,652401.0,588815.0,63586.0,9.75,687795.0,615059.0,72736.0,10.58,690922.0,617204.0,73718.0,10.67,699324.0,623246.0,76078.0,10.88,698324.0,626430.0,71894.0,10.3,691042.0,634989.0,56053.0,8.11,695540.0,646080.0,49460.0,7.11,703565.0,655996.0,47569.0,6.76,10.6,4.2,0.6,704053.0,660073.0,43980.0,6.25,78.1,13.4,1.8,6.7,7.2,3.46,29.7,42.1,14.7,76.8,69.6,76.1,601337,4.9,6.8,1.9,6.6,10.9,47.0,0.3,53.0,84.7,68379,34.4,45927,26211,37691,0.4,6.5,0.0,27924,2.55,1579075,3.1,8.5,8.1,0.7,3.5,2.6,4.8,40.7,34.5,1,0,0,0,1,986,32228,11.95,40403.66,74.24,20.26,0.6,0.53,1.89,2.48,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,1,0,An Army veteran and West Chester University st...,African-American/Black


### Exporting training data to CSV file

In [46]:
train_set.to_csv('./data/train.csv')

### Creating testing (unlabeled) data by only keeping rows where race was unspecified

In [47]:
test_set = df_new[df_new['Race'] == 'Race unspecified']
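
A quick sanity check on the split can catch rows that land in both sets or in neither. The sketch below uses a small, made-up stand-in for `df_new` (the values and the `unlabeled` mask name are illustrative only) and assumes `train_set` holds the complementary rows, i.e. those where `Race` is anything other than `'Race unspecified'`.

```python
import pandas as pd

# Hypothetical stand-in for df_new; only the 'Race' column matters here.
df_new = pd.DataFrame(
    {
        "Age": [24.0, 53.0, 45.0, 23.0],
        "Race": [
            "African-American/Black",
            "Race unspecified",
            "African-American/Black",
            "Race unspecified",
        ],
    },
    index=pd.Index([1.0, 2.0, 4.0, 3.0], name="Unique ID"),
)

# Boolean mask marking the unlabeled rows.
unlabeled = df_new["Race"] == "Race unspecified"

# Assumption: the training set is the complement of the testing set.
train_set = df_new[~unlabeled]
test_set = df_new[unlabeled]

# The two partitions should share no rows and together cover df_new.
assert train_set.index.intersection(test_set.index).empty
assert len(train_set) + len(test_set) == len(df_new)
```

If either assertion fails on the real data, the usual suspects are duplicate `Unique ID` values or missing values in `Race`, which match neither side of an equality filter.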

### Examining testing data

In [48]:
test_set.head()

Unnamed: 0_level_0,Age,year,month,week_of_year,day_of_month,day_of_week,day_of_year,pop2000,pop2010,pop2011,pop2012,pop2013,pop2014,pop2015,pop2016,pop2017,age_under_5_2010,age_under_5_2017,age_under_18_2010,age_over_65_2010,age_over_65_2017,median_age_2017,female_2010,white_2010,black_2010,black_2017,native_2010,native_2017,asian_2010,asian_2017,pac_isl_2017,other_single_race_2017,two_plus_races_2010,two_plus_races_2017,hispanic_2010,hispanic_2017,white_not_hispanic_2010,white_not_hispanic_2017,speak_english_only_2017,no_move_in_one_plus_year_2010,foreign_born_2010,foreign_spoken_at_home_2010,women_16_to_50_birth_rate_2017,hs_grad_2010,hs_grad_2016,hs_grad_2017,some_college_2016,some_college_2017,bachelors_2010,bachelors_2016,bachelors_2017,veterans_2010,veterans_2017,mean_work_travel_2010,mean_work_travel_2017,broadband_2017,computer_2017,housing_units_2010,homeownership_2010,housing_multi_unit_2010,median_val_owner_occupied_2010,households_2010,households_2017,persons_per_household_2010,persons_per_household_2017,per_capita_income_2010,per_capita_income_2017,median_household_income_2010,median_household_income_2016,median_household_income_2017,private_nonfarm_establishments_2009,private_nonfarm_employment_2009,percent_change_private_nonfarm_employment_2009,nonemployment_establishments_2009,sales_2007,sales_per_capita_2007,building_permits_2010,fed_spending_2009,area_2010,density_2010,poverty_2010,poverty_2016,poverty_2017,poverty_age_under_5_2017,poverty_age_under_18_2017,civilian_labor_force_2007,employed_2007,unemployed_2007,unemployment_rate_2007,civilian_labor_force_2008,employed_2008,unemployed_2008,unemployment_rate_2008,civilian_labor_force_2009,employed_2009,unemployed_2009,unemployment_rate_2009,civilian_labor_force_2010,employed_2010,unemployed_2010,unemployment_rate_2010,civilian_labor_force_2011,employed_2011,unemployed_2011,unemployment_rate_2011,civilian_labor_force_2012,employed_2012,unemployed_2012,unemployment_rate_2012,civilian_labor_force_2013,
employed_2013,unemployed_2013,unemployment_rate_2013,civilian_labor_force_2014,employed_2014,unemployed_2014,unemployment_rate_2014,civilian_labor_force_2015,employed_2015,unemployed_2015,unemployment_rate_2015,civilian_labor_force_2016,employed_2016,unemployed_2016,unemployment_rate_2016,uninsured_2017,uninsured_age_under_19_2017,uninsured_age_over_74_2017,civilian_labor_force_2017,employed_2017,unemployed_2017,unemployment_rate_2017,age_over_18_2019,age_over_65_2019,age_over_85_2019,age_under_5_2019,asian_2019,avg_family_size_2019,bachelors_2019,black_2019,hispanic_2019,household_has_broadband_2019,household_has_computer_2019,household_has_smartphone_2019,households_2019,households_speak_asian_or_pac_isl_2019,households_speak_limited_english_2019,households_speak_other_2019,households_speak_other_indo_euro_lang_2019,households_speak_spanish_2019,housing_mobile_homes_2019,housing_one_unit_structures_2019,housing_two_unit_structures_2019,hs_grad_2019,mean_household_income_2019,median_age_2019,median_household_income_2019,median_individual_income_2019,median_individual_income_age_25plus_2019,native_2019,other_single_race_2019,pac_isl_2019,per_capita_income_2019,persons_per_household_2019,pop_2019,two_plus_races_2019,unemployment_rate_2019,uninsured_2019,uninsured_65_and_older_2019,uninsured_under_19_2019,uninsured_under_6_2019,veterans_2019,white_2019,white_not_hispanic_2019,metro_2013_1.0,metro_2013_Unknown,smoking_ban_2010_comprehensive,smoking_ban_2010_none,smoking_ban_2010_partial,rank,count,prop100k,cum_prop100k,pctwhite,pctblack,pctapi,pctaian,pct2prace,pcthispanic,Gender_Male,Gender_Transgender,State_AL,State_AR,State_AZ,State_CA,State_CO,State_CT,State_DE,State_FL,State_GA,State_HI,State_IA,State_ID,State_IL,State_IN,State_KS,State_KY,State_LA,State_MA,State_MD,State_ME,State_MI,State_MN,State_MO,State_MS,State_MT,State_NC,State_ND,State_NE,State_NH,State_NJ,State_NM,State_NV,State_NY,State_OH,State_OK,State_OR,State_PA,State_RI,State_SC,State_SD,State_TN,Sta
te_TX,State_UT,State_VA,State_VT,State_WA,State_WI,State_WV,State_WY,Highest level of force_Asphyxiation/Restrain,Highest level of force_Asphyxiation/Restrained,Highest level of force_Beaten/Bludgeoned with instrument,Highest level of force_Burned/Smoke inhalation,Highest level of force_Chemical agent/Pepper spray,Highest level of force_Drowned,Highest level of force_Drug overdose,Highest level of force_Fell from a height,Highest level of force_Gunshot,Highest level of force_Medical emergency,Highest level of force_Other,Highest level of force_Restrain/Asphyxiation,Highest level of force_Stabbed,Highest level of force_Tasered,Highest level of force_Undetermined,Highest level of force_Vehicle,Intended use of force (Developing)_Less-than-lethal force,Intended use of force (Developing)_No,Intended use of force (Developing)_Pursuit,Intended use of force (Developing)_Suicide,Intended use of force (Developing)_Undetermined,Intended use of force (Developing)_Vehic/Purs,Intended use of force (Developing)_Vehicle,"Foreknowledge of mental illness? INTERNAL USE, NOT FOR ANALYSIS_No","Foreknowledge of mental illness? INTERNAL USE, NOT FOR ANALYSIS_Unknown","Foreknowledge of mental illness? INTERNAL USE, NOT FOR ANALYSIS_Yes",Agency_multiple_agencies,Agency_other,Agency_police,Agency_sheriff,description,Race
Unique ID,Unnamed: 1_level_1,Unnamed: 2_level_1,Unnamed: 3_level_1,Unnamed: 4_level_1,Unnamed: 5_level_1,Unnamed: 6_level_1,Unnamed: 7_level_1,Unnamed: 8_level_1,Unnamed: 9_level_1,Unnamed: 10_level_1,Unnamed: 11_level_1,Unnamed: 12_level_1,Unnamed: 13_level_1,Unnamed: 14_level_1,Unnamed: 15_level_1,Unnamed: 16_level_1,Unnamed: 17_level_1,Unnamed: 18_level_1,Unnamed: 19_level_1,Unnamed: 20_level_1,Unnamed: 21_level_1,Unnamed: 22_level_1,Unnamed: 23_level_1,Unnamed: 24_level_1,Unnamed: 25_level_1,Unnamed: 26_level_1,Unnamed: 27_level_1,Unnamed: 28_level_1,Unnamed: 29_level_1,Unnamed: 30_level_1,Unnamed: 31_level_1,Unnamed: 32_level_1,Unnamed: 33_level_1,Unnamed: 34_level_1,Unnamed: 35_level_1,Unnamed: 36_level_1,Unnamed: 37_level_1,Unnamed: 38_level_1,Unnamed: 39_level_1,Unnamed: 40_level_1,Unnamed: 41_level_1,Unnamed: 42_level_1,Unnamed: 43_level_1,Unnamed: 44_level_1,Unnamed: 45_level_1,Unnamed: 46_level_1,Unnamed: 47_level_1,Unnamed: 48_level_1,Unnamed: 49_level_1,Unnamed: 50_level_1,Unnamed: 51_level_1,Unnamed: 52_level_1,Unnamed: 53_level_1,Unnamed: 54_level_1,Unnamed: 55_level_1,Unnamed: 56_level_1,Unnamed: 57_level_1,Unnamed: 58_level_1,Unnamed: 59_level_1,Unnamed: 60_level_1,Unnamed: 61_level_1,Unnamed: 62_level_1,Unnamed: 63_level_1,Unnamed: 64_level_1,Unnamed: 65_level_1,Unnamed: 66_level_1,Unnamed: 67_level_1,Unnamed: 68_level_1,Unnamed: 69_level_1,Unnamed: 70_level_1,Unnamed: 71_level_1,Unnamed: 72_level_1,Unnamed: 73_level_1,Unnamed: 74_level_1,Unnamed: 75_level_1,Unnamed: 76_level_1,Unnamed: 77_level_1,Unnamed: 78_level_1,Unnamed: 79_level_1,Unnamed: 80_level_1,Unnamed: 81_level_1,Unnamed: 82_level_1,Unnamed: 83_level_1,Unnamed: 84_level_1,Unnamed: 85_level_1,Unnamed: 86_level_1,Unnamed: 87_level_1,Unnamed: 88_level_1,Unnamed: 89_level_1,Unnamed: 90_level_1,Unnamed: 91_level_1,Unnamed: 92_level_1,Unnamed: 93_level_1,Unnamed: 94_level_1,Unnamed: 95_level_1,Unnamed: 96_level_1,Unnamed: 97_level_1,Unnamed: 98_level_1,Unnamed: 99_level_1,Unnamed: 
100_level_1,Unnamed: 101_level_1,Unnamed: 102_level_1,Unnamed: 103_level_1,Unnamed: 104_level_1,Unnamed: 105_level_1,Unnamed: 106_level_1,Unnamed: 107_level_1,Unnamed: 108_level_1,Unnamed: 109_level_1,Unnamed: 110_level_1,Unnamed: 111_level_1,Unnamed: 112_level_1,Unnamed: 113_level_1,Unnamed: 114_level_1,Unnamed: 115_level_1,Unnamed: 116_level_1,Unnamed: 117_level_1,Unnamed: 118_level_1,Unnamed: 119_level_1,Unnamed: 120_level_1,Unnamed: 121_level_1,Unnamed: 122_level_1,Unnamed: 123_level_1,Unnamed: 124_level_1,Unnamed: 125_level_1,Unnamed: 126_level_1,Unnamed: 127_level_1,Unnamed: 128_level_1,Unnamed: 129_level_1,Unnamed: 130_level_1,Unnamed: 131_level_1,Unnamed: 132_level_1,Unnamed: 133_level_1,Unnamed: 134_level_1,Unnamed: 135_level_1,Unnamed: 136_level_1,Unnamed: 137_level_1,Unnamed: 138_level_1,Unnamed: 139_level_1,Unnamed: 140_level_1,Unnamed: 141_level_1,Unnamed: 142_level_1,Unnamed: 143_level_1,Unnamed: 144_level_1,Unnamed: 145_level_1,Unnamed: 146_level_1,Unnamed: 147_level_1,Unnamed: 148_level_1,Unnamed: 149_level_1,Unnamed: 150_level_1,Unnamed: 151_level_1,Unnamed: 152_level_1,Unnamed: 153_level_1,Unnamed: 154_level_1,Unnamed: 155_level_1,Unnamed: 156_level_1,Unnamed: 157_level_1,Unnamed: 158_level_1,Unnamed: 159_level_1,Unnamed: 160_level_1,Unnamed: 161_level_1,Unnamed: 162_level_1,Unnamed: 163_level_1,Unnamed: 164_level_1,Unnamed: 165_level_1,Unnamed: 166_level_1,Unnamed: 167_level_1,Unnamed: 168_level_1,Unnamed: 169_level_1,Unnamed: 170_level_1,Unnamed: 171_level_1,Unnamed: 172_level_1,Unnamed: 173_level_1,Unnamed: 174_level_1,Unnamed: 175_level_1,Unnamed: 176_level_1,Unnamed: 177_level_1,Unnamed: 178_level_1,Unnamed: 179_level_1,Unnamed: 180_level_1,Unnamed: 181_level_1,Unnamed: 182_level_1,Unnamed: 183_level_1,Unnamed: 184_level_1,Unnamed: 185_level_1,Unnamed: 186_level_1,Unnamed: 187_level_1,Unnamed: 188_level_1,Unnamed: 189_level_1,Unnamed: 190_level_1,Unnamed: 191_level_1,Unnamed: 192_level_1,Unnamed: 193_level_1,Unnamed: 194_level_1,Unnamed: 
195_level_1,Unnamed: 196_level_1,Unnamed: 197_level_1,Unnamed: 198_level_1,Unnamed: 199_level_1,Unnamed: 200_level_1,Unnamed: 201_level_1,Unnamed: 202_level_1,Unnamed: 203_level_1,Unnamed: 204_level_1,Unnamed: 205_level_1,Unnamed: 206_level_1,Unnamed: 207_level_1,Unnamed: 208_level_1,Unnamed: 209_level_1,Unnamed: 210_level_1,Unnamed: 211_level_1,Unnamed: 212_level_1,Unnamed: 213_level_1,Unnamed: 214_level_1,Unnamed: 215_level_1,Unnamed: 216_level_1,Unnamed: 217_level_1,Unnamed: 218_level_1,Unnamed: 219_level_1,Unnamed: 220_level_1,Unnamed: 221_level_1,Unnamed: 222_level_1,Unnamed: 223_level_1,Unnamed: 224_level_1,Unnamed: 225_level_1,Unnamed: 226_level_1,Unnamed: 227_level_1,Unnamed: 228_level_1,Unnamed: 229_level_1,Unnamed: 230_level_1,Unnamed: 231_level_1,Unnamed: 232_level_1,Unnamed: 233_level_1,Unnamed: 234_level_1,Unnamed: 235_level_1,Unnamed: 236_level_1,Unnamed: 237_level_1,Unnamed: 238_level_1,Unnamed: 239_level_1,Unnamed: 240_level_1,Unnamed: 241_level_1,Unnamed: 242_level_1,Unnamed: 243_level_1,Unnamed: 244_level_1,Unnamed: 245_level_1,Unnamed: 246_level_1,Unnamed: 247_level_1,Unnamed: 248_level_1,Unnamed: 249_level_1,Unnamed: 250_level_1,Unnamed: 251_level_1,Unnamed: 252_level_1,Unnamed: 253_level_1,Unnamed: 254_level_1,Unnamed: 255_level_1,Unnamed: 256_level_1,Unnamed: 257_level_1,Unnamed: 258_level_1,Unnamed: 259_level_1,Unnamed: 260_level_1,Unnamed: 261_level_1,Unnamed: 262_level_1,Unnamed: 263_level_1,Unnamed: 264_level_1,Unnamed: 265_level_1,Unnamed: 266_level_1,Unnamed: 267_level_1,Unnamed: 268_level_1,Unnamed: 269_level_1,Unnamed: 270_level_1,Unnamed: 271_level_1,Unnamed: 272_level_1
2.0,53.0,2000.0,1.0,1.0,2.0,6.0,2.0,665865.0,691893,699684.0,710604.0,717321.0,725647.0,737418.0,746690.0,753253.0,7.3,7.2,23.9,9.0,11.1,35.5,52.1,33.3,54.3,26.99,0.4,0.33,5.1,3.03,0.02,0.98,2.4,1.33,9.8,8.67,29.4,29.05,81.0,78.9,16.3,18.1,5.4,87.9,88.6,88.7,26.0,25.5,38.7,41.7,42.1,40569,6.8,30.7,31.9,81.0,91.6,304968,58.6,36.6,190000,264837,273614.0,2.53,2.64,28412,31113.26,51349,56053.0,55876.0,16116,259528,-21.0,63918.0,7973387.0,10867.0,432,5036572.0,267.58,2585.7,16.1,17.5,17.6,30.0,27.7,393836.0,374934.0,18902.0,4.8,392969.0,367914.0,25055.0,6.38,381335.0,343126.0,38209.0,10.02,363001.0,323687.0,39314.0,10.83,366603.0,327936.0,38667.0,10.55,370150.0,335318.0,34832.0,9.41,368650.0,337594.0,31056.0,8.42,368717.0,342134.0,26583.0,7.21,373115.0,350620.0,22495.0,6.03,383126.0,362524.0,20602.0,5.38,16.0,8.9,1.0,394688.0,375712.0,18976.0,4.81,76.7,11.9,1.3,7.1,6.1,3.43,44.2,54.0,8.5,85.0,79.7,85.4,282436,3.5,4.8,3.4,4.3,6.6,45.4,0.5,54.6,89.3,91259,35.8,62399,32501,41409,1.0,1.7,0.0,36077,2.61,749323,2.8,6.0,14.0,1.2,7.7,5.3,6.3,34.3,29.1,1,0,0,1,0,6,1127803,418.07,3572.82,85.81,10.41,0.42,0.63,1.31,1.43,1,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,1,"Darren Mayfield, a DeKalb County sheriff's dep...",Race unspecified
3.0,23.0,2000.0,1.0,1.0,5.0,2.0,5.0,39678.0,38950,38799.0,38508.0,37389.0,37012.0,36668.0,36475.0,36518.0,5.7,5.7,22.5,15.8,17.6,41.6,49.9,91.8,4.9,2.04,0.2,0.01,0.5,0.33,0.0,0.31,1.3,0.9,3.9,4.42,89.6,89.23,95.5,84.5,1.7,4.0,7.4,84.1,88.3,89.3,31.6,30.7,13.5,15.2,15.1,3191,9.0,20.2,19.8,75.7,84.9,15895,75.8,14.5,101700,14630,14379.0,2.38,2.38,23259,28978.15,50500,54339.0,54339.0,917,12321,-1.1,1983.0,396450.0,10438.0,29,252752.0,1044.29,37.3,11.0,14.3,13.3,24.5,18.5,19679.0,18808.0,871.0,4.43,19387.0,18258.0,1129.0,5.82,19293.0,17310.0,1983.0,10.28,19316.0,17390.0,1926.0,9.97,18826.0,17238.0,1588.0,8.44,18228.0,16819.0,1409.0,7.73,17691.0,16262.0,1429.0,8.08,17468.0,16351.0,1117.0,6.39,17243.0,16315.0,928.0,5.38,16986.0,16076.0,910.0,5.36,6.1,4.7,0.0,16611.0,15882.0,729.0,4.39,78.5,18.9,3.2,5.4,0.9,2.88,14.8,4.2,4.6,79.2,73.4,73.2,14307,0.2,0.4,0.3,0.9,2.7,27.2,5.6,72.8,89.9,69815,42.0,55160,29147,36096,0.0,0.7,0.0,27925,2.36,36040,1.6,4.9,4.6,0.0,4.0,5.4,9.0,92.6,89.2,0,0,0,1,0,440,67208,24.91,31174.7,58.16,37.54,0.47,0.73,1.72,1.38,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,1,0,Officer Elias E. Mendiola shot Derrick E. Tate...,Race unspecified
7.0,23.0,2000.0,1.0,1.0,6.0,3.0,6.0,695454.0,919628,944943.0,968204.0,991619.0,1011315.0,1034442.0,1057237.0,1076837.0,7.4,6.9,25.4,8.8,10.4,34.9,51.6,55.3,30.8,15.59,0.5,0.15,4.6,2.75,0.03,2.49,2.6,1.44,12.2,12.85,50.6,47.87,80.6,78.2,13.5,17.4,4.8,88.3,89.6,89.9,28.1,27.9,40.0,43.1,44.1,54254,6.1,24.7,25.8,84.2,91.9,398510,62.6,30.3,185100,350392,395503.0,2.48,2.58,31848,34939.53,55294,63197.0,61695.0,27889,531089,6.5,70157.0,14114263.0,16306.0,2672,4560780.0,523.84,1755.5,12.5,12.3,13.4,20.0,18.2,456736.0,436055.0,20681.0,4.53,466991.0,439235.0,27756.0,5.94,465775.0,416463.0,49312.0,10.59,502428.0,448844.0,53584.0,10.67,513764.0,462812.0,50952.0,9.92,527788.0,481623.0,46165.0,8.75,537143.0,496753.0,40390.0,7.52,548086.0,515227.0,32859.0,6.0,565288.0,534989.0,30299.0,5.36,582855.0,555245.0,27610.0,4.74,13.1,6.3,0.6,601998.0,576068.0,25930.0,4.31,76.2,10.9,1.2,6.7,5.9,3.29,45.4,31.6,13.3,88.2,83.1,86.9,411097,3.2,4.8,1.3,4.3,9.7,43.6,1.4,56.4,90.3,97595,35.1,66641,35339,43202,0.4,5.6,0.1,38819,2.58,1074475,2.8,4.3,11.6,1.4,6.0,4.3,5.9,53.8,46.8,1,0,0,1,0,2097,15877,5.89,49511.63,85.49,10.42,0.38,0.89,1.47,1.34,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,State troopers and county deputies had tracked...,Race unspecified
8.0,31.0,2000.0,1.0,1.0,6.0,3.0,6.0,1709434.0,2035210,2062381.0,2076601.0,2086732.0,2104038.0,2120794.0,2137131.0,2157404.0,7.8,7.3,29.2,8.9,10.6,32.9,50.3,56.7,8.9,4.21,1.1,0.4,6.3,3.46,0.16,8.49,5.0,2.34,49.2,52.28,33.3,29.82,58.9,82.7,21.6,40.5,5.8,77.5,78.8,79.2,33.3,33.2,18.4,19.3,19.8,115631,6.1,29.3,30.9,79.3,89.3,699637,65.1,18.8,319000,596125,623642.0,3.29,3.31,21867,22388.22,55845,56194.0,57156.0,31866,519247,11.5,124210.0,21717402.0,10897.0,1789,11788836.0,20056.94,101.5,14.8,17.6,18.2,27.3,25.5,862191.0,813867.0,48324.0,5.6,862104.0,792836.0,69268.0,8.03,857696.0,749142.0,108554.0,12.66,890278.0,769890.0,120388.0,13.52,888513.0,774166.0,114347.0,12.87,892212.0,790420.0,101792.0,11.41,896673.0,809057.0,87616.0,9.77,907537.0,834901.0,72636.0,8.0,921170.0,861894.0,59276.0,6.43,932257.0,878388.0,53869.0,5.78,11.6,5.4,1.3,950765.0,904183.0,46582.0,4.9,73.4,11.3,1.2,7.2,7.2,3.76,21.0,8.3,53.3,84.4,77.6,84.7,636041,6.2,6.7,1.1,2.1,34.5,40.2,5.5,59.8,80.0,81382,33.3,63362,27235,37560,0.8,17.2,0.3,25215,3.29,2149031,5.0,7.0,8.4,1.5,3.9,2.9,5.8,61.2,28.5,1,0,0,1,0,18298,1398,0.52,74294.53,84.69,12.02,0.0,0.0,1.36,1.43,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,1,0,0,0,SWAT officers shot and killed Adrian Dolby sho...,Race unspecified
10.0,28.0,2000.0,1.0,1.0,7.0,4.0,7.0,9519338.0,9818605,9885998.0,9956152.0,10018604.0,10072695.0,10123248.0,10150558.0,10163507.0,6.6,6.3,24.5,10.9,12.5,36.0,50.7,50.3,8.7,4.1,0.7,0.34,13.7,7.23,0.14,10.4,4.5,1.91,47.7,48.42,27.8,26.49,43.4,86.8,35.6,56.4,4.6,75.9,77.8,78.2,26.3,26.2,29.0,30.8,31.2,368128,3.6,29.0,30.9,79.9,88.7,3445076,48.2,41.8,508800,3217889,3295198.0,2.97,3.01,27344,29852.16,55476,61308.0,61015.0,245523,3703233,-4.2,821177.0,119111840.0,12236.0,7260,80457156.0,4057.88,2419.6,15.7,16.3,17.0,24.0,23.7,4864160.0,4614776.0,249384.0,5.13,4928959.0,4555103.0,373856.0,7.58,4914702.0,4345182.0,569520.0,11.59,4917375.0,4302274.0,615101.0,12.51,4928464.0,4327923.0,600541.0,12.19,4915263.0,4378392.0,536871.0,10.92,4967167.0,4482594.0,484573.0,9.76,5004087.0,4591068.0,413019.0,8.25,5002332.0,4671098.0,331234.0,6.62,5054938.0,4789505.0,265433.0,5.25,13.3,5.5,1.2,5123933.0,4883640.0,240293.0,4.69,78.0,13.3,1.8,6.1,14.6,3.66,32.5,8.1,48.5,84.3,80.3,84.3,3316795,12.6,12.7,1.4,7.2,34.6,54.2,1.7,45.8,79.1,99133,36.5,68044,29985,38729,0.7,21.0,0.3,34156,2.99,10081570,4.0,5.7,9.6,1.5,3.9,2.8,3.3,51.3,26.2,1,0,0,1,0,38644,538,0.2,80669.84,95.91,0.0,0.0,0.0,0.93,2.04,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,1,0,0,0,1,0,Joseph Gumpert stopped breathing after a scuff...,Race unspecified


### Exporting testing data to CSV file

In [49]:
test_set.to_csv('./data/test.csv')
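
Because `to_csv` writes the index (`Unique ID` here) as the first column by default, reading the file back without `index_col` turns it into an ordinary column, and re-saving then adds `Unnamed: 0`-style columns like the ones visible in `df_main`. The sketch below shows one way to round-trip the file cleanly. It uses a made-up two-row frame and an in-memory buffer in place of `./data/test.csv`, so it runs without touching the disk.

```python
import io

import pandas as pd

# Hypothetical miniature of test_set, indexed by 'Unique ID'.
test_set = pd.DataFrame(
    {"Age": [53.0, 23.0], "Race": ["Race unspecified", "Race unspecified"]},
    index=pd.Index([2.0, 3.0], name="Unique ID"),
)

# Write to an in-memory buffer instead of a file on disk.
buffer = io.StringIO()
test_set.to_csv(buffer)  # the index is written as the first column
buffer.seek(0)

# Restore the index on load so no extra 'Unnamed' column appears.
reloaded = pd.read_csv(buffer, index_col="Unique ID")

print(reloaded.columns.tolist())
```

The same `index_col='Unique ID'` argument should apply when loading `train.csv` and `test.csv` in a later notebook.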