Notebook

# Climate Displacement

Disaster Types
- Year
- State
- Households Inflow (Number of Returns)
- Households Outflow (Number of Returns)
- Individuals Inflow (Number of Exemptions)
- Individuals Outflow (Number of Exemptions)
- Chemical
- Dam/Levee Break
- Drought
- Earthquake
- Fire
- Flood
- Human Cause
- Hurricane
- Ice
- Mud/Landslide
- Other
- Snow
- Storm
- Terrorism
- Tornado
- Tsunami
- Typhoon
- Volcano
- Water
- Winter

In [None]:
"""
Folders Setup

code 
    notebook.ipynb
    data
        Disasters
            FEMA_dataset.csv
        StateMigration
            1990to1991StateMigration
                 1990to1991StateMigrationInflow
                     Alabama91in.xls
                     Alaska91in.xls
                     .
                     .
                     .
                     Wisconsin91in.xls
                     Wyoming91in.xls 
                 1990to1991StateMigrationOutflow
                     Alabama91Out.xls
                     Alaska91Out.xls 
                     .
                     .
                     .
                     Wisconsin91Out.xls
                     Wyoming91Out.xls 
            .
            .
            .
            2008to2009StateMigration
            2009to2010StateMigration
            2010to2011StateMigration 
"""

## Pseudocode:

### FEMA Dataset Pre-processing (Neely)
1. Create new FEMA_dataset with columns 
    - contains Year, State, Disaster Type
2. Name file "State_Disasters_by_Year"

### StateMigration Data Pre-Processing (Ben)
1. Convert all datasets in StateMigration from .xls into .csv files
2. Extract "Total Flow" row with "Number of Returns" and "Number of Exemptions" - assign I if from inflow and O if from outflow - from every state file.
3. Extract "State" and "Year" from every file
4. Create file with "State" (from file name), "Year" (from file name), "Number_of_Returns_I", "Number_of_Exemptions_I", "Number_of_Returns_I" and "Number_of_Exemptions_O"
5. Name file "State Migration by Year"

### Merge Datasets (Both)
1. Merge datasets on common attributes "Year" and "State"
2. Name dataset "State_Migration_and_Disasters_by_Year"

#### Train and Testing
1. Create training and testing datasets
    Questions: How should we split training and testing data?
2. Create Neural Network models
    Input: Year, State, Disaster Type
    Output: Migration Inflow (Household/Individual), Migration Outflow (Household/Individual)
3. Put training and testing through the Neural Network models.
4. Evaluate which models are the most effective.
---
Data Augmentation
Synthetic Data
Use svm or decision tree -- skleant -- and compare against a neural network
could use all data for training and all data for validation - not this is a faulty practice in 
20% distribution of state 


```
read_file = pd.read_excel ("Test.xlsx")
 
# Write the dataframe object
# into csv file
read_file.to_csv ("Test.csv",
                  index = None,
                  header=True)
```

In [253]:
# adding imports
import shutil
import pandas as pd
import os
import glob
import xlrd
import csv
import numpy as np
#from fastai import *

### FEMA Dataset Pre-processing

Creates State_Disasters_by_Year.csv with:
- State
- Disaster Type
- Start Year
- End Year

In [None]:
# FEMA Dataset Preprocessing

# copy original FEMA dataset to new file
original = r'../code/data/Disasters/FEMA_dataset.csv'
new = r'../code/data/Disasters/State_Disasters_by_Year.csv'
shutil.copyfile(original, new)

# read csv file
data = pd.read_csv('~/code/data/Disasters/State_Disasters_by_Year.csv')

# delete irrelevant rows
data.pop('Declaration Number')
data.pop('Declaration Type')
data.pop('Declaration Date')
data.pop('County')
data.pop('Disaster Title')
data.pop('Close Date')
data.pop('Individual Assistance Program')
data.pop('Public Assistance Program')
data.pop('Hazard Mitigation Program')
data.pop('Individuals & Households Program')

# extract years
data['Start Year'] = pd.DatetimeIndex(data['Start Date']).year
data['End Year'] = pd.DatetimeIndex(data['End Date']).year

# delete start and end dates
data.pop('Start Date')
data.pop('End Date')

print(data)

# save changes csv
data.to_csv('../code/data/Disasters/State_Disasters_by_Year.csv')

### Converting State Migration data from .xls to .csv

Converting all datasets in StateMigration from .xls to .csv

In [None]:
# Convert all files in StateMigration folder from .xls to .csv

# create list of xls files
xls_list = glob.glob("/Users/ben/Desktop/climate-displacement/code/data/StateMigration/*/*/*.xls")

# replace xls 
for xls_file in xls_list:
    
    wb = xlrd.open_workbook(xls_file)
    sh = wb.sheet_by_index(0)
    csv_file = open(xls_file[0:-3]+'csv', "w")
    wr = csv.writer(csv_file, quoting=csv.QUOTE_ALL)
    
    for rownum in range(sh.nrows):
        wr.writerow(sh.row_values(rownum))
        
    csv_file.close()
    
    # remove .xls files
    os.remove(xls_file)


### More data wrangling - StateMigration dataset
- Extract "Total Flow" row with "Number of Returns" and "Number of Exemptions"
- Assign "I" if from inflow and "O" if from outflow - for every state file
- Extract "State" and "Year" from every file
- Create file with "State" (from file name), "Year" (from file name), "Number of Returns_I", "Number of Exemptions I", "Number of Returns O" and "Number of Exemptions O"

### Pseudocode

1. create output path in repository for the merged StateMigration dataset

2. read each csv file in the StateMigration folder, and for each file, 
    - d
3. check the csv files to make sure they are the intended data
4. remove the original csv files

In [None]:
# set new file location for merged StateMigration dataset
output_path = r'../code/data/StateMigration/State_Migrations_by_Year.csv'

In [None]:
# create the output file at output_path
output = open(output_path, "w")
output.close()

In [None]:
# create empty DataFrame object
df = pd.DataFrame()
df.insert(0,'State', '')
df.insert(1,'Year', '')
df.insert(2,'NOR(I)', '')
df.insert(3,'NOE(I)', '')
df.insert(4,'NOR(O)', '')
df.insert(5,'NOE(O)', '')

# print(df)

In [223]:
# create dictionary of "state initial keys" with multiple "values"
# run each segment through dictionary, and convert into state initial
stateDict = {
    "AL":['Alabama', 'al', 'AL', 'alab', 'Alab'],
    "AK":['Alaska', 'ak', 'AK', 'alas', 'Alas'],
    "AZ":['Arizona', 'az', 'AZ', 'ariz', 'Ariz'],
    "AR":['Arkansas', 'ar', 'AR', 'arka', 'Arka', 'aka'],
    "CA":['California', 'ca', 'CA', 'cali', 'Cali'],
    "CO":['Colorado', 'co', 'CO', 'colo', 'Colo'],
    "CT":['Connecticut', 'ct', 'CT', 'conn', 'Conn'],
    "DE":['Delaware', 'de', 'DE', 'dela', 'Dela'],
    "DC":['DistrictofColumbia', 'Districtofcolumbia', 'District of Columbia', 'dc', 'DC', 'dist', 'Dist', 'DiCo', 'dico'],
    "FL":['Florida', 'fl', 'FL', 'flor', 'Flor'],
    "GA":['Georgia', 'ga', 'GA', 'geor', 'Geor'],
    "HI":['Hawaii', 'hi', 'HI', 'hawa', 'Hawa'],
    "ID":['Idaho', 'id', 'ID', 'idah', 'Idah'],
    "IL":['Illinois', 'il', 'IL', 'illi', 'Illi'],
    "IN":['Indiana', 'in', 'IN', 'indi', 'Indi'],
    "IA":['Iowa', 'ia', 'IA', 'iowa'],
    "KS":['Kansas', 'ks', 'KS', 'kans', 'Kans'],
    "KY":['Kentucky', 'ky', 'KY', 'kent', 'Kent'],
    "LA":['Louisiana', 'la', 'LA', 'loui', 'Loui'],
    "MA":['Massachusetts', 'ma', 'MA', 'mass', 'Mass'],
    "MD":['Maryland', 'md', 'MD', 'mary', 'Mary'],
    "ME":['Maine', 'me', 'ME', 'main', 'Main'],
    "MI":['Michigan', 'mi', 'MI', 'mich', 'Mich'],
    "MN":['Minnesota', 'mn', 'MN', 'minn', 'Minn'],
    "MO":['Missouri', 'mo', 'MO', 'Miso', 'miso'],
    "MS":['Mississippi', 'ms', 'MS', 'Misi', 'misi', 'miss', 'Miss'],
    "MT":['Montana', 'mt', 'MT', 'mont', 'Mont'],
    "NC":['North Carolina', 'NorthCarolina', 'nc', 'NC', 'NoCa', 'noca', 'ncar', 'Northcarolina'],
    "ND":['North Dakota', 'NorthDakota', 'nd', 'ND', 'NoDa', 'noda', 'ndak', 'Northdakota'],
    "NE":['Nebraska', 'ne', 'NE', 'Nebr', 'nrbt', 'nebr'],
    "NH":['New Hampshire', 'NewHampshire', 'nh', 'NH', 'NeHa', 'neha', 'newh'],
    "NJ":['New Jersey', 'NewJersey', 'nj', 'NJ', 'NeJe', 'neje', 'newj', 'Newjersey'],
    "NM":['New Mexico', 'NewMexico', 'nm', 'NM', 'NeMe', 'neme', 'newm', 'Newmexico'],
    "NV":['Nevada', 'nv', 'NV', 'Neva', 'neva'],
    "NY":['New York', 'NewYork', 'ny', 'NY', 'newy', 'NeYo', 'neyo', 'newY','Newyork'],
    "OH":['Ohio', 'oh', 'OH', 'ohio', 'nhio'],
    "OK":['Oklahoma', 'ok', 'OK', 'okla', 'Okla'],
    "OR":['Oregon', 'or', 'OR', 'oreg', 'Oreg', 'oeg'],
    "PA":['Pennsylvania', 'pa', 'PA', 'penn', 'Penn'],
    "RI":['Rhode Island', 'RhodeIsland', 'ri', 'RI', 'Rhls', 'rhod', 'Rhod', 'RhIs'],
    "SC":['South Carolina', 'SouthCarolina', 'sc', 'SC', 'SoCa', 'soca', 'scar', 'Southcarolina'],
    "SD":['South Dakota', 'SouthDakota', 'sd', 'SD', 'SoDa', 'soda', 'sdak', 'Southdakota'],
    "TN":['Tennessee', 'tn', 'TN', 'Tenn', 'tenn'],
    "TX":['Texas', 'tx', 'TX', 'texa', 'Texa'],
    "UT":['Utah', 'ut', 'UT', 'utah'],
    "VA":['Virginia', 'va', 'VA', 'virg', 'Virg', 'vrg'],
    "VT":['Vermont', 'vt', 'VT', 'verm', 'Verm'],
    "WA":['Washington', 'wa', 'WA', 'wash', 'Wash'],
    "WI":['Wisconsin', 'wi', 'WI', 'wisc', 'Wisc', 'wiso', 'wsc'],
    "WV":['West Virginia', 'WestVirginia', 'wv', 'WV', 'west', 'wevi', 'wvir', 'Westvirginia'],
    "WY":['Wyoming', 'wy', 'WY', 'wyom', 'Wyom']    
}

def getKey(val):
    for key, valueList in stateDict.items():
         for value in valueList:
            if val == value:
                 return key
    
    return False

print(getKey('Wisconsin'))

WI


In [334]:
# read csv file
csv_list = glob.glob("/Users/ben/Desktop/climate-displacement/code/data/StateMigration/*/*/*.csv")

for csv_file in csv_list:
    # os.path.split returns a list of (head, tail) where head is the parent directories 
    # and tail is the filename and extension
    temp = os.path.split(csv_file)
    temp2 = os.path.split(temp[0])
    
    # get file name and parent folder from temp, temp2 respectively
    filename = temp[1]
    parentfile = temp2[1]
    
    print (filename, parentfile)
    # print (type(filename))
    
    # extract state, year, and inflow/outflow
    # three different naming conventions in the StateMigration dataset
    # 1) [State][Year1Year2 e.g. (0708)][in/out]
    # 2) [State][Year2 e.g. 91][In/Out]
    # 3) [Year1Year2 like 1)]inmig[in/out][state INITIAL e.g. AL]
    # 4) [first 4 letters of State][Year2][in/ot]
    # 5) s9[last digits of Year1, Year2 e.g. 56][state INITIAL][ir/or]
    # 6) same as 4) but with extra "r" at the end
    
    # naming convention 1: used for years 2004-2009
    name1 = [2004,2005,2006,2007,2008]
    # naming convention 2: used for years 1990-1993
    name2 = [1990,1991,1992]
    # naming convention 3: used for years 2009-2011
    name3 = [2009,2010]
    # naming convention 4: used for years 1993-1995, 1996-2000, 2001-2004
    name4 = [1993,1994,1996,1997, 1998, 1999, 2001,2002,2003]
    # naming convention 5: used for years 1995-1996
    name5 = 1995
    # naming convention 6: used for years 2000-2001
    name6 = 2000
    
    # extract inflow/outflow, year using parentfile, and state using filename
    if parentfile[-6] == 'u':
        io = parentfile[-7:]
    elif parentfile[-5] == 'n':
        io = parentfile[-6:]
    year = int(parentfile[0:4])
    
    #2009to2010StateMigrationInflow
    # print(io, year)
    
    if year in name1:
        if io == 'Inflow':
            state = filename[:-10]
        elif io == 'Outflow':
            state = filename[:-11]
    elif year in name2:
        if io == 'Inflow':
            state = filename[:-8]
        elif io == 'Outflow':
            state = filename[:-9]
    elif year in name3:
        state = filename[-6:-4]
    elif year in name4:
        state = filename[:4]
        if state == 'vrg9':
            state = 'vrg'
        elif state == 'vrg0':
            state = 'vrg'
        elif state == 'az94':
            state = 'az'
        elif state == 'aka9':
            state = 'aka'
        elif state == 'wsc9':
            state = 'wsc'
    elif year == name5:
        state = filename[-8:-6]
    elif year == name6:
        state = filename[:4]
        if state == 'vrg0':
            state = 'vrg'
        elif state == 'oeg0':
            state = 'oeg'

    si = getKey(state)
    if si != False:
        row_param = [si, year, io]
    print(row_param)
    
    # the total flow data in each years are located in different rows and columns.
    # type1 - 1990, 1991: located in row 9, columns D and F
    type1 = [1990, 1991]
    # type2 - 1992-1994, 2004-2006: located in row 9, columns D and E
    type2 = [1992,1993,1994,2004,2005,2006]
    # type3 - 1995-2003, 2007-2008: located in row 10, columns D and E
    type3 = [1995,1996,1997,1998,1999,2000,2001,2002,2003,2007,2008]
    # type4 - 2009-2010: located in row 8, columns E and F
    type4 = [2009,2010]
    
    data = pd.read_csv(csv_file)
    # print(data)
    if si != False:
        if year in type1:
            # total = data.iloc[7] #7, np.r_[3,5]
            totaltemp = data.iat[7,0]
            list = totaltemp.split(",")
            nor = list[3]
            noe = list[5]
        elif year in type2:
            # total = data.iloc[7] #7
            totaltemp = data.iat[7,0]
            #print(totaltemp)
            list = totaltemp.split(",")
            # print(list)
            nor = list[3]
            noe = list[4]
        elif year in type3:
            if year == 2003:
                nor = data.iat[8,4]
                noe = data.iat[8,5]
            elif year == 1997:
                nor = data.iat[8,4]
                noe = data.iat[8,5]
            else:
                totaltemp = data.iat[7,0]
                list = totaltemp.split(",")
                nor = list[3]
                noe = list[4]
        elif year in type4:
            total = data.iloc[6]
            nor = data.iat[6,4]
            noe = data.iat[6,5]
        # print(data)    
        # print(total)
        # print(nor, noe)
        # print(data)
        # data.shape
        # break
    
    

1011inmigoutmn.csv 2010to2011StateMigrationOutflow
['MN', 2010, 'Outflow']
45173.0 77893.0
1011inmigoutaz.csv 2010to2011StateMigrationOutflow
['AZ', 2010, 'Outflow']
80278.0 162164.0
1011inmigoutal.csv 2010to2011StateMigrationOutflow
['AL', 2010, 'Outflow']
43563.0 88001.0
1011inmigoutmo.csv 2010to2011StateMigrationOutflow
['MO', 2010, 'Outflow']
60740.0 115708.0
1011inmigoutnc.csv 2010to2011StateMigrationOutflow
['NC', 2010, 'Outflow']
101963.0 201396.0
1011inmigoutwv.csv 2010to2011StateMigrationOutflow
['WV', 2010, 'Outflow']
17290.0 31302.0
1011inmigoutwa.csv 2010to2011StateMigrationOutflow
['WA', 2010, 'Outflow']
75099.0 142475.0
1011inmigouttx.csv 2010to2011StateMigrationOutflow
['TX', 2010, 'Outflow']
169226.0 339560.0
1011inmigoutnv.csv 2010to2011StateMigrationOutflow
['NV', 2010, 'Outflow']
47545.0 91923.0
1011inmigoutor.csv 2010to2011StateMigrationOutflow
['OR', 2010, 'Outflow']
42231.0 76798.0
1011inmigouttn.csv 2010to2011StateMigrationOutflow
['TN', 2010, 'Outflow']
65801.0 

30940 62234
California0708out.csv 2007to2008StateMigrationOutflow
['CA', 2007, 'Outflow']
271521 523042
Delaware0708out.csv 2007to2008StateMigrationOutflow
['DE', 2007, 'Outflow']
13657 24597
NewYork0708out.csv 2007to2008StateMigrationOutflow
['NY', 2007, 'Outflow']
201570 353872
Illinois0708out.csv 2007to2008StateMigrationOutflow
['IL', 2007, 'Outflow']
122724 223922
NewHampshire0708out.csv 2007to2008StateMigrationOutflow
['NH', 2007, 'Outflow']
22314 36340
Florida0708out.csv 2007to2008StateMigrationOutflow
['FL', 2007, 'Outflow']
246296 452746
Indiana0708out.csv 2007to2008StateMigrationOutflow
['IN', 2007, 'Outflow']
62389 116700
Hawaii0708out.csv 2007to2008StateMigrationOutflow
['HI', 2007, 'Outflow']
25806 51114
RhodeIsland0708out.csv 2007to2008StateMigrationOutflow
['RI', 2007, 'Outflow']
16521 28349
Tennessee0708out.csv 2007to2008StateMigrationOutflow
['TN', 2007, 'Outflow']
70196 137842
Georgia0708out.csv 2007to2008StateMigrationOutflow
['GA', 2007, 'Outflow']
110175 220209
Ohio

100496 189668
mass01inr.csv 2000to2001StateMigrationInflow
['MA', 2000, 'Inflow']
77782 119024
illi01inr.csv 2000to2001StateMigrationInflow
['IL', 2000, 'Inflow']
104233 184133
flor01inr.csv 2000to2001StateMigrationInflow
['FL', 2000, 'Inflow']
263746 488310
cali01inr.csv 2000to2001StateMigrationInflow
['CA', 2000, 'Inflow']
265793 447702
ndak01inr.csv 2000to2001StateMigrationInflow
['ND', 2000, 'Inflow']
8498 15309
oeg01inr.csv 2000to2001StateMigrationInflow
['OR', 2000, 'Inflow']
53049 95705
newm01inr.csv 2000to2001StateMigrationInflow
['NM', 2000, 'Inflow']
28628 58137
verm01inr.csv 2000to2001StateMigrationInflow
['VT', 2000, 'Inflow']
10511 17472
arka01inr.csv 2000to2001StateMigrationInflow
['AR', 2000, 'Inflow']
29808 62874
wyom01inr.csv 2000to2001StateMigrationInflow
['WY', 2000, 'Inflow']
10777 20983
conn01inr.csv 2000to2001StateMigrationInflow
['CT', 2000, 'Inflow']
39182 70090
mont01inr.csv 2000to2001StateMigrationInflow
['MT', 2000, 'Inflow']
15084 28364
newy01inr.csv 2000to2

48084 84527
SouthCarolina0506out.csv 2005to2006StateMigrationOutflow
['SC', 2005, 'Outflow']
47617 92402
Alaska0506out.csv 2005to2006StateMigrationOutflow
['AK', 2005, 'Outflow']
14990 30159
Iowa0506out.csv 2005to2006StateMigrationOutflow
['IA', 2005, 'Outflow']
31035 56057
Alabama0506out.csv 2005to2006StateMigrationOutflow
['AL', 2005, 'Outflow']
38709 78087
Utah0506out.csv 2005to2006StateMigrationOutflow
['UT', 2005, 'Outflow']
27345 60429
Montana0506out.csv 2005to2006StateMigrationOutflow
['MT', 2005, 'Outflow']
14251 25456
Arizona0506out.csv 2005to2006StateMigrationOutflow
['AZ', 2005, 'Outflow']
73142 142620
NewJersey0506out.csv 2005to2006StateMigrationOutflow
['NJ', 2005, 'Outflow']
109461 205670
Wyoming0506out.csv 2005to2006StateMigrationOutflow
['WY', 2005, 'Outflow']
11109 20781
Maryland0506out.csv 2005to2006StateMigrationOutflow
['MD', 2005, 'Outflow']
83839 157777
SouthDakota0506out.csv 2005to2006StateMigrationOutflow
['SD', 2005, 'Outflow']
11157 20391
Maine0506out.csv 2005

109226 218437
rhod97ot.csv 1996to1997StateMigrationOutflow
['RI', 1996, 'Outflow']
13667 24644
miss97ot.csv 1996to1997StateMigrationOutflow
['MS', 1996, 'Outflow']
29567 60850
Alaska0405in.csv 2004to2005StateMigrationInflow
['AK', 2004, 'Inflow']
13497 26336
Vermont0405in.csv 2004to2005StateMigrationInflow
['VT', 2004, 'Inflow']
9423 15510
Nevada0405in.csv 2004to2005StateMigrationInflow
['NV', 2004, 'Inflow']
65624 124485
Mississippi0405in.csv 2004to2005StateMigrationInflow
['MS', 2004, 'Inflow']
28476 59874
Oklahoma0405in.csv 2004to2005StateMigrationInflow
['OK', 2004, 'Inflow']
36344 77028
Kentucky0405in.csv 2004to2005StateMigrationInflow
['KY', 2004, 'Inflow']
42112 85957
WestVirginia0405in.csv 2004to2005StateMigrationInflow
['WV', 2004, 'Inflow']
18534 37249
Maine0405in.csv 2004to2005StateMigrationInflow
['ME', 2004, 'Inflow']
15339 27817
Connecticut0405in.csv 2004to2005StateMigrationInflow
['CT', 2004, 'Inflow']
35809 65279
Washington0405in.csv 2004to2005StateMigrationInflow
['WA'

43938 79996
mont02ot.csv 2001to2002StateMigrationOutflow
['MT', 2001, 'Outflow']
15735 28504
main02ot.csv 2001to2002StateMigrationOutflow
['ME', 2001, 'Outflow']
14453 24120
miss02ot.csv 2001to2002StateMigrationOutflow
['MS', 2001, 'Outflow']
31688 63801
rhod02ot.csv 2001to2002StateMigrationOutflow
['RI', 2001, 'Outflow']
14134 24296
scar02ot.csv 2001to2002StateMigrationOutflow
['SC', 2001, 'Outflow']
49741 94443
virg02ot.csv 2001to2002StateMigrationOutflow
['VA', 2001, 'Outflow']
111993 214188
miso02ot.csv 2001to2002StateMigrationOutflow
['MO', 2001, 'Outflow']
61072 115684
verm02ot.csv 2001to2002StateMigrationOutflow
['VT', 2001, 'Outflow']
10040 15979
alas02ot.csv 2001to2002StateMigrationOutflow
['AK', 2001, 'Outflow']
14052 28522
dist02ot.csv 2001to2002StateMigrationOutflow
['DC', 2001, 'Outflow']
24648 40330
newm02ot.csv 2001to2002StateMigrationOutflow
['NM', 2001, 'Outflow']
28795 58616
penn02ot.csv 2001to2002StateMigrationOutflow
['PA', 2001, 'Outflow']
105038 183876
wash02ot.cs

97772 189997
utah03ot.csv 2002to2003StateMigrationOutflow
['UT', 2002, 'Outflow']
29051 63774
alab03ot.csv 2002to2003StateMigrationOutflow
['AL', 2002, 'Outflow']
40022 80610
indi03ot.csv 2002to2003StateMigrationOutflow
['IN', 2002, 'Outflow']
57021 108026
unit03ot.csv 2002to2003StateMigrationOutflow
['IN', 2002, 'Outflow']
ariz03in.csv 2002to2003StateMigrationInflow
['AZ', 2002, 'Inflow']
96989 184787
flor03in.csv 2002to2003StateMigrationInflow
['FL', 2002, 'Inflow']
268577 501376
hawa03in.csv 2002to2003StateMigrationInflow
['HI', 2002, 'Inflow']
25396 46639
sdak03in.csv 2002to2003StateMigrationInflow
['SD', 2002, 'Inflow']
10874 20723
cali03in.csv 2002to2003StateMigrationInflow
['CA', 2002, 'Inflow']
233121 400583
wyom03in.csv 2002to2003StateMigrationInflow
['WY', 2002, 'Inflow']
11118 21379
tenn03in.csv 2002to2003StateMigrationInflow
['TN', 2002, 'Inflow']
70483 141012
geor03in.csv 2002to2003StateMigrationInflow
['GA', 2002, 'Inflow']
127398 240138
kent03n.csv 2002to2003StateMigrati

newj95in.csv 1994to1995StateMigrationInflow
['NJ', 1994, 'Inflow']
76092 144626
ariz95in.csv 1994to1995StateMigrationInflow
['AZ', 1994, 'Inflow']
101506 202232
kent95in.csv 1994to1995StateMigrationInflow
['KY', 1994, 'Inflow']
44338 93214
flor95in.csv 1994to1995StateMigrationInflow
['FL', 1994, 'Inflow']
239487 458612
minn95in.csv 1994to1995StateMigrationInflow
['MN', 1994, 'Inflow']
43745 82800
texa95in.csv 1994to1995StateMigrationInflow
['TX', 1994, 'Inflow']
188255 403797
kans95in.csv 1994to1995StateMigrationInflow
['KS', 1994, 'Inflow']
38602 82942
arka95in.csv 1994to1995StateMigrationInflow
['AR', 1994, 'Inflow']
34166 74697
noca95in.csv 1994to1995StateMigrationInflow
['NC', 1994, 'Inflow']
118754 240838
fore95in.csv 1994to1995StateMigrationInflow
['NC', 1994, 'Inflow']
mary95in.csv 1994to1995StateMigrationInflow
['MD', 1994, 'Inflow']
72701 138222
wash95ot.csv 1994to1995StateMigrationOutflow
['WA', 1994, 'Outflow']
69079 138588
newh95ot.csv 1994to1995StateMigrationOutflow
['NH',

mary00ot.csv 1999to2000StateMigrationOutflow
['MD', 1999, 'Outflow']
74201 137591
minn00ot.csv 1999to2000StateMigrationOutflow
['MN', 1999, 'Outflow']
44776 80403
texa00ot.csv 1999to2000StateMigrationOutflow
['TX', 1999, 'Outflow']
178470 365607
kans00ot.csv 1999to2000StateMigrationOutflow
['KS', 1999, 'Outflow']
41324 83866
arka00ot.csv 1999to2000StateMigrationOutflow
['AR', 1999, 'Outflow']
29338 59853
vrg00ot.csv 1999to2000StateMigrationOutflow
['VA', 1999, 'Outflow']
107892 209319
kent00ot.csv 1999to2000StateMigrationOutflow
['KY', 1999, 'Outflow']
42922 85117
ariz00ot.csv 1999to2000StateMigrationOutflow
['AZ', 1999, 'Outflow']
73765 141670
flor00ot.csv 1999to2000StateMigrationOutflow
['FL', 1999, 'Outflow']
193024 356119
wyom00ot.csv 1999to2000StateMigrationOutflow
['WY', 1999, 'Outflow']
11764 22587
tenn00ot.csv 1999to2000StateMigrationOutflow
['TN', 1999, 'Outflow']
65781 131290
hawa00ot.csv 1999to2000StateMigrationOutflow
['HI', 1999, 'Outflow']
24402 50428
sdak00ot.csv 1999to2

25917 44661
Pennsylvania91Out.csv 1990to1991StateMigrationOutflow
['PA', 1990, 'Outflow']
98347 181510
Maine91Out.csv 1990to1991StateMigrationOutflow
['ME', 1990, 'Outflow']
16050 30153
Washington91Out.csv 1990to1991StateMigrationOutflow
['WA', 1990, 'Outflow']
62841 128296
Mississippi91Out.csv 1990to1991StateMigrationOutflow
['MS', 1990, 'Outflow']
29137 62824
Arkansas91Out.csv 1990to1991StateMigrationOutflow
['AR', 1990, 'Outflow']
28859 62373
Indiana91Out.csv 1990to1991StateMigrationOutflow
['IN', 1990, 'Outflow']
54571 107455
Utah91Out.csv 1990to1991StateMigrationOutflow
['UT', 1990, 'Outflow']
20081 47969
West Virginia91Out.csv 1990to1991StateMigrationOutflow
['WV', 1990, 'Outflow']
19745 39200
Montana91Out.csv 1990to1991StateMigrationOutflow
['MT', 1990, 'Outflow']
13442 26436
Minnesota91Out.csv 1990to1991StateMigrationOutflow
['MN', 1990, 'Outflow']
39327 72571
New Jersey91Out.csv 1990to1991StateMigrationOutflow
['NJ', 1990, 'Outflow']
100275 190163
Iowa91Out.csv 1990to1991State

55153.0 1031018.0
Wash04in.csv 2003to2004StateMigrationInflow
['WA', 2003, 'Inflow']
145923.0 3390105.0
Colo04in.csv 2003to2004StateMigrationInflow
['CO', 2003, 'Inflow']
127981.0 3083867.0
Iowa04in.csv 2003to2004StateMigrationInflow
['IA', 2003, 'Inflow']
51999.0 991128.0
Illi04in.csv 2003to2004StateMigrationInflow
['IL', 2003, 'Inflow']
160616.0 4038622.0
Neva04in.csv 2003to2004StateMigrationInflow
['NV', 2003, 'Inflow']
122779.0 2830762.0
Ohio04in.csv 2003to2004StateMigrationInflow
['OH', 2003, 'Inflow']
137350.0 3042753.0
Mass04in.csv 2003to2004StateMigrationInflow
['MA', 2003, 'Inflow']
88511.0 2714324.0
NeHa04in.csv 2003to2004StateMigrationInflow
['NH', 2003, 'Inflow']
40391.0 1211996.0
NoDa04in.csv 2003to2004StateMigrationInflow
['ND', 2003, 'Inflow']
16034.0 281228.0
Indi04in.csv 2003to2004StateMigrationInflow
['IN', 2003, 'Inflow']
104848.0 2082682.0
NeMe04in.csv 2003to2004StateMigrationInflow
['NM', 2003, 'Inflow']
59010.0 1100573.0
Alab04in.csv 2003to2004StateMigrationInflow

28751.0 402632.0
miss98ot.csv 1997to1998StateMigrationOutflow
['MS', 1997, 'Outflow']
59824.0 838453.0
rhod98ot.csv 1997to1998StateMigrationOutflow
['RI', 1997, 'Outflow']
23281.0 508354.0
dist98ot.csv 1997to1998StateMigrationOutflow
['DC', 1997, 'Outflow']
39496.0 948697.0
verm98ot.csv 1997to1998StateMigrationOutflow
['VT', 1997, 'Outflow']
15780.0 307970.0
alas98ot.csv 1997to1998StateMigrationOutflow
['AK', 1997, 'Outflow']
29092.0 487510.0
newm98ot.csv 1997to1998StateMigrationOutflow
['NM', 1997, 'Outflow']
66012.0 1005805.0
penn98ot.csv 1997to1998StateMigrationOutflow
['PA', 1997, 'Outflow']
184195.0 4370752.0
wsc98ot.csv 1997to1998StateMigrationOutflow
['WI', 1997, 'Outflow']
82670.0 1852299.0
newy98ot.csv 1997to1998StateMigrationOutflow
['NY', 1997, 'Outflow']
325884.0 8235769.0
main98ot.csv 1997to1998StateMigrationOutflow
['ME', 1997, 'Outflow']
25866.0 464961.0
oreg98ot.csv 1997to1998StateMigrationOutflow
['OR', 1997, 'Outflow']
80697.0 1479616.0
unit98ot.csv 1997to1998StateMig

22584.0 43149.0
0910inmiginIL.csv 2009to2010StateMigrationInflow
['IL', 2009, 'Inflow']
88260.0 155553.0
0910inmiginKS.csv 2009to2010StateMigrationInflow
['KS', 2009, 'Inflow']
35765.0 74019.0
0910inmiginIN.csv 2009to2010StateMigrationInflow
['IN', 2009, 'Inflow']
50184.0 97655.0
0910inmiginWY.csv 2009to2010StateMigrationInflow
['WY', 2009, 'Inflow']
11374.0 21788.0
0910inmiginOH.csv 2009to2010StateMigrationInflow
['OH', 2009, 'Inflow']
67259.0 124356.0
0910inmiginNM.csv 2009to2010StateMigrationInflow
['NM', 2009, 'Inflow']
28793.0 57568.0
0910inmiginMA.csv 2009to2010StateMigrationInflow
['MA', 2009, 'Inflow']
59534.0 92194.0
0910inmiginMT.csv 2009to2010StateMigrationInflow
['MT', 2009, 'Inflow']
13819.0 24485.0
0910inmiginOK.csv 2009to2010StateMigrationInflow
['OK', 2009, 'Inflow']
39457.0 82495.0
0910inmiginNY.csv 2009to2010StateMigrationInflow
['NY', 2009, 'Inflow']
138660.0 219994.0
0910inmiginAR.csv 2009to2010StateMigrationInflow
['AR', 2009, 'Inflow']
29135.0 60023.0
0910inmiginN

In [None]:
# add parameters to df object

In [None]:
# parse out the row from csv file for NOR, NOE

In [None]:
# convert df into csv

In [None]:
# close output csv file
output.close()

In [None]:
"""
for filename in os.listdir(directory):
    f = os.path.join(directory, filename)
    if os.path.isfile(f) and filename.endswith('.txt'):
        read_file = pd.read_excel (r'../code/data/StateMigration/1990to1991StateMigration/1990to1991StateMigrationInflow/Alabama91in.xls')
    read_file.to_csv (r'../code/data/StateMigration/1990to1991StateMigration/1990to1991StateMigrationInflow/Alabama91in.csv', index = None, header=True)


read_file = pd.read_excel (r'../code/data/StateMigration/1990to1991StateMigration/1990to1991StateMigrationInflow/Alabama91in.xls')
read_file.to_csv (r'../code/data/StateMigration/1990to1991StateMigration/1990to1991StateMigrationInflow/Alabama91in.csv', index = None, header=True)



location = "/Users/neely/Desktop/climate-displacement/code/data/StateMigration/1990to1991StateMigration/1990to1991StateMigrationInflow"
placement = "./Users/neely/Desktop/climate-displacement/code/data/StateMigration"

for file in glob.glob("*.xls"):
"""

"""
from os.path import isfile, join
onlyfiles = [f for f in listdir('../code/data/StateMigration/') if isfile(join('../code/data/StateMigration/', f))]
"""

In [None]:
# Neural Network -> fastai

# path to dataset here (./Users/neely/Desktop/climate-displacement/code/data/StateMigration)
path = untar_data(URLs.MNIST)
path.ls()

# data loader
dls = ImageDataLoaders.from_folder(path, train="training")