
# Lab | Handling Data Imbalance in Classification Models

In this lab and the following lessons, we will use the 'Healthcare For All' dataset to build a model that predicts who will donate (TargetB) and then how much they will give (TargetD, which will be used in Friday's lab). You will be using `files_for_lab/categorical.csv`, `files_for_lab/numerical.csv`, and `files_for_lab/target.csv`. You will need to download the data locally. Remember to add the files to your .gitignore.

### Scenario

You are revisiting the Healthcare for All Case Study. You are provided with historical data about donors and how much they donated. Your task is to build a machine learning model that will help the company identify people who are more likely to donate, and then try to predict the donation amount.

### Instructions

In this lab, we will look at the degree of imbalance in the data and correct it using the techniques we learned in class. You should fork and clone this repo and begin a new Jupyter notebook.

Here are the steps to be followed (building a simple model without balancing the data):


**Everyone is starting with the same cleaned data**

 

**Begin the Modeling here**
- Look critically at the dtypes of the numerical and categorical columns and make changes where appropriate.
- Concatenate numerical and categorical back together again for your X dataframe. Designate TargetB (the `TARGET_B` column) as y.
- Split the data into a training set and a test set.
- Split further into train_num and train_cat, and likewise into test_num and test_cat.
- Scale the numerical features using either a MinMax Scaler or a Standard Scaler (train_num, test_num).
- Encode the categorical features using One-Hot Encoding or Ordinal Encoding (train_cat, test_cat).
  - **Fit** the scaler and the encoder only on the train data, then transform both train and test.
  - Re-concatenate train_num and train_cat as X_train, and test_num and test_cat as X_test.
- Fit a logistic regression (classification) model on the training data.
- Check the accuracy on the test data.

**Note**: So far we have not balanced the data.

**Managing imbalance in the dataset**

- Check for the imbalance.
- Use the upsampling and downsampling strategies covered in class to balance the two classes.
- After each resampling, refit the model and see how its accuracy has changed.
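
One possible way to check the imbalance and resample is sketched below with `sklearn.utils.resample`. The `rebalance` helper, its `strategy` argument, and the toy data are hypothetical names introduced for this illustration; the techniques shown in class may differ. The helper assumes a binary target with one clearly larger class, and it is meant to be applied to the training split only so that the test set keeps its original class distribution.

```python
import pandas as pd
from sklearn.utils import resample


def rebalance(X_train, y_train, strategy="up", random_state=42):
    """Return a class-balanced copy of the training data (binary target assumed)."""
    train = pd.concat([X_train, y_train.rename("target")], axis=1)
    counts = train["target"].value_counts()
    majority = train[train["target"] == counts.idxmax()]
    minority = train[train["target"] == counts.idxmin()]

    if strategy == "up":
        # Upsampling: draw minority rows with replacement until the classes match.
        minority = resample(
            minority, replace=True, n_samples=len(majority), random_state=random_state
        )
    elif strategy == "down":
        # Downsampling: keep a random subset of the majority rows.
        majority = resample(
            majority, replace=False, n_samples=len(minority), random_state=random_state
        )
    else:
        raise ValueError("strategy must be 'up' or 'down'")

    # Shuffle so the two classes are not stored in separate blocks.
    balanced = pd.concat([majority, minority]).sample(frac=1, random_state=random_state)
    return balanced.drop(columns="target"), balanced["target"]


# Toy example: 8 rows of class 0 and 2 rows of class 1.
toy_X = pd.DataFrame({"feature": range(10)})
toy_y = pd.Series([0] * 8 + [1] * 2)

print(toy_y.value_counts())  # check the imbalance first
X_up, y_up = rebalance(toy_X, toy_y, strategy="up")
X_down, y_down = rebalance(toy_X, toy_y, strategy="down")
print(y_up.value_counts())
print(y_down.value_counts())
```

In this toy case upsampling yields 8 rows per class and downsampling yields 2 rows per class; the refit-and-compare step from the list would then use the rebalanced training data in place of the original.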





In [1]:
import pandas as pd
import numpy as np
import statsmodels.api as sm

from sklearn.model_selection import train_test_split
from sklearn.preprocessing import MinMaxScaler
from sklearn.preprocessing import OneHotEncoder, OrdinalEncoder

from sklearn.model_selection import cross_val_score

pd.set_option('display.max_columns', None)

In [2]:
num = pd.read_csv('files_for_lab/numerical.csv')
cat = pd.read_csv('files_for_lab/categorical.csv')
tar = pd.read_csv('files_for_lab/target.csv')

In [3]:
cat.isna().any()

STATE           False
CLUSTER         False
HOMEOWNR        False
GENDER          False
DATASRCE        False
RFA_2R          False
RFA_2A          False
GEOCODE2        False
DOMAIN_A        False
DOMAIN_B        False
ODATEW_YR       False
ODATEW_MM       False
DOB_YR          False
DOB_MM          False
MINRDATE_YR     False
MINRDATE_MM     False
MAXRDATE_YR     False
MAXRDATE_MM     False
LASTDATE_YR     False
LASTDATE_MM     False
FIRSTDATE_YR    False
FIRSTDATE_MM    False
dtype: bool

In [4]:
cat.dtypes

STATE           object
CLUSTER          int64
HOMEOWNR        object
GENDER          object
DATASRCE         int64
RFA_2R          object
RFA_2A          object
GEOCODE2        object
DOMAIN_A        object
DOMAIN_B         int64
ODATEW_YR        int64
ODATEW_MM        int64
DOB_YR           int64
DOB_MM           int64
MINRDATE_YR      int64
MINRDATE_MM      int64
MAXRDATE_YR      int64
MAXRDATE_MM      int64
LASTDATE_YR      int64
LASTDATE_MM      int64
FIRSTDATE_YR     int64
FIRSTDATE_MM     int64
dtype: object

In [5]:
for i in cat.columns:
    print(i,cat[i].unique())

STATE ['IL' 'CA' 'NC' 'FL' 'other' 'IN' 'MI' 'MO' 'TX' 'WA' 'WI' 'GA']
CLUSTER [36 14 43 44 16 40 39 45 35 53 17 51  2 20 27 12 22 13  8 25 15 42 11 28
 18 24 34  5 31 32 46  3 50  7 37 10 38 21  9 29 30  4 41 49 23 33  1 47
 26 48 19  6 52]
HOMEOWNR ['H' 'U']
GENDER ['F' 'M' 'other']
DATASRCE [3 1 2]
RFA_2R ['L']
RFA_2A ['E' 'G' 'F' 'D']
GEOCODE2 ['C' 'A' 'D' 'B']
DOMAIN_A ['T' 'S' 'R' 'U' 'C']
DOMAIN_B [2 1 3 4]
ODATEW_YR [89 94 90 87 86 88 96 92 93 95 91 97 85 84 83]
ODATEW_MM [ 1  4  2  9 10 11 12  6  8  3  5  7]
DOB_YR [37 52  0 28 20 60 32 23 26 27 54 36 16 43 56 14 48 29 10 13 68 53 46 31
 90 30 58 50 64 18 62 57 44 25 51 65 35 34 38 21 24 11 47 80 39 12 72 22
 40 59 15 17 42 19 61 41 49 33 45 63 81 55 67 69 70 66 76 91 74 71 78  1
 79 73 75  2 84 82 77 87 95 96 88 97 86  8  6  4 92  5 93  7 89  9 85 83]
DOB_MM [12  2  1 11  3  9  7  4 10  8  6  5]
MINRDATE_YR [92 93 91 87 94 95 88 96 89 90 97 86 77 80 75 84 82 78 85 83]
MINRDATE_MM [ 8 10 11  7  5  9  2 12  6  1  4  3]
MAXRDATE

In [6]:
(num.isna().any()).unique()

array([False])

In [7]:
for i in num.columns:
    if num[i].nunique()<75:
        print(i,num[i].nunique())

TCODE 55
INCOME 7
WEALTH1 10
LOCALGOV 55
STATEGOV 65
FEDGOV 55
WEALTH2 10
ETH6 21
ETH7 61
ETH8 60
ETH9 61
ETH10 33
ETH11 34
ETH12 43
ETH14 45
ETH16 66
AGE901 74
AGE902 67
AGE903 59
AGE904 66
AGE905 62
AGE906 55
AGE907 64
CHIL2 70
AGEC2 74
AGEC3 57
AGEC4 43
AGEC5 43
AGEC6 63
CHILC1 62
CHILC2 46
CHILC3 62
CHILC4 56
HHN5 69
HHN6 53
MARR2 51
MARR3 60
DW3 70
HV3 14
HV4 14
HHD7 66
HHD8 17
HHD9 64
HHD12 66
ETHC1 52
ETHC4 45
ETHC6 53
RHP3 27
RHP4 25
HUPA7 48
IC7 65
IC8 60
IC9 66
IC10 67
IC11 50
IC12 36
IC13 31
IC14 71
IC17 72
IC18 71
IC19 73
IC20 54
IC21 43
IC22 36
HHAS2 72
TPE2 70
TPE3 72
TPE4 67
TPE5 50
TPE6 27
TPE7 18
TPE9 56
TPE10 67
TPE11 62
TPE12 59
OCC1 70
OCC2 61
OCC3 35
OCC4 60
OCC5 63
OCC6 27
OCC7 40
OCC8 63
OCC9 69
OCC10 61
OCC11 61
OCC12 48
OCC13 44
EIC2 54
EIC3 51
EIC4 73
EIC5 45
EIC6 34
EIC7 43
EIC8 71
EIC9 50
EIC10 44
EIC11 50
EIC12 41
EIC13 56
EIC14 69
EIC15 58
EIC16 51
OEDC1 54
OEDC2 63
OEDC3 54
OEDC4 66
OEDC6 63
OEDC7 25
EC3 63
EC5 69
EC6 36
EC7 61
EC8 71
SEC3 19
SEC4 54
AFC3

In [8]:
col_transfer = ['TCODE','INCOME','WEALTH1','WEALTH2']

In [9]:
cat[col_transfer] = num[col_transfer]

In [10]:
num = num.drop(col_transfer,axis=1)

In [11]:
cat.dtypes

STATE           object
CLUSTER          int64
HOMEOWNR        object
GENDER          object
DATASRCE         int64
RFA_2R          object
RFA_2A          object
GEOCODE2        object
DOMAIN_A        object
DOMAIN_B         int64
ODATEW_YR        int64
ODATEW_MM        int64
DOB_YR           int64
DOB_MM           int64
MINRDATE_YR      int64
MINRDATE_MM      int64
MAXRDATE_YR      int64
MAXRDATE_MM      int64
LASTDATE_YR      int64
LASTDATE_MM      int64
FIRSTDATE_YR     int64
FIRSTDATE_MM     int64
TCODE            int64
INCOME           int64
WEALTH1          int64
WEALTH2          int64
dtype: object

In [12]:
for i in cat.columns:
    print(i,cat[i].unique())

STATE ['IL' 'CA' 'NC' 'FL' 'other' 'IN' 'MI' 'MO' 'TX' 'WA' 'WI' 'GA']
CLUSTER [36 14 43 44 16 40 39 45 35 53 17 51  2 20 27 12 22 13  8 25 15 42 11 28
 18 24 34  5 31 32 46  3 50  7 37 10 38 21  9 29 30  4 41 49 23 33  1 47
 26 48 19  6 52]
HOMEOWNR ['H' 'U']
GENDER ['F' 'M' 'other']
DATASRCE [3 1 2]
RFA_2R ['L']
RFA_2A ['E' 'G' 'F' 'D']
GEOCODE2 ['C' 'A' 'D' 'B']
DOMAIN_A ['T' 'S' 'R' 'U' 'C']
DOMAIN_B [2 1 3 4]
ODATEW_YR [89 94 90 87 86 88 96 92 93 95 91 97 85 84 83]
ODATEW_MM [ 1  4  2  9 10 11 12  6  8  3  5  7]
DOB_YR [37 52  0 28 20 60 32 23 26 27 54 36 16 43 56 14 48 29 10 13 68 53 46 31
 90 30 58 50 64 18 62 57 44 25 51 65 35 34 38 21 24 11 47 80 39 12 72 22
 40 59 15 17 42 19 61 41 49 33 45 63 81 55 67 69 70 66 76 91 74 71 78  1
 79 73 75  2 84 82 77 87 95 96 88 97 86  8  6  4 92  5 93  7 89  9 85 83]
DOB_MM [12  2  1 11  3  9  7  4 10  8  6  5]
MINRDATE_YR [92 93 91 87 94 95 88 96 89 90 97 86 77 80 75 84 82 78 85 83]
MINRDATE_MM [ 8 10 11  7  5  9  2 12  6  1  4  3]
MAXRDATE

In [13]:
ord_cols = ['DOMAIN_B','INCOME','WEALTH1','WEALTH2']

In [14]:
X = pd.concat([num,cat],axis=1)
y = tar['TARGET_B']

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)

In [15]:
X_train_num = X_train[num.columns]
X_test_num = X_test[num.columns]
X_train_cat = X_train[cat.columns]
X_test_cat = X_test[cat.columns]

In [16]:
X_train_num 

Unnamed: 0,AGE,HIT,MALEMILI,MALEVET,VIETVETS,WWIIVETS,LOCALGOV,STATEGOV,FEDGOV,POP901,POP902,POP903,POP90C1,POP90C2,POP90C3,POP90C4,POP90C5,ETH1,ETH2,ETH3,ETH4,ETH5,ETH6,ETH7,ETH8,ETH9,ETH10,ETH11,ETH12,ETH13,ETH14,ETH15,ETH16,AGE901,AGE902,AGE903,AGE904,AGE905,AGE906,AGE907,CHIL1,CHIL2,CHIL3,AGEC1,AGEC2,AGEC3,AGEC4,AGEC5,AGEC6,AGEC7,CHILC1,CHILC2,CHILC3,CHILC4,CHILC5,HHAGE1,HHAGE2,HHAGE3,HHN1,HHN2,HHN3,HHN4,HHN5,HHN6,MARR1,MARR2,MARR3,MARR4,HHP1,HHP2,DW1,DW2,DW3,DW4,DW5,DW6,DW7,DW8,DW9,HV1,HV2,HV3,HV4,HU1,HU2,HU3,HU4,HU5,HHD1,HHD2,HHD3,HHD4,HHD5,HHD6,HHD7,HHD8,HHD9,HHD10,HHD11,HHD12,ETHC1,ETHC2,ETHC3,ETHC4,ETHC5,ETHC6,HVP1,HVP2,HVP3,HVP4,HVP5,HVP6,HUR1,HUR2,RHP1,RHP2,RHP3,RHP4,HUPA1,HUPA2,HUPA3,HUPA4,HUPA5,HUPA6,HUPA7,RP1,RP2,RP3,RP4,MSA,ADI,DMA,IC1,IC2,IC3,IC4,IC5,IC6,IC7,IC8,IC9,IC10,IC11,IC12,IC13,IC14,IC15,IC16,IC17,IC18,IC19,IC20,IC21,IC22,IC23,HHAS1,HHAS2,HHAS3,HHAS4,MC1,MC2,MC3,TPE1,TPE2,TPE3,TPE4,TPE5,TPE6,TPE7,TPE8,TPE9,PEC1,PEC2,TPE10,TPE11,TPE12,TPE13,LFC1,LFC2,LFC3,LFC4,LFC5,LFC6,LFC7,LFC8,LFC9,LFC10,OCC1,OCC2,OCC3,OCC4,OCC5,OCC6,OCC7,OCC8,OCC9,OCC10,OCC11,OCC12,OCC13,EIC1,EIC2,EIC3,EIC4,EIC5,EIC6,EIC7,EIC8,EIC9,EIC10,EIC11,EIC12,EIC13,EIC14,EIC15,EIC16,OEDC1,OEDC2,OEDC3,OEDC4,OEDC5,OEDC6,OEDC7,EC1,EC2,EC3,EC4,EC5,EC6,EC7,EC8,SEC1,SEC2,SEC3,SEC4,SEC5,AFC1,AFC2,AFC3,AFC4,AFC5,AFC6,VC1,VC2,VC3,VC4,ANC1,ANC2,ANC3,ANC4,ANC5,ANC6,ANC7,ANC8,ANC9,ANC10,ANC11,ANC12,ANC13,ANC14,ANC15,POBC1,POBC2,LSC1,LSC2,LSC3,LSC4,VOC1,VOC2,VOC3,HC1,HC2,HC3,HC4,HC5,HC6,HC7,HC8,HC9,HC10,HC11,HC12,HC13,HC14,HC15,HC16,HC17,HC18,HC19,HC20,HC21,MHUC1,MHUC2,AC1,AC2,CARDPROM,NUMPROM,CARDPM12,NUMPRM12,RAMNTALL,NGIFTALL,CARDGIFT,MINRAMNT,MAXRAMNT,LASTGIFT,TIMELAG,AVGGIFT,CONTROLN,HPHONE_D,RFA_2F,CLUSTER2
52735,80.000000,24,0,39,18,57,2,5,0,1765,513,675,0,0,99,47,53,99,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,40,48,52,42,50,54,21,44,36,20,9,16,19,13,16,12,15,18,17,31,21,13,32,12,29,23,37,40,19,6,1,66,6,12,15,174,245,80,75,2,4,3,2,6,6,0,618,744,2,2,85,15,71,29,26,34,76,67,30,83,17,4,1,3,11,19,1,17,53,28,0,0,0,3,8,21,36,63,0,1,42,51,55,14,3,4,0,15,8,2,0,4,8,14,24,63,0.0,213.0,567.0,239,275,297,344,12833,26,25,18,18,7,4,1,0,1,16,25,22,20,9,5,1,0,2,40,5,40,9,37,63,5,79,16,0,0,0,0,0,2,2,1,33,19,23,6,57,49,62,38,60,36,52,46,67,99,3,10,14,4,10,8,0,1,7,2,14,15,7,7,2,11,4,35,7,1,3,13,1,2,4,3,4,5,4,2,2,5,0,4,86,2,0,120,22,13,26,14,7,13,4,2,17,2,14,3,0,0,0,19,39,2,18,21,57,5,1,11,1,8,0,0,10,0,0,1,0,0,3,0,0,1,77,97,2,0,1,93,68,22,8,19,3,13,29,55,72,28,2,0,0,2,28,48,0,23,38,34,9,99,96,6,2,6,9,13,30,6,12,78.0,6,3,5.0,25.0,10.0,1,13.000000,22081,1,3,60
13046,54.000000,0,1,25,33,32,7,9,2,4469,1272,1645,0,0,99,49,51,85,14,0,0,0,0,0,0,0,0,0,0,0,0,0,0,33,42,45,35,45,48,27,35,42,24,13,21,20,16,14,10,6,13,15,33,23,16,25,10,23,21,31,48,27,10,4,63,7,9,20,192,270,65,65,0,0,0,0,0,0,0,377,432,2,1,84,16,90,10,25,40,77,64,33,90,10,7,2,6,12,17,1,17,53,15,5,8,2,1,1,5,11,33,0,2,40,50,53,15,5,0,0,33,9,0,0,6,0,0,3,10,0.0,355.0,550.0,184,227,218,258,7980,43,24,16,10,6,1,0,0,0,31,28,19,13,7,1,0,0,0,36,13,19,22,24,76,1,71,23,1,1,0,0,0,2,3,17,22,30,35,16,70,54,63,46,55,43,61,48,57,99,9,9,6,3,6,11,0,0,8,9,20,17,4,7,8,0,13,28,2,2,2,14,4,3,6,0,5,9,3,2,5,6,1,9,77,1,1,120,26,23,33,9,5,3,2,0,27,0,23,4,0,0,0,12,24,1,27,26,35,11,0,7,2,3,0,0,6,0,0,0,0,0,2,0,0,0,82,96,2,0,2,88,64,23,13,21,3,11,26,52,68,32,0,0,2,31,26,26,0,15,1,98,1,98,92,8,2,9,8,21,58,6,15,165.0,13,5,5.0,21.0,6.0,9,12.692308,20775,1,4,60
47921,61.611649,0,0,0,0,0,0,0,0,1405,502,712,99,0,0,47,53,99,1,0,0,1,0,0,0,0,0,0,0,0,0,0,0,64,65,65,58,61,63,7,35,30,34,3,7,6,7,24,37,16,16,14,22,30,18,64,17,60,26,63,12,4,1,0,72,8,12,9,139,192,10,9,0,57,57,53,3,0,3,1523,1449,6,6,87,13,51,49,81,6,71,66,5,79,21,1,0,1,12,20,4,5,30,64,1,0,0,22,52,72,79,90,0,1,14,44,48,12,2,20,38,33,1,0,10,2,73,84,91,95,5345.0,133.0,571.0,275,300,321,361,18143,17,25,26,19,11,0,0,0,2,9,26,27,21,13,0,0,0,3,70,1,69,1,63,38,45,84,13,0,0,0,0,0,0,3,0,0,19,20,0,82,27,29,25,29,24,99,99,0,0,0,3,18,6,13,14,3,2,11,3,17,3,3,6,0,0,14,8,3,2,0,26,6,11,8,3,8,3,6,3,11,2,0,0,78,8,0,120,4,4,53,26,3,7,3,0,5,0,4,1,0,0,0,35,69,2,6,13,79,0,1,12,1,6,1,0,9,8,1,1,0,0,2,1,0,5,3,99,0,0,1,99,37,3,3,5,21,53,62,94,99,1,12,6,2,4,92,1,0,1,97,3,83,99,99,8,1,7,15,26,62,6,12,58.0,14,7,2.0,7.0,5.0,4,4.142857,43356,1,4,13
10200,61.611649,0,0,28,20,38,7,4,3,2094,614,836,0,0,99,50,50,99,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,38,49,52,40,50,53,25,36,39,25,9,17,16,16,16,15,12,15,13,32,24,16,37,16,35,25,39,37,22,10,4,68,6,9,17,166,250,87,86,0,2,2,0,0,0,0,150,211,1,0,77,23,80,20,3,31,73,67,28,87,13,3,2,2,12,18,2,20,54,26,0,0,0,0,0,1,2,7,0,2,49,55,56,13,4,2,0,11,20,2,0,1,1,1,4,10,0.0,227.0,717.0,178,223,229,271,9199,43,24,13,10,8,1,0,0,1,30,27,17,12,11,2,0,0,1,39,7,40,23,25,75,1,62,14,1,1,0,0,0,5,19,2,23,11,20,6,41,56,69,43,63,42,58,48,88,99,3,6,4,1,9,11,1,0,12,35,8,2,7,4,36,1,5,4,4,3,3,14,5,0,2,0,9,10,2,2,7,4,3,30,49,5,3,120,15,10,51,13,3,4,3,1,23,1,20,4,0,0,0,14,28,0,20,29,38,1,0,9,0,20,0,0,10,0,0,0,0,0,1,0,0,0,73,96,0,0,4,95,70,30,14,39,0,1,9,33,46,54,0,0,12,38,14,10,0,26,55,36,23,94,99,7,2,6,9,27,69,5,14,80.5,18,10,2.0,10.0,5.0,13,4.472222,186613,0,4,57
69225,61.611649,0,1,33,29,27,3,3,2,1758,519,736,99,0,0,47,53,96,2,1,1,1,0,1,0,0,0,0,0,1,0,0,0,29,36,39,32,40,44,25,47,35,19,17,29,23,12,9,6,5,22,17,31,16,14,15,8,14,25,35,40,18,5,1,62,13,6,18,171,239,36,33,1,63,62,60,0,0,0,958,1015,4,4,42,58,87,13,0,38,71,58,28,85,15,10,1,9,13,19,4,20,64,11,0,1,0,1,5,44,80,98,1,1,35,49,50,14,4,20,43,0,3,4,51,0,10,44,93,99,5360.0,181.0,659.0,288,328,335,371,13891,15,23,25,22,10,4,1,0,0,14,16,25,25,13,6,1,0,0,18,1,40,6,82,18,24,82,12,2,2,0,0,0,1,3,3,55,30,33,7,64,78,87,70,83,66,69,50,99,0,4,11,15,2,21,19,0,1,9,0,8,2,6,6,0,0,11,8,5,2,5,29,5,8,2,2,7,5,8,3,3,3,2,8,78,5,1,126,4,9,32,33,3,13,6,2,22,3,19,2,0,1,0,16,33,0,29,4,27,13,0,9,0,9,0,0,10,1,0,0,0,0,2,1,0,3,47,96,3,0,1,99,62,10,2,10,0,49,57,95,99,0,4,1,26,0,72,1,0,0,99,0,98,99,96,10,4,2,5,21,50,5,12,87.0,12,7,3.0,15.0,11.0,3,7.250000,50209,1,1,33
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
56942,83.000000,0,0,47,19,54,10,13,1,1395,375,716,99,0,0,40,60,96,3,0,1,1,0,0,0,1,0,0,0,0,1,0,0,40,47,51,42,49,53,18,43,37,20,11,18,17,13,15,15,12,18,19,25,22,16,35,23,34,46,30,24,12,3,1,51,16,14,20,115,195,43,41,0,56,56,48,0,0,0,677,705,3,4,36,64,96,4,0,21,52,40,16,73,27,6,1,5,12,42,2,15,53,28,0,2,0,0,0,9,35,86,0,6,32,48,48,11,4,27,29,0,7,9,47,0,9,32,68,98,7680.0,321.0,612.0,261,365,290,379,14298,26,21,24,13,14,2,0,0,0,12,19,17,22,27,3,0,0,0,47,0,51,7,56,44,15,77,18,3,3,0,0,0,0,2,2,7,17,18,0,64,57,69,50,67,50,78,73,35,0,11,23,11,3,16,25,0,2,6,1,5,3,4,3,1,1,2,8,4,3,7,16,9,4,4,0,14,12,9,8,10,13,1,6,59,9,1,140,3,9,24,25,4,21,14,4,21,1,15,8,0,0,0,17,47,1,19,16,54,8,0,7,5,7,0,0,10,3,0,1,1,0,1,0,0,1,59,98,1,0,1,88,38,13,4,24,0,0,0,5,45,55,0,2,38,0,62,0,0,0,99,0,99,99,99,6,2,6,6,24,62,6,13,124.0,11,3,5.0,20.0,15.0,8,11.272727,110124,0,2,13
58048,78.000000,0,0,17,33,31,5,3,1,2068,589,728,99,0,0,50,50,95,1,0,3,6,0,0,1,1,1,0,0,3,0,0,3,36,42,43,36,44,47,24,43,35,22,9,18,29,20,13,7,3,18,17,30,19,16,16,4,13,12,37,51,30,10,3,67,6,4,23,203,283,99,97,0,1,0,0,0,0,0,5380,5089,12,10,93,7,96,4,10,38,81,74,35,88,12,3,2,1,13,10,7,19,63,13,0,1,0,97,98,99,99,99,91,1,77,68,68,15,4,1,0,0,7,0,0,0,83,85,92,92,7360.0,65.0,807.0,700,744,770,809,29793,6,5,6,18,18,13,15,9,9,3,5,4,20,19,15,15,9,10,18,6,60,2,55,45,6,80,12,3,0,3,0,0,0,5,0,43,23,32,12,75,75,84,66,82,65,67,55,51,0,5,26,25,3,13,12,0,2,3,1,10,1,2,1,1,1,12,18,5,2,9,12,6,8,1,2,7,6,8,3,5,3,1,14,73,5,0,150,2,3,17,23,10,26,18,6,18,5,13,6,0,0,0,9,17,2,33,17,31,11,0,6,2,10,1,0,5,6,1,2,0,2,0,2,0,7,63,91,4,2,4,99,84,39,5,26,4,26,26,43,61,39,0,0,93,0,6,0,0,1,99,0,94,99,99,20,3,3,7,25,62,6,14,106.0,10,4,5.0,15.0,12.0,8,10.600000,162586,1,3,2
72833,55.000000,0,0,36,33,26,7,20,1,1027,283,379,28,0,72,46,54,88,11,0,0,2,0,0,0,0,0,0,0,0,1,0,1,38,43,45,40,47,50,19,43,37,21,9,21,23,15,12,9,12,17,17,33,20,13,16,4,13,20,41,39,19,4,1,57,11,16,17,173,243,59,59,1,1,0,0,10,10,0,1398,1512,3,3,87,13,95,5,0,32,75,63,27,77,23,5,1,4,14,18,5,15,55,19,1,6,3,23,42,69,88,99,3,1,50,56,57,14,4,1,0,40,7,1,0,6,14,28,44,79,6640.0,351.0,560.0,426,457,471,514,19545,11,16,8,26,30,2,2,6,0,2,13,9,32,33,3,0,7,0,13,2,52,5,51,49,14,73,23,0,0,0,0,1,1,1,1,64,20,23,2,81,77,88,69,88,66,89,62,99,0,0,28,12,7,7,14,0,3,6,0,18,2,1,1,1,0,13,10,4,0,0,14,5,5,0,3,12,23,5,5,7,20,1,6,53,12,0,140,5,12,23,14,9,15,20,7,18,1,16,8,0,0,0,17,36,0,33,13,26,6,0,8,0,5,0,0,1,0,0,2,0,1,1,0,0,2,61,92,0,0,8,95,76,25,5,16,0,27,38,74,97,3,0,0,0,29,56,10,0,5,5,94,4,99,96,10,2,6,5,8,19,6,13,55.0,2,2,25.0,30.0,30.0,5,27.500000,17726,1,2,17
75266,74.000000,0,1,37,25,28,13,4,3,1614,496,541,99,0,0,49,51,83,2,0,13,7,0,5,5,2,1,1,0,4,0,1,1,40,46,48,37,46,50,24,30,40,31,12,11,21,27,16,10,3,11,13,30,24,22,21,2,16,6,40,54,34,12,3,67,6,4,23,219,298,99,89,0,1,0,0,0,0,0,3556,3588,12,11,94,6,99,1,14,40,92,80,35,95,5,5,1,4,5,10,2,14,55,14,0,1,0,98,99,99,99,99,80,0,88,70,71,15,4,0,0,0,6,0,0,0,94,94,97,97,5945.0,13.0,803.0,735,701,838,805,34382,4,4,6,5,32,16,10,4,18,5,2,7,8,34,16,7,5,16,17,4,66,3,34,66,2,87,8,1,1,0,0,0,0,4,1,58,25,31,10,73,76,82,71,81,70,73,72,50,0,2,26,23,3,13,20,0,1,5,1,5,0,1,2,0,0,4,16,6,4,5,8,9,5,3,2,8,11,10,7,13,4,3,11,64,4,0,150,1,4,18,22,9,22,23,4,27,2,15,14,0,1,0,20,37,2,25,27,28,5,0,8,0,6,1,0,2,5,1,1,0,1,1,1,0,9,55,89,2,7,2,99,92,42,12,22,0,0,0,44,99,0,12,0,92,0,8,0,0,0,99,0,99,99,99,15,3,8,7,42,106,11,25,294.0,40,18,3.0,12.0,8.0,3,7.350000,2020,0,3,2


In [17]:
X_test_num

Unnamed: 0,AGE,HIT,MALEMILI,MALEVET,VIETVETS,WWIIVETS,LOCALGOV,STATEGOV,FEDGOV,POP901,POP902,POP903,POP90C1,POP90C2,POP90C3,POP90C4,POP90C5,ETH1,ETH2,ETH3,ETH4,ETH5,ETH6,ETH7,ETH8,ETH9,ETH10,ETH11,ETH12,ETH13,ETH14,ETH15,ETH16,AGE901,AGE902,AGE903,AGE904,AGE905,AGE906,AGE907,CHIL1,CHIL2,CHIL3,AGEC1,AGEC2,AGEC3,AGEC4,AGEC5,AGEC6,AGEC7,CHILC1,CHILC2,CHILC3,CHILC4,CHILC5,HHAGE1,HHAGE2,HHAGE3,HHN1,HHN2,HHN3,HHN4,HHN5,HHN6,MARR1,MARR2,MARR3,MARR4,HHP1,HHP2,DW1,DW2,DW3,DW4,DW5,DW6,DW7,DW8,DW9,HV1,HV2,HV3,HV4,HU1,HU2,HU3,HU4,HU5,HHD1,HHD2,HHD3,HHD4,HHD5,HHD6,HHD7,HHD8,HHD9,HHD10,HHD11,HHD12,ETHC1,ETHC2,ETHC3,ETHC4,ETHC5,ETHC6,HVP1,HVP2,HVP3,HVP4,HVP5,HVP6,HUR1,HUR2,RHP1,RHP2,RHP3,RHP4,HUPA1,HUPA2,HUPA3,HUPA4,HUPA5,HUPA6,HUPA7,RP1,RP2,RP3,RP4,MSA,ADI,DMA,IC1,IC2,IC3,IC4,IC5,IC6,IC7,IC8,IC9,IC10,IC11,IC12,IC13,IC14,IC15,IC16,IC17,IC18,IC19,IC20,IC21,IC22,IC23,HHAS1,HHAS2,HHAS3,HHAS4,MC1,MC2,MC3,TPE1,TPE2,TPE3,TPE4,TPE5,TPE6,TPE7,TPE8,TPE9,PEC1,PEC2,TPE10,TPE11,TPE12,TPE13,LFC1,LFC2,LFC3,LFC4,LFC5,LFC6,LFC7,LFC8,LFC9,LFC10,OCC1,OCC2,OCC3,OCC4,OCC5,OCC6,OCC7,OCC8,OCC9,OCC10,OCC11,OCC12,OCC13,EIC1,EIC2,EIC3,EIC4,EIC5,EIC6,EIC7,EIC8,EIC9,EIC10,EIC11,EIC12,EIC13,EIC14,EIC15,EIC16,OEDC1,OEDC2,OEDC3,OEDC4,OEDC5,OEDC6,OEDC7,EC1,EC2,EC3,EC4,EC5,EC6,EC7,EC8,SEC1,SEC2,SEC3,SEC4,SEC5,AFC1,AFC2,AFC3,AFC4,AFC5,AFC6,VC1,VC2,VC3,VC4,ANC1,ANC2,ANC3,ANC4,ANC5,ANC6,ANC7,ANC8,ANC9,ANC10,ANC11,ANC12,ANC13,ANC14,ANC15,POBC1,POBC2,LSC1,LSC2,LSC3,LSC4,VOC1,VOC2,VOC3,HC1,HC2,HC3,HC4,HC5,HC6,HC7,HC8,HC9,HC10,HC11,HC12,HC13,HC14,HC15,HC16,HC17,HC18,HC19,HC20,HC21,MHUC1,MHUC2,AC1,AC2,CARDPROM,NUMPROM,CARDPM12,NUMPRM12,RAMNTALL,NGIFTALL,CARDGIFT,MINRAMNT,MAXRAMNT,LASTGIFT,TIMELAG,AVGGIFT,CONTROLN,HPHONE_D,RFA_2F,CLUSTER2
34187,48.000000,16,0,30,45,29,4,18,11,1113,284,395,0,81,19,47,53,83,1,0,0,65,0,0,0,0,0,0,0,62,0,0,3,28,39,45,33,43,48,31,41,39,20,17,24,15,15,11,11,7,16,18,33,16,17,26,10,24,24,28,48,34,15,6,59,9,8,25,192,281,59,57,0,5,5,5,0,0,0,345,377,2,1,52,48,74,26,4,42,72,59,34,87,13,8,2,6,15,17,4,21,47,15,0,1,0,1,1,1,1,13,0,9,24,45,45,15,5,1,4,34,34,0,7,6,0,2,17,38,0.0,433.0,600.0,239,256,264,305,14697,34,23,16,19,7,0,0,0,1,26,23,15,26,9,0,0,0,2,12,5,19,22,40,60,2,65,30,0,0,0,0,2,0,2,0,27,13,19,8,35,61,81,37,81,37,34,20,0,0,2,21,0,2,7,10,0,6,16,5,11,6,4,11,7,7,11,9,1,4,2,12,2,4,4,1,4,16,2,15,4,18,11,0,62,4,0,120,19,23,15,24,1,12,6,1,32,2,24,7,0,0,0,18,30,2,45,5,29,6,0,2,0,3,0,0,3,0,0,0,0,0,3,0,0,3,81,41,58,1,0,91,54,17,5,32,0,18,28,40,50,50,0,0,85,3,12,0,0,0,87,9,91,99,88,5,2,5,2,15,34,6,12,55.0,6,3,3.0,20.0,7.0,6,9.166667,125995,0,2,53
58769,32.000000,0,0,35,74,7,7,0,7,942,258,285,99,0,0,51,49,88,10,0,2,1,0,0,0,0,0,0,0,0,0,0,1,29,37,38,28,39,41,35,40,42,18,11,29,36,13,6,2,2,13,19,35,20,13,9,1,3,7,23,70,47,16,4,69,9,2,19,286,330,99,99,0,0,0,0,0,0,0,849,828,6,6,89,11,98,2,0,60,91,79,53,96,4,7,1,6,8,5,2,26,58,4,4,6,0,0,0,4,80,99,0,0,74,62,64,19,5,0,0,0,11,0,0,0,91,97,97,97,4920.0,179.0,640.0,433,436,486,497,14861,0,3,23,41,27,0,6,0,0,0,0,26,39,28,0,7,0,0,6,0,36,0,68,32,30,86,11,0,0,0,0,0,0,3,8,0,23,25,0,82,83,98,67,98,67,65,42,99,0,0,11,17,0,12,22,0,4,5,0,19,1,4,5,0,0,15,8,16,0,6,20,3,6,2,2,3,3,8,9,7,0,7,13,68,5,0,140,2,6,21,35,15,14,7,5,29,3,23,8,0,0,0,18,35,0,74,13,7,0,2,6,0,11,0,0,14,1,0,0,0,0,0,1,0,0,48,95,5,0,0,99,81,23,4,13,0,9,35,99,99,0,0,0,13,6,81,0,0,0,99,0,99,99,99,8,3,5,4,10,23,6,12,27.0,4,2,3.0,10.0,8.0,2,6.750000,53579,1,3,20
78007,61.611649,2,0,32,26,27,4,5,1,1785,506,794,96,0,4,47,53,94,4,0,1,1,0,0,0,0,0,0,0,0,0,0,1,36,41,43,36,43,46,21,38,41,21,10,21,27,19,13,5,5,13,16,34,22,15,13,5,11,32,36,32,18,5,1,61,13,5,21,151,225,74,51,0,24,24,20,0,0,0,1071,1279,4,4,75,25,92,8,1,28,64,55,23,81,19,5,1,4,14,26,5,17,65,12,1,3,0,18,32,54,76,91,1,2,50,62,58,13,3,14,11,1,9,2,14,0,18,63,83,98,3120.0,281.0,518.0,357,494,490,575,23527,12,23,14,14,15,8,7,1,5,10,15,12,14,22,10,11,0,6,13,1,62,5,55,45,17,93,4,1,1,0,0,0,1,1,1,12,17,20,3,61,76,85,70,84,67,61,41,99,0,0,24,24,4,13,16,0,2,4,0,5,7,2,1,0,1,5,23,8,4,3,11,7,5,2,2,11,10,8,2,4,5,1,9,72,9,0,141,3,3,24,19,10,33,9,8,19,3,16,8,0,0,0,14,32,0,26,17,27,9,0,11,0,9,2,0,6,1,0,0,0,0,0,1,0,5,55,92,1,1,5,99,57,20,4,9,4,26,54,93,95,5,21,8,32,1,61,5,0,1,99,1,84,98,99,10,2,7,6,13,38,4,10,31.0,3,1,5.0,15.0,15.0,8,10.333333,15369,1,1,32
23738,61.611649,0,0,41,38,29,10,5,3,2526,691,1006,0,0,99,49,51,95,0,2,1,4,0,0,0,0,0,0,0,3,0,0,1,37,43,44,37,45,47,23,39,43,18,8,20,25,20,13,8,5,16,17,35,21,11,20,7,17,25,38,37,21,8,2,61,16,6,17,166,244,64,63,1,6,5,2,3,0,3,995,1095,3,3,73,27,93,7,13,31,69,58,25,80,20,6,2,4,20,16,6,19,60,15,0,0,0,5,12,49,75,95,1,6,41,51,53,13,4,6,0,27,12,4,1,6,16,28,58,75,0.0,459.0,811.0,325,369,379,420,15623,23,14,17,17,20,6,1,1,1,15,13,19,20,23,8,0,1,1,25,3,28,10,63,37,30,71,10,1,1,0,0,1,14,4,4,67,23,27,7,60,74,79,69,75,64,67,57,75,99,6,15,12,3,15,12,0,3,15,1,11,5,5,3,1,2,9,9,4,2,3,20,5,3,6,5,8,7,9,7,10,5,3,9,67,4,1,120,5,11,36,25,7,12,6,1,21,1,15,6,0,0,0,21,41,1,38,17,29,6,0,7,2,10,0,0,5,3,0,0,1,0,2,2,0,3,20,95,2,1,2,96,72,28,4,15,10,33,47,78,82,18,1,0,14,41,9,11,0,25,58,40,54,99,94,7,3,7,6,4,12,4,10,20.0,1,0,20.0,20.0,20.0,9,20.000000,142011,1,1,42
93323,35.000000,0,0,46,22,47,14,2,0,683,201,265,0,99,0,48,52,98,1,0,0,3,0,0,0,0,0,0,0,3,0,0,0,33,45,49,36,46,49,28,39,36,25,9,23,16,15,17,15,4,16,16,32,21,15,26,11,25,23,35,42,26,10,3,66,10,8,16,176,258,80,80,2,2,0,0,0,0,0,443,527,3,2,80,20,96,4,0,38,76,64,29,89,11,9,2,7,9,18,1,22,55,21,1,1,0,1,1,3,13,42,1,4,37,49,51,14,5,2,0,16,17,2,0,2,2,8,29,67,840.0,247.0,692.0,255,301,268,321,10258,38,11,26,10,14,3,0,0,0,25,14,27,14,18,3,0,0,0,27,0,25,23,33,67,3,89,11,0,0,0,0,0,0,1,0,19,11,15,0,28,63,77,51,70,48,73,35,99,99,0,11,5,8,13,14,0,4,9,0,14,2,7,11,0,0,10,21,0,3,0,18,6,9,3,0,6,5,11,8,14,2,0,11,63,9,0,120,3,23,38,19,13,3,2,0,28,1,25,2,0,0,0,21,46,0,22,6,47,18,0,1,3,5,0,0,9,1,0,0,0,0,1,0,0,1,66,93,4,0,3,97,53,15,8,28,0,13,30,32,56,44,0,0,76,0,24,0,0,0,99,0,99,99,96,5,2,3,15,25,65,4,9,61.0,19,11,3.0,5.0,3.0,5,3.210526,124049,1,3,53
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
27446,64.000000,0,0,0,0,0,0,0,0,1065,304,391,0,87,13,48,52,90,5,1,2,9,1,0,0,0,0,0,0,8,0,0,2,29,38,42,32,42,46,31,44,36,20,16,25,21,14,8,8,8,17,19,31,20,13,21,10,20,20,29,51,28,10,2,66,8,6,19,205,272,69,65,16,30,14,10,0,0,0,639,741,3,3,53,47,92,8,9,47,78,64,36,90,10,11,1,10,7,18,3,22,55,12,2,2,1,0,4,15,27,92,0,4,42,51,52,15,5,28,2,0,18,19,9,0,10,24,56,65,640.0,203.0,635.0,262,322,312,359,10726,27,20,17,18,14,2,3,0,0,22,14,19,19,20,3,4,0,0,19,7,48,23,60,40,16,81,16,0,0,0,0,0,2,0,0,40,18,22,0,54,66,79,55,75,52,58,52,51,99,9,12,18,5,8,25,0,3,9,1,13,1,1,4,1,0,7,19,0,4,3,15,6,10,4,0,8,7,3,12,14,10,4,5,64,0,3,125,6,11,30,27,4,21,1,4,29,3,21,8,0,0,0,8,18,0,38,0,5,17,0,4,0,8,0,0,7,0,1,1,0,0,0,1,4,16,68,78,5,0,17,91,51,17,4,10,0,15,52,99,99,0,0,3,63,0,36,0,0,2,99,0,87,99,94,8,2,3,5,10,24,6,12,32.0,3,3,5.0,15.0,12.0,4,10.666667,127066,0,2,52
78461,61.611649,2,0,32,28,37,7,4,3,5960,1574,2427,0,79,21,45,55,99,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,38,47,52,41,50,54,24,37,39,24,11,16,18,12,13,15,15,14,16,31,22,17,38,22,36,33,31,36,21,8,2,55,11,16,17,154,234,75,75,3,15,12,9,5,5,0,367,428,2,2,71,29,90,10,5,31,65,53,24,79,21,7,2,5,11,29,2,19,52,28,0,0,0,0,1,4,11,30,0,3,40,51,52,13,4,7,8,8,11,5,10,2,0,1,5,34,0.0,87.0,581.0,216,274,260,323,10931,36,23,18,13,9,1,1,0,0,20,24,24,17,12,1,2,0,0,42,5,42,13,46,54,5,80,11,0,0,0,0,0,5,4,2,28,12,21,5,41,55,67,46,64,44,74,55,85,99,3,13,9,4,9,10,0,1,16,3,16,7,4,7,3,2,6,14,2,7,0,23,5,3,4,0,12,9,5,5,8,4,3,11,65,8,1,120,11,17,39,15,7,7,5,2,23,2,17,5,0,0,0,16,35,1,25,12,39,4,1,8,0,16,0,0,6,0,0,0,0,0,1,1,0,0,85,98,1,0,1,89,51,16,7,36,1,2,4,24,40,60,0,0,68,3,23,4,0,3,98,2,89,99,90,9,2,5,6,32,75,6,15,153.0,19,14,3.0,15.0,15.0,5,8.052632,64316,1,4,59
1443,41.000000,11,1,26,26,50,7,3,1,1563,486,564,99,0,0,49,51,99,0,0,1,0,1,0,0,0,0,0,0,0,0,0,0,43,49,53,42,48,52,16,37,36,27,13,16,14,16,21,14,6,16,13,31,16,24,32,6,27,12,40,48,29,10,2,63,5,8,24,195,277,99,98,0,0,0,0,0,0,0,814,795,6,5,97,3,99,0,0,24,86,75,23,94,6,1,0,1,7,16,1,12,62,25,0,0,0,0,0,3,66,98,0,0,66,54,59,15,5,0,0,0,3,0,0,0,50,69,81,81,2160.0,57.0,505.0,430,468,471,505,16948,8,12,12,28,29,8,4,0,0,3,10,13,28,33,9,4,0,0,31,3,52,2,21,79,1,84,14,1,0,0,1,0,0,2,0,50,22,23,2,70,68,74,62,68,57,67,56,99,0,0,12,12,5,12,23,0,4,8,0,12,7,2,4,0,0,0,29,2,7,6,15,6,9,3,1,8,9,3,3,7,3,1,1,78,9,1,120,11,15,37,23,6,5,3,4,20,0,12,12,1,1,0,14,26,2,26,23,50,0,0,3,1,9,0,0,0,14,0,18,0,0,1,1,0,8,79,87,0,0,12,99,74,27,17,26,0,1,1,3,92,8,0,0,99,0,0,0,0,0,99,0,99,99,99,8,3,10,10,16,37,6,13,78.0,6,4,5.0,25.0,17.0,4,13.000000,65777,1,2,9
2493,69.000000,0,0,27,18,49,6,3,8,831,207,341,0,99,0,47,53,98,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,44,57,59,46,55,57,21,28,42,30,6,14,17,10,15,17,22,10,12,34,26,18,38,15,35,33,42,26,16,6,3,53,12,19,17,142,218,79,76,5,11,6,0,10,10,0,484,500,2,2,64,36,91,9,3,26,61,50,18,70,30,7,1,6,16,27,7,16,45,37,0,0,0,0,1,1,10,46,0,5,34,49,50,12,4,11,0,8,25,7,0,3,0,3,18,60,0.0,469.0,764.0,219,281,270,323,11716,26,28,22,7,14,2,0,0,0,13,32,24,10,19,3,0,0,0,48,6,44,5,47,53,6,72,23,0,0,0,0,0,3,2,3,19,8,13,2,18,45,54,37,54,37,77,43,88,0,3,18,13,7,4,6,0,3,14,7,2,7,5,12,2,2,8,10,7,3,14,8,5,5,6,0,15,7,6,3,6,3,8,5,62,16,0,120,24,17,24,13,10,9,2,0,17,0,15,3,0,0,0,14,27,2,18,33,49,5,0,4,0,17,0,0,4,2,1,0,0,0,1,3,0,1,70,97,1,0,2,94,60,19,8,35,0,0,4,8,36,64,0,0,92,5,2,0,0,2,99,0,99,99,88,5,2,6,6,28,61,5,11,117.0,20,10,2.0,10.0,10.0,27,5.850000,86373,1,1,59


In [18]:
X_train_cat

Unnamed: 0,STATE,CLUSTER,HOMEOWNR,GENDER,DATASRCE,RFA_2R,RFA_2A,GEOCODE2,DOMAIN_A,DOMAIN_B,ODATEW_YR,ODATEW_MM,DOB_YR,DOB_MM,MINRDATE_YR,MINRDATE_MM,MAXRDATE_YR,MAXRDATE_MM,LASTDATE_YR,LASTDATE_MM,FIRSTDATE_YR,FIRSTDATE_MM,TCODE,INCOME,WEALTH1,WEALTH2
52735,NC,46,H,F,3,L,F,D,R,2,95,1,18,1,94,10,94,9,95,9,94,9,28,5,2,9
13046,NC,47,U,F,1,L,F,D,R,2,92,1,44,4,92,2,95,11,96,3,92,2,2,2,3,3
47921,FL,29,H,M,3,L,D,C,C,2,86,1,0,2,90,7,95,9,96,1,89,9,0,5,9,9
10200,MO,53,H,F,2,L,D,D,R,3,86,1,0,2,92,11,91,4,96,10,86,11,0,2,9,9
69225,other,25,H,F,1,L,E,B,C,2,92,1,0,2,92,3,94,12,95,6,92,3,0,3,6,5
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
56942,other,26,U,F,2,L,F,B,C,2,91,1,15,5,91,1,94,12,95,12,91,1,2,3,9,7
58048,CA,14,H,F,2,L,E,A,S,1,90,1,20,1,90,4,93,10,95,12,90,4,0,1,9,9
72833,NC,34,H,M,3,L,G,B,T,1,96,1,43,1,95,9,96,2,96,2,95,9,1,7,8,9
75266,CA,14,U,F,2,L,E,A,S,1,86,1,24,1,86,9,96,1,96,7,86,9,2,6,9,9


In [19]:
X_test_cat

Unnamed: 0,STATE,CLUSTER,HOMEOWNR,GENDER,DATASRCE,RFA_2R,RFA_2A,GEOCODE2,DOMAIN_A,DOMAIN_B,ODATEW_YR,ODATEW_MM,DOB_YR,DOB_MM,MINRDATE_YR,MINRDATE_MM,MAXRDATE_YR,MAXRDATE_MM,LASTDATE_YR,LASTDATE_MM,FIRSTDATE_YR,FIRSTDATE_MM,TCODE,INCOME,WEALTH1,WEALTH2
34187,TX,36,H,F,3,L,F,D,T,2,94,1,50,1,94,1,95,9,95,12,94,1,2,6,3,9
58769,other,18,H,F,3,L,E,B,S,2,95,1,66,1,95,1,95,6,95,12,95,1,0,4,8,9
78007,NC,22,H,F,3,L,F,B,C,1,91,1,0,2,91,11,95,6,95,6,91,11,0,6,9,9
23738,other,46,H,F,2,L,F,D,R,2,96,1,0,2,95,12,95,12,95,12,95,12,0,5,9,9
93323,TX,36,H,M,3,L,D,B,T,2,88,1,63,1,95,11,94,2,95,11,88,1,0,5,5,4
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
27446,TX,21,U,M,2,L,F,B,S,3,95,1,34,1,95,2,95,6,95,10,95,2,0,7,9,9
78461,IN,45,U,F,3,L,F,D,R,2,86,1,0,2,87,5,95,4,96,3,86,10,2,2,2,3
1443,MI,5,H,M,3,L,G,A,U,2,94,1,56,8,93,9,95,6,95,9,93,9,1,5,7,7
2493,other,48,U,M,2,L,E,D,R,2,86,1,28,8,87,12,93,2,96,2,87,12,0,5,9,4


In [20]:
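# Fit the scaler on the training data only, then apply it to both splits.
transformer = MinMaxScaler().fit(X_train_num)

# Note: rebuilding the DataFrames from the transformed arrays resets the row index to 0..n-1.
X_train_num = pd.DataFrame(transformer.transform(X_train_num), columns=X_train_num.columns)
X_test_num = pd.DataFrame(transformer.transform(X_test_num), columns=X_test_num.columns)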
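
Because a DataFrame built from the scaler's output gets a fresh 0..n-1 index, concatenating it later with a categorical frame that still carries the original row labels would not line up row by row. The toy sketch below (hypothetical frames `toy_num` and `toy_cat`) illustrates the effect and one way around it, passing `index=` when rebuilding the DataFrame; resetting the index on both frames before concatenating would be another option.

```python
import pandas as pd
from sklearn.preprocessing import MinMaxScaler

# Toy frames sharing the same non-default row labels, as after train_test_split.
row_labels = [52735, 13046, 47921]
toy_num = pd.DataFrame({"AGE": [30.0, 60.0, 90.0]}, index=row_labels)
toy_cat = pd.DataFrame({"GENDER": ["F", "M", "F"]}, index=row_labels)

scaler = MinMaxScaler().fit(toy_num)

# Without index=, the scaled frame is labelled 0, 1, 2.
scaled_reset = pd.DataFrame(scaler.transform(toy_num), columns=toy_num.columns)
# concat aligns on labels, so the rows no longer match and NaNs appear.
misaligned = pd.concat([scaled_reset, toy_cat], axis=1)

# Keeping the original labels makes the concat line up row by row.
scaled_kept = pd.DataFrame(
    scaler.transform(toy_num), columns=toy_num.columns, index=toy_num.index
)
aligned = pd.concat([scaled_kept, toy_cat], axis=1)

print(misaligned.shape, aligned.shape)
print(aligned)
```

Here `misaligned` ends up with six rows, half of them missing in each column, while `aligned` keeps the three original rows.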

In [21]:
X_test_num

Unnamed: 0,AGE,HIT,MALEMILI,MALEVET,VIETVETS,WWIIVETS,LOCALGOV,STATEGOV,FEDGOV,POP901,POP902,POP903,POP90C1,POP90C2,POP90C3,POP90C4,POP90C5,ETH1,ETH2,ETH3,ETH4,ETH5,ETH6,ETH7,ETH8,ETH9,ETH10,ETH11,ETH12,ETH13,ETH14,ETH15,ETH16,AGE901,AGE902,AGE903,AGE904,AGE905,AGE906,AGE907,CHIL1,CHIL2,CHIL3,AGEC1,AGEC2,AGEC3,AGEC4,AGEC5,AGEC6,AGEC7,CHILC1,CHILC2,CHILC3,CHILC4,CHILC5,HHAGE1,HHAGE2,HHAGE3,HHN1,HHN2,HHN3,HHN4,HHN5,HHN6,MARR1,MARR2,MARR3,MARR4,HHP1,HHP2,DW1,DW2,DW3,DW4,DW5,DW6,DW7,DW8,DW9,HV1,HV2,HV3,HV4,HU1,HU2,HU3,HU4,HU5,HHD1,HHD2,HHD3,HHD4,HHD5,HHD6,HHD7,HHD8,HHD9,HHD10,HHD11,HHD12,ETHC1,ETHC2,ETHC3,ETHC4,ETHC5,ETHC6,HVP1,HVP2,HVP3,HVP4,HVP5,HVP6,HUR1,HUR2,RHP1,RHP2,RHP3,RHP4,HUPA1,HUPA2,HUPA3,HUPA4,HUPA5,HUPA6,HUPA7,RP1,RP2,RP3,RP4,MSA,ADI,DMA,IC1,IC2,IC3,IC4,IC5,IC6,IC7,IC8,IC9,IC10,IC11,IC12,IC13,IC14,IC15,IC16,IC17,IC18,IC19,IC20,IC21,IC22,IC23,HHAS1,HHAS2,HHAS3,HHAS4,MC1,MC2,MC3,TPE1,TPE2,TPE3,TPE4,TPE5,TPE6,TPE7,TPE8,TPE9,PEC1,PEC2,TPE10,TPE11,TPE12,TPE13,LFC1,LFC2,LFC3,LFC4,LFC5,LFC6,LFC7,LFC8,LFC9,LFC10,OCC1,OCC2,OCC3,OCC4,OCC5,OCC6,OCC7,OCC8,OCC9,OCC10,OCC11,OCC12,OCC13,EIC1,EIC2,EIC3,EIC4,EIC5,EIC6,EIC7,EIC8,EIC9,EIC10,EIC11,EIC12,EIC13,EIC14,EIC15,EIC16,OEDC1,OEDC2,OEDC3,OEDC4,OEDC5,OEDC6,OEDC7,EC1,EC2,EC3,EC4,EC5,EC6,EC7,EC8,SEC1,SEC2,SEC3,SEC4,SEC5,AFC1,AFC2,AFC3,AFC4,AFC5,AFC6,VC1,VC2,VC3,VC4,ANC1,ANC2,ANC3,ANC4,ANC5,ANC6,ANC7,ANC8,ANC9,ANC10,ANC11,ANC12,ANC13,ANC14,ANC15,POBC1,POBC2,LSC1,LSC2,LSC3,LSC4,VOC1,VOC2,VOC3,HC1,HC2,HC3,HC4,HC5,HC6,HC7,HC8,HC9,HC10,HC11,HC12,HC13,HC14,HC15,HC16,HC17,HC18,HC19,HC20,HC21,MHUC1,MHUC2,AC1,AC2,CARDPROM,NUMPROM,CARDPM12,NUMPRM12,RAMNTALL,NGIFTALL,CARDGIFT,MINRAMNT,MAXRAMNT,LASTGIFT,TIMELAG,AVGGIFT,CONTROLN,HPHONE_D,RFA_2F,CLUSTER2
0,0.484536,0.066390,0.000000,0.303030,0.454545,0.292929,0.040404,0.181818,0.126437,0.011276,0.011950,0.011157,0.000000,0.818182,0.191919,0.474747,0.535354,0.838384,0.010101,0.000000,0.000000,0.656566,0.000000,0.0,0.000000,0.0,0.0,0.0,0.0,0.639175,0.0,0.0,0.034884,0.333333,0.464286,0.535714,0.392857,0.511905,0.571429,0.413333,0.414141,0.393939,0.202020,0.171717,0.242424,0.151515,0.30,0.111111,0.111111,0.070707,0.161616,0.181818,0.333333,0.161616,0.171717,0.262626,0.101010,0.242424,0.242424,0.282828,0.484848,0.343434,0.151515,0.060606,0.595960,0.090909,0.109589,0.252525,0.295385,0.401429,0.595960,0.575758,0.000000,0.050505,0.050505,0.050505,0.000000,0.000000,0.000000,0.057500,0.062833,0.153846,0.076923,0.525253,0.484848,0.747475,0.262626,0.040404,0.424242,0.727273,0.595960,0.343434,0.878788,0.131313,0.080808,0.04,0.060606,0.151515,0.171717,0.040404,0.280000,0.474747,0.151515,0.000000,0.010101,0.000000,0.010101,0.010101,0.010101,0.010101,0.131313,0.000000,0.090909,0.242424,0.529412,0.500000,0.245902,0.125,0.010101,0.040404,0.343434,0.343434,0.000000,0.070707,0.060606,0.000000,0.020202,0.171717,0.383838,0.000000,0.671318,0.681044,0.159333,0.170667,0.176000,0.203333,0.092256,0.343434,0.232323,0.161616,0.191919,0.070707,0.000000,0.00,0.000000,0.010101,0.262626,0.232323,0.151515,0.262626,0.090909,0.000000,0.00,0.000000,0.020202,0.121212,0.050505,0.191919,0.222222,0.404040,0.606061,0.020202,0.656566,0.303030,0.000000,0.000000,0.0,0.000000,0.08,0.000000,0.020202,0.000000,0.272727,0.144444,0.250000,0.080808,0.353535,0.616162,0.818182,0.373737,0.818182,0.373737,0.343434,0.202020,0.000000,0.0,0.020202,0.212121,0.000000,0.020202,0.070707,0.101010,0.0,0.109091,0.161616,0.050505,0.111111,0.060606,0.040404,0.111111,0.070707,0.107692,0.111111,0.090909,0.010101,0.08,0.020202,0.121212,0.020202,0.040404,0.040404,0.014925,0.040404,0.161616,0.020202,0.151515,0.040404,0.181818,0.111111,0.000000,0.626263,0.040404,0.000000,0.705882,0.191919,0.232323,0.151515,0.242424,0.027027,0.121212,0.060
606,0.010309,0.323232,0.066667,0.333333,0.070707,0.000000,0.000000,0.0,0.181818,0.303030,0.066667,0.454545,0.050505,0.292929,0.060606,0.000000,0.020202,0.000000,0.036145,0.000000,0.0,0.030303,0.000000,0.000000,0.000000,0.000000,0.0,0.06,0.000000,0.000,0.030303,0.818182,0.414141,0.585859,0.010101,0.000000,0.919192,0.545455,0.171717,0.161290,0.615385,0.000000,0.181818,0.282828,0.404040,0.505051,0.505051,0.000000,0.000000,0.858586,0.030303,0.121212,0.000000,0.000000,0.000000,0.878788,0.090909,0.919192,1.000000,0.888889,0.238095,0.4,0.050505,0.020202,0.245614,0.157068,0.315789,0.142857,0.004434,0.021186,0.073171,0.003,0.003003,0.007,0.005515,0.007891,0.656978,0.0,0.333333,0.852459
1,0.319588,0.000000,0.000000,0.353535,0.747475,0.070707,0.070707,0.000000,0.080460,0.009544,0.010856,0.008050,1.000000,0.000000,0.000000,0.515152,0.494949,0.888889,0.101010,0.000000,0.020202,0.010101,0.000000,0.0,0.000000,0.0,0.0,0.0,0.0,0.000000,0.0,0.0,0.011628,0.345238,0.440476,0.452381,0.333333,0.464286,0.488095,0.466667,0.404040,0.424242,0.181818,0.111111,0.292929,0.363636,0.26,0.060606,0.020202,0.020202,0.131313,0.191919,0.353535,0.202020,0.131313,0.090909,0.010101,0.030303,0.070707,0.232323,0.707071,0.474747,0.161616,0.040404,0.696970,0.090909,0.027397,0.191919,0.440000,0.471429,1.000000,1.000000,0.000000,0.000000,0.000000,0.000000,0.000000,0.000000,0.000000,0.141500,0.138000,0.461538,0.461538,0.898990,0.111111,0.989899,0.020202,0.000000,0.606061,0.919192,0.797980,0.535354,0.969697,0.040404,0.070707,0.02,0.060606,0.080808,0.050505,0.020202,0.346667,0.585859,0.040404,0.072727,0.060606,0.000000,0.000000,0.000000,0.040404,0.808081,1.000000,0.000000,0.000000,0.747475,0.729412,0.711111,0.311475,0.125,0.000000,0.000000,0.000000,0.111111,0.000000,0.000000,0.000000,0.919192,0.979798,0.979798,0.979798,0.525641,0.277519,0.726447,0.288667,0.290667,0.324000,0.331333,0.093286,0.000000,0.030303,0.232323,0.414141,0.272727,0.000000,0.12,0.000000,0.000000,0.000000,0.000000,0.262626,0.393939,0.282828,0.000000,0.14,0.000000,0.000000,0.060606,0.000000,0.363636,0.000000,0.686869,0.323232,0.303030,0.868687,0.111111,0.000000,0.000000,0.0,0.000000,0.00,0.000000,0.030303,0.080808,0.000000,0.255556,0.328947,0.000000,0.828283,0.838384,0.989899,0.676768,0.989899,0.676768,0.656566,0.424242,1.000000,0.0,0.000000,0.111111,0.171717,0.000000,0.121212,0.222222,0.0,0.072727,0.050505,0.000000,0.191919,0.010101,0.040404,0.050505,0.000000,0.000000,0.151515,0.080808,0.161616,0.00,0.060606,0.202020,0.030303,0.060606,0.020202,0.029851,0.030303,0.030303,0.080808,0.090909,0.070707,0.000000,0.070707,0.131313,0.686869,0.050505,0.000000,0.823529,0.020202,0.060606,0.212121,0.353535,0.405405,0.141414,0.070
707,0.051546,0.292929,0.100000,0.319444,0.080808,0.000000,0.000000,0.0,0.181818,0.353535,0.000000,0.747475,0.131313,0.070707,0.000000,0.024096,0.060606,0.000000,0.132530,0.000000,0.0,0.141414,0.018182,0.000000,0.000000,0.000000,0.0,0.00,0.037037,0.000,0.000000,0.484848,0.959596,0.050505,0.000000,0.000000,1.000000,0.818182,0.232323,0.129032,0.250000,0.000000,0.090909,0.353535,1.000000,1.000000,0.000000,0.000000,0.000000,0.131313,0.060606,0.818182,0.000000,0.000000,0.000000,1.000000,0.000000,1.000000,1.000000,1.000000,0.380952,0.6,0.050505,0.040404,0.157895,0.099476,0.315789,0.142857,0.001478,0.012712,0.048780,0.003,0.001001,0.008,0.001838,0.005471,0.279375,1.0,0.666667,0.311475
2,0.624862,0.008299,0.000000,0.323232,0.262626,0.272727,0.040404,0.050505,0.011494,0.018085,0.021291,0.022427,0.969697,0.000000,0.040404,0.474747,0.535354,0.949495,0.040404,0.000000,0.010101,0.010101,0.000000,0.0,0.000000,0.0,0.0,0.0,0.0,0.000000,0.0,0.0,0.011628,0.428571,0.488095,0.511905,0.428571,0.511905,0.547619,0.280000,0.383838,0.414141,0.212121,0.101010,0.212121,0.272727,0.38,0.131313,0.050505,0.050505,0.131313,0.161616,0.343434,0.222222,0.151515,0.131313,0.050505,0.111111,0.323232,0.363636,0.323232,0.181818,0.050505,0.010101,0.616162,0.131313,0.068493,0.212121,0.232308,0.321429,0.747475,0.515152,0.000000,0.242424,0.242424,0.202020,0.000000,0.000000,0.000000,0.178500,0.213167,0.307692,0.307692,0.757576,0.252525,0.929293,0.080808,0.010101,0.282828,0.646465,0.555556,0.232323,0.818182,0.191919,0.050505,0.02,0.040404,0.141414,0.262626,0.050505,0.226667,0.656566,0.121212,0.018182,0.030303,0.000000,0.181818,0.323232,0.545455,0.767677,0.919192,0.010101,0.020202,0.505051,0.729412,0.644444,0.213115,0.075,0.141414,0.111111,0.010101,0.090909,0.020202,0.141414,0.000000,0.181818,0.636364,0.838384,0.989899,0.333333,0.435659,0.587968,0.238000,0.329333,0.326667,0.383333,0.147684,0.121212,0.232323,0.141414,0.141414,0.151515,0.080808,0.14,0.016393,0.050505,0.101010,0.151515,0.121212,0.141414,0.222222,0.101010,0.22,0.000000,0.060606,0.131313,0.010101,0.626263,0.050505,0.555556,0.454545,0.171717,0.939394,0.040404,0.010101,0.010101,0.0,0.000000,0.00,0.010101,0.010101,0.010101,0.121212,0.188889,0.263158,0.030303,0.616162,0.767677,0.858586,0.707071,0.848485,0.676768,0.616162,0.414141,1.000000,0.0,0.000000,0.242424,0.242424,0.040404,0.131313,0.161616,0.0,0.036364,0.040404,0.000000,0.050505,0.070707,0.020202,0.010101,0.000000,0.015385,0.050505,0.232323,0.080808,0.08,0.030303,0.111111,0.070707,0.050505,0.020202,0.029851,0.111111,0.101010,0.080808,0.020202,0.040404,0.050505,0.010101,0.090909,0.727273,0.090909,0.000000,0.829412,0.030303,0.030303,0.242424,0.191919,0.270270,0.333333,0.090
909,0.082474,0.191919,0.100000,0.222222,0.080808,0.000000,0.000000,0.0,0.141414,0.323232,0.000000,0.262626,0.171717,0.272727,0.090909,0.000000,0.111111,0.000000,0.108434,0.042553,0.0,0.060606,0.018182,0.000000,0.000000,0.000000,0.0,0.00,0.037037,0.000,0.050505,0.555556,0.929293,0.010101,0.010101,0.050505,1.000000,0.575758,0.202020,0.129032,0.173077,0.040404,0.262626,0.545455,0.939394,0.959596,0.050505,0.235955,0.129032,0.323232,0.010101,0.616162,0.050505,0.000000,0.010638,1.000000,0.010101,0.848485,0.989899,1.000000,0.476190,0.4,0.070707,0.060606,0.210526,0.178010,0.210526,0.116883,0.001900,0.008475,0.024390,0.005,0.002002,0.015,0.007353,0.009059,0.080134,1.0,0.000000,0.508197
3,0.624862,0.000000,0.000000,0.414141,0.383838,0.292929,0.101010,0.050505,0.034483,0.025592,0.029075,0.028416,0.000000,0.000000,1.000000,0.494949,0.515152,0.959596,0.000000,0.020408,0.010101,0.040404,0.000000,0.0,0.000000,0.0,0.0,0.0,0.0,0.030928,0.0,0.0,0.011628,0.440476,0.511905,0.523810,0.440476,0.535714,0.559524,0.306667,0.393939,0.434343,0.181818,0.080808,0.202020,0.252525,0.40,0.131313,0.080808,0.050505,0.161616,0.171717,0.353535,0.212121,0.111111,0.202020,0.070707,0.171717,0.252525,0.383838,0.373737,0.212121,0.080808,0.020202,0.616162,0.161616,0.082192,0.171717,0.255385,0.348571,0.646465,0.636364,0.014706,0.060606,0.050505,0.020202,0.030303,0.000000,0.030303,0.165833,0.182500,0.230769,0.230769,0.737374,0.272727,0.939394,0.070707,0.131313,0.313131,0.696970,0.585859,0.252525,0.808081,0.202020,0.060606,0.04,0.040404,0.202020,0.161616,0.060606,0.253333,0.606061,0.151515,0.000000,0.000000,0.000000,0.050505,0.121212,0.494949,0.757576,0.959596,0.010101,0.060606,0.414141,0.600000,0.588889,0.213115,0.100,0.060606,0.000000,0.272727,0.121212,0.040404,0.010101,0.060606,0.161616,0.282828,0.585859,0.757576,0.000000,0.711628,0.920545,0.216667,0.246000,0.252667,0.280000,0.098069,0.232323,0.141414,0.171717,0.171717,0.202020,0.060606,0.02,0.016393,0.010101,0.151515,0.131313,0.191919,0.202020,0.232323,0.080808,0.00,0.010101,0.010101,0.252525,0.030303,0.282828,0.101010,0.636364,0.373737,0.303030,0.717172,0.101010,0.010101,0.010101,0.0,0.000000,0.04,0.141414,0.040404,0.040404,0.676768,0.255556,0.355263,0.070707,0.606061,0.747475,0.797980,0.696970,0.757576,0.646465,0.676768,0.575758,0.757576,1.0,0.060606,0.151515,0.121212,0.030303,0.151515,0.121212,0.0,0.054545,0.151515,0.010101,0.111111,0.050505,0.050505,0.030303,0.010101,0.030769,0.090909,0.090909,0.040404,0.04,0.030303,0.202020,0.050505,0.030303,0.060606,0.074627,0.080808,0.070707,0.090909,0.070707,0.101010,0.050505,0.030303,0.090909,0.676768,0.040404,0.010101,0.705882,0.050505,0.111111,0.363636,0.252525,0.189189,0.121212,0.060
606,0.010309,0.212121,0.033333,0.208333,0.060606,0.000000,0.000000,0.0,0.212121,0.414141,0.033333,0.383838,0.171717,0.292929,0.060606,0.000000,0.070707,0.064516,0.120482,0.000000,0.0,0.050505,0.054545,0.000000,0.000000,0.023256,0.0,0.04,0.074074,0.000,0.030303,0.202020,0.959596,0.020202,0.010101,0.020202,0.969697,0.727273,0.282828,0.129032,0.288462,0.101010,0.333333,0.474747,0.787879,0.828283,0.181818,0.011236,0.000000,0.141414,0.414141,0.090909,0.111111,0.000000,0.265957,0.585859,0.404040,0.545455,1.000000,0.949495,0.333333,0.6,0.070707,0.060606,0.052632,0.041885,0.210526,0.116883,0.000739,0.000000,0.000000,0.020,0.003003,0.020,0.008272,0.018738,0.740492,1.0,0.000000,0.672131
4,0.350515,0.000000,0.000000,0.464646,0.222222,0.474747,0.141414,0.020202,0.000000,0.006920,0.008457,0.007485,0.000000,1.000000,0.000000,0.484848,0.525253,0.989899,0.010101,0.000000,0.000000,0.030303,0.000000,0.0,0.000000,0.0,0.0,0.0,0.0,0.030928,0.0,0.0,0.000000,0.392857,0.535714,0.583333,0.428571,0.547619,0.583333,0.373333,0.393939,0.363636,0.252525,0.090909,0.232323,0.161616,0.30,0.171717,0.151515,0.040404,0.161616,0.161616,0.323232,0.212121,0.151515,0.262626,0.111111,0.252525,0.232323,0.353535,0.424242,0.262626,0.101010,0.030303,0.666667,0.101010,0.109589,0.161616,0.270769,0.368571,0.808081,0.808081,0.029412,0.020202,0.000000,0.000000,0.000000,0.000000,0.000000,0.073833,0.087833,0.230769,0.153846,0.808081,0.202020,0.969697,0.040404,0.000000,0.383838,0.767677,0.646465,0.292929,0.898990,0.111111,0.090909,0.04,0.070707,0.090909,0.181818,0.010101,0.293333,0.555556,0.212121,0.018182,0.010101,0.000000,0.010101,0.010101,0.030303,0.131313,0.424242,0.010101,0.040404,0.373737,0.576471,0.566667,0.229508,0.125,0.020202,0.000000,0.161616,0.171717,0.020202,0.000000,0.020202,0.020202,0.080808,0.292929,0.676768,0.089744,0.382946,0.785471,0.170000,0.200667,0.178667,0.214000,0.064392,0.383838,0.111111,0.262626,0.101010,0.141414,0.030303,0.00,0.000000,0.000000,0.252525,0.141414,0.272727,0.141414,0.181818,0.030303,0.00,0.000000,0.000000,0.272727,0.000000,0.252525,0.232323,0.333333,0.676768,0.030303,0.898990,0.111111,0.000000,0.000000,0.0,0.000000,0.00,0.000000,0.010101,0.000000,0.191919,0.122222,0.197368,0.000000,0.282828,0.636364,0.777778,0.515152,0.707071,0.484848,0.737374,0.353535,1.000000,1.0,0.000000,0.111111,0.050505,0.080808,0.131313,0.141414,0.0,0.072727,0.090909,0.000000,0.141414,0.020202,0.070707,0.111111,0.000000,0.000000,0.101010,0.212121,0.000000,0.06,0.000000,0.181818,0.060606,0.090909,0.030303,0.000000,0.060606,0.050505,0.111111,0.080808,0.141414,0.020202,0.000000,0.111111,0.636364,0.090909,0.000000,0.705882,0.030303,0.232323,0.383838,0.191919,0.351351,0.030303,0.020
202,0.000000,0.282828,0.033333,0.347222,0.020202,0.000000,0.000000,0.0,0.212121,0.464646,0.000000,0.222222,0.060606,0.474747,0.181818,0.000000,0.010101,0.096774,0.060241,0.000000,0.0,0.090909,0.018182,0.000000,0.000000,0.000000,0.0,0.02,0.000000,0.000,0.010101,0.666667,0.939394,0.040404,0.000000,0.030303,0.979798,0.535354,0.151515,0.258065,0.538462,0.000000,0.131313,0.303030,0.323232,0.565657,0.444444,0.000000,0.000000,0.767677,0.000000,0.242424,0.000000,0.000000,0.000000,1.000000,0.000000,1.000000,1.000000,0.969697,0.238095,0.4,0.030303,0.151515,0.421053,0.319372,0.210526,0.103896,0.005068,0.076271,0.268293,0.003,0.000000,0.003,0.004596,0.001927,0.646831,1.0,0.666667,0.852459
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
19078,0.649485,0.000000,0.000000,0.000000,0.000000,0.000000,0.000000,0.000000,0.000000,0.010790,0.012791,0.011044,0.000000,0.878788,0.131313,0.484848,0.525253,0.909091,0.050505,0.010204,0.020202,0.090909,0.045455,0.0,0.000000,0.0,0.0,0.0,0.0,0.082474,0.0,0.0,0.023256,0.345238,0.452381,0.500000,0.380952,0.500000,0.547619,0.413333,0.444444,0.363636,0.202020,0.161616,0.252525,0.212121,0.28,0.080808,0.080808,0.080808,0.171717,0.191919,0.313131,0.202020,0.131313,0.212121,0.101010,0.202020,0.202020,0.292929,0.515152,0.282828,0.101010,0.020202,0.666667,0.080808,0.082192,0.191919,0.315385,0.388571,0.696970,0.656566,0.235294,0.303030,0.141414,0.101010,0.000000,0.000000,0.000000,0.106500,0.123500,0.230769,0.230769,0.535354,0.474747,0.929293,0.080808,0.090909,0.474747,0.787879,0.646465,0.363636,0.909091,0.101010,0.111111,0.02,0.101010,0.070707,0.181818,0.030303,0.293333,0.555556,0.121212,0.036364,0.020202,0.010101,0.000000,0.040404,0.151515,0.272727,0.929293,0.000000,0.040404,0.424242,0.600000,0.577778,0.245902,0.125,0.282828,0.020202,0.000000,0.181818,0.191919,0.090909,0.000000,0.101010,0.242424,0.565657,0.656566,0.068376,0.314729,0.720772,0.174667,0.214667,0.208000,0.239333,0.067330,0.272727,0.202020,0.171717,0.181818,0.141414,0.020202,0.06,0.000000,0.000000,0.222222,0.141414,0.191919,0.191919,0.202020,0.030303,0.08,0.000000,0.000000,0.191919,0.070707,0.484848,0.232323,0.606061,0.404040,0.161616,0.818182,0.161616,0.000000,0.000000,0.0,0.000000,0.00,0.020202,0.000000,0.000000,0.404040,0.200000,0.289474,0.000000,0.545455,0.666667,0.797980,0.555556,0.757576,0.525253,0.585859,0.525253,0.515152,1.0,0.090909,0.121212,0.181818,0.050505,0.080808,0.252525,0.0,0.054545,0.090909,0.010101,0.131313,0.010101,0.010101,0.040404,0.010101,0.000000,0.070707,0.191919,0.000000,0.08,0.030303,0.151515,0.060606,0.101010,0.040404,0.000000,0.080808,0.070707,0.030303,0.121212,0.141414,0.101010,0.040404,0.050505,0.646465,0.000000,0.030303,0.735294,0.060606,0.111111,0.303030,0.272727,0.108108,0.212121,0
.010101,0.041237,0.292929,0.100000,0.291667,0.080808,0.000000,0.000000,0.0,0.080808,0.181818,0.000000,0.383838,0.000000,0.050505,0.171717,0.000000,0.040404,0.000000,0.096386,0.000000,0.0,0.070707,0.000000,0.014706,0.010101,0.000000,0.0,0.00,0.037037,0.125,0.161616,0.686869,0.787879,0.050505,0.000000,0.171717,0.919192,0.515152,0.171717,0.129032,0.192308,0.000000,0.151515,0.525253,1.000000,1.000000,0.000000,0.000000,0.048387,0.636364,0.000000,0.363636,0.000000,0.000000,0.021277,1.000000,0.000000,0.878788,1.000000,0.949495,0.380952,0.4,0.030303,0.050505,0.157895,0.104712,0.315789,0.142857,0.002006,0.008475,0.073171,0.005,0.002002,0.012,0.003676,0.009393,0.662563,0.0,0.333333,0.836066
19079,0.624862,0.008299,0.000000,0.323232,0.282828,0.373737,0.070707,0.040404,0.034483,0.060384,0.066229,0.068554,0.000000,0.797980,0.212121,0.454545,0.555556,1.000000,0.000000,0.000000,0.000000,0.000000,0.000000,0.0,0.000000,0.0,0.0,0.0,0.0,0.000000,0.0,0.0,0.000000,0.452381,0.559524,0.619048,0.488095,0.595238,0.642857,0.320000,0.373737,0.393939,0.242424,0.111111,0.161616,0.181818,0.24,0.131313,0.151515,0.151515,0.141414,0.161616,0.313131,0.222222,0.171717,0.383838,0.222222,0.363636,0.333333,0.313131,0.363636,0.212121,0.080808,0.020202,0.555556,0.111111,0.219178,0.171717,0.236923,0.334286,0.757576,0.757576,0.044118,0.151515,0.121212,0.090909,0.050505,0.050505,0.000000,0.061167,0.071333,0.153846,0.153846,0.717172,0.292929,0.909091,0.101010,0.050505,0.313131,0.656566,0.535354,0.242424,0.797980,0.212121,0.070707,0.04,0.050505,0.111111,0.292929,0.020202,0.253333,0.525253,0.282828,0.000000,0.000000,0.000000,0.000000,0.010101,0.040404,0.111111,0.303030,0.000000,0.030303,0.404040,0.600000,0.577778,0.213115,0.100,0.070707,0.080808,0.080808,0.111111,0.050505,0.101010,0.020202,0.000000,0.010101,0.050505,0.343434,0.000000,0.134884,0.659478,0.144000,0.182667,0.173333,0.215333,0.068616,0.363636,0.232323,0.181818,0.131313,0.090909,0.010101,0.02,0.000000,0.000000,0.202020,0.242424,0.242424,0.171717,0.121212,0.010101,0.04,0.000000,0.000000,0.424242,0.050505,0.424242,0.131313,0.464646,0.545455,0.050505,0.808081,0.111111,0.000000,0.000000,0.0,0.000000,0.00,0.050505,0.040404,0.020202,0.282828,0.133333,0.276316,0.050505,0.414141,0.555556,0.676768,0.464646,0.646465,0.444444,0.747475,0.555556,0.858586,1.0,0.030303,0.131313,0.090909,0.040404,0.090909,0.101010,0.0,0.018182,0.161616,0.030303,0.161616,0.070707,0.040404,0.070707,0.030303,0.030769,0.060606,0.141414,0.020202,0.14,0.000000,0.232323,0.050505,0.030303,0.040404,0.000000,0.121212,0.090909,0.050505,0.050505,0.080808,0.040404,0.030303,0.111111,0.656566,0.080808,0.010101,0.705882,0.111111,0.171717,0.393939,0.151515,0.189189,0.070707,0
.050505,0.020619,0.232323,0.066667,0.236111,0.050505,0.000000,0.000000,0.0,0.161616,0.353535,0.033333,0.252525,0.121212,0.393939,0.040404,0.012048,0.080808,0.000000,0.192771,0.000000,0.0,0.060606,0.000000,0.000000,0.000000,0.000000,0.0,0.02,0.037037,0.000,0.000000,0.858586,0.989899,0.010101,0.000000,0.010101,0.898990,0.515152,0.161616,0.225806,0.692308,0.010101,0.020202,0.040404,0.242424,0.404040,0.606061,0.000000,0.000000,0.686869,0.030303,0.232323,0.040404,0.000000,0.031915,0.989899,0.020202,0.898990,1.000000,0.909091,0.428571,0.4,0.050505,0.060606,0.543860,0.371728,0.315789,0.181818,0.014780,0.076271,0.341463,0.003,0.002002,0.015,0.004596,0.006776,0.335362,1.0,1.000000,0.950820
19080,0.412371,0.045643,0.010101,0.262626,0.262626,0.505051,0.070707,0.030303,0.011494,0.015836,0.020449,0.015931,1.000000,0.000000,0.000000,0.494949,0.515152,1.000000,0.000000,0.000000,0.010101,0.000000,0.045455,0.0,0.000000,0.0,0.0,0.0,0.0,0.000000,0.0,0.0,0.000000,0.511905,0.583333,0.630952,0.500000,0.571429,0.619048,0.213333,0.373737,0.363636,0.272727,0.131313,0.161616,0.141414,0.32,0.212121,0.141414,0.060606,0.161616,0.131313,0.313131,0.161616,0.242424,0.323232,0.060606,0.272727,0.121212,0.404040,0.484848,0.292929,0.101010,0.020202,0.636364,0.050505,0.109589,0.242424,0.300000,0.395714,1.000000,0.989899,0.000000,0.000000,0.000000,0.000000,0.000000,0.000000,0.000000,0.135667,0.132500,0.461538,0.384615,0.979798,0.030303,1.000000,0.000000,0.000000,0.242424,0.868687,0.757576,0.232323,0.949495,0.060606,0.010101,0.00,0.010101,0.070707,0.161616,0.010101,0.160000,0.626263,0.252525,0.000000,0.000000,0.000000,0.000000,0.000000,0.030303,0.666667,0.989899,0.000000,0.000000,0.666667,0.635294,0.655556,0.245902,0.125,0.000000,0.000000,0.000000,0.030303,0.000000,0.000000,0.000000,0.505051,0.696970,0.818182,0.818182,0.230769,0.088372,0.573212,0.286667,0.312000,0.314000,0.336667,0.106386,0.080808,0.121212,0.121212,0.282828,0.292929,0.080808,0.08,0.000000,0.000000,0.030303,0.101010,0.131313,0.282828,0.333333,0.090909,0.08,0.000000,0.000000,0.313131,0.030303,0.525253,0.020202,0.212121,0.797980,0.010101,0.848485,0.141414,0.010101,0.000000,0.0,0.021277,0.00,0.000000,0.020202,0.000000,0.505051,0.244444,0.302632,0.020202,0.707071,0.686869,0.747475,0.626263,0.686869,0.575758,0.676768,0.565657,1.000000,0.0,0.000000,0.121212,0.121212,0.050505,0.121212,0.232323,0.0,0.072727,0.080808,0.000000,0.121212,0.070707,0.020202,0.040404,0.000000,0.000000,0.000000,0.292929,0.020202,0.14,0.060606,0.151515,0.060606,0.090909,0.030303,0.014925,0.080808,0.090909,0.030303,0.030303,0.070707,0.030303,0.010101,0.010101,0.787879,0.090909,0.010101,0.705882,0.111111,0.151515,0.373737,0.232323,0.162162,0.050505,0
.030303,0.041237,0.202020,0.000000,0.166667,0.121212,0.010309,0.010101,0.0,0.141414,0.262626,0.066667,0.262626,0.232323,0.505051,0.000000,0.000000,0.030303,0.032258,0.108434,0.000000,0.0,0.000000,0.254545,0.000000,0.181818,0.000000,0.0,0.02,0.037037,0.000,0.080808,0.797980,0.878788,0.000000,0.000000,0.121212,1.000000,0.747475,0.272727,0.548387,0.500000,0.000000,0.010101,0.010101,0.030303,0.929293,0.080808,0.000000,0.000000,1.000000,0.000000,0.000000,0.000000,0.000000,0.000000,1.000000,0.000000,1.000000,1.000000,1.000000,0.380952,0.6,0.101010,0.101010,0.263158,0.172775,0.315789,0.155844,0.006862,0.021186,0.097561,0.005,0.004004,0.017,0.003676,0.011729,0.342980,1.0,0.333333,0.131148
19081,0.701031,0.000000,0.000000,0.272727,0.181818,0.494949,0.060606,0.030303,0.091954,0.008419,0.008710,0.009632,0.000000,1.000000,0.000000,0.474747,0.535354,0.989899,0.000000,0.010204,0.000000,0.000000,0.000000,0.0,0.000000,0.0,0.0,0.0,0.0,0.000000,0.0,0.0,0.000000,0.523810,0.678571,0.702381,0.547619,0.654762,0.678571,0.280000,0.282828,0.424242,0.303030,0.060606,0.141414,0.171717,0.20,0.151515,0.171717,0.222222,0.101010,0.121212,0.343434,0.262626,0.181818,0.383838,0.151515,0.353535,0.333333,0.424242,0.262626,0.161616,0.060606,0.030303,0.535354,0.121212,0.260274,0.171717,0.218462,0.311429,0.797980,0.767677,0.073529,0.111111,0.060606,0.000000,0.101010,0.101010,0.000000,0.080667,0.083333,0.153846,0.153846,0.646465,0.363636,0.919192,0.090909,0.030303,0.262626,0.616162,0.505051,0.181818,0.707071,0.303030,0.070707,0.02,0.060606,0.161616,0.272727,0.070707,0.213333,0.454545,0.373737,0.000000,0.000000,0.000000,0.000000,0.010101,0.010101,0.101010,0.464646,0.000000,0.050505,0.343434,0.576471,0.555556,0.196721,0.100,0.111111,0.000000,0.080808,0.252525,0.070707,0.000000,0.030303,0.000000,0.030303,0.181818,0.606061,0.000000,0.727132,0.867196,0.146000,0.187333,0.180000,0.215333,0.073544,0.262626,0.282828,0.222222,0.070707,0.141414,0.020202,0.00,0.000000,0.000000,0.131313,0.323232,0.242424,0.101010,0.191919,0.030303,0.00,0.000000,0.000000,0.484848,0.060606,0.444444,0.050505,0.474747,0.535354,0.060606,0.727273,0.232323,0.000000,0.000000,0.0,0.000000,0.00,0.030303,0.020202,0.030303,0.191919,0.088889,0.171053,0.020202,0.181818,0.454545,0.545455,0.373737,0.545455,0.373737,0.777778,0.434343,0.888889,0.0,0.030303,0.181818,0.131313,0.070707,0.040404,0.060606,0.0,0.054545,0.141414,0.070707,0.020202,0.070707,0.050505,0.121212,0.020202,0.030769,0.080808,0.101010,0.070707,0.06,0.141414,0.080808,0.050505,0.050505,0.060606,0.000000,0.151515,0.070707,0.060606,0.030303,0.060606,0.030303,0.080808,0.050505,0.626263,0.161616,0.000000,0.705882,0.242424,0.171717,0.242424,0.131313,0.270270,0.090909,0
.020202,0.000000,0.171717,0.000000,0.208333,0.030303,0.000000,0.000000,0.0,0.141414,0.272727,0.066667,0.181818,0.333333,0.494949,0.050505,0.000000,0.040404,0.000000,0.204819,0.000000,0.0,0.040404,0.036364,0.014706,0.000000,0.000000,0.0,0.02,0.111111,0.000,0.010101,0.707071,0.979798,0.010101,0.000000,0.020202,0.949495,0.606061,0.191919,0.258065,0.673077,0.000000,0.000000,0.040404,0.080808,0.363636,0.646465,0.000000,0.000000,0.929293,0.050505,0.020202,0.000000,0.000000,0.021277,1.000000,0.000000,1.000000,1.000000,0.888889,0.238095,0.4,0.060606,0.060606,0.473684,0.298429,0.263158,0.129870,0.010980,0.080508,0.243902,0.002,0.001001,0.010,0.024816,0.004570,0.450375,1.0,0.000000,0.950820
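
Because the scaler was fitted on `X_train_num` only, the training columns span exactly [0, 1], but nothing guarantees the same for `X_test_num`: a test value below the training minimum or above the training maximum scales to less than 0 or more than 1. This is expected behaviour rather than a bug, but it can be worth checking. A small sketch of such a check, where `out_of_range_columns` is a hypothetical helper and the values are invented for illustration:

```python
import pandas as pd

def out_of_range_columns(scaled_df, low=0.0, high=1.0):
    """Return the names of columns that contain any value outside [low, high]."""
    mask = (scaled_df.min() < low) | (scaled_df.max() > high)
    return mask[mask].index.tolist()

# Toy scaled test frame (hypothetical values, not taken from the lab output).
toy_scaled_test = pd.DataFrame(
    {"AGE": [0.25, 1.5], "HIT": [0.2, 1.0], "TIMELAG": [-0.1, 0.4]}
)

flagged = out_of_range_columns(toy_scaled_test)
print(flagged)
```

In the notebook the same helper could be called as `out_of_range_columns(X_test_num)`, assuming the frame has already been scaled as above.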


In [22]:
X_train_num

Unnamed: 0,AGE,HIT,MALEMILI,MALEVET,VIETVETS,WWIIVETS,LOCALGOV,STATEGOV,FEDGOV,POP901,POP902,POP903,POP90C1,POP90C2,POP90C3,POP90C4,POP90C5,ETH1,ETH2,ETH3,ETH4,ETH5,ETH6,ETH7,ETH8,ETH9,ETH10,ETH11,ETH12,ETH13,ETH14,ETH15,ETH16,AGE901,AGE902,AGE903,AGE904,AGE905,AGE906,AGE907,CHIL1,CHIL2,CHIL3,AGEC1,AGEC2,AGEC3,AGEC4,AGEC5,AGEC6,AGEC7,CHILC1,CHILC2,CHILC3,CHILC4,CHILC5,HHAGE1,HHAGE2,HHAGE3,HHN1,HHN2,HHN3,HHN4,HHN5,HHN6,MARR1,MARR2,MARR3,MARR4,HHP1,HHP2,DW1,DW2,DW3,DW4,DW5,DW6,DW7,DW8,DW9,HV1,HV2,HV3,HV4,HU1,HU2,HU3,HU4,HU5,HHD1,HHD2,HHD3,HHD4,HHD5,HHD6,HHD7,HHD8,HHD9,HHD10,HHD11,HHD12,ETHC1,ETHC2,ETHC3,ETHC4,ETHC5,ETHC6,HVP1,HVP2,HVP3,HVP4,HVP5,HVP6,HUR1,HUR2,RHP1,RHP2,RHP3,RHP4,HUPA1,HUPA2,HUPA3,HUPA4,HUPA5,HUPA6,HUPA7,RP1,RP2,RP3,RP4,MSA,ADI,DMA,IC1,IC2,IC3,IC4,IC5,IC6,IC7,IC8,IC9,IC10,IC11,IC12,IC13,IC14,IC15,IC16,IC17,IC18,IC19,IC20,IC21,IC22,IC23,HHAS1,HHAS2,HHAS3,HHAS4,MC1,MC2,MC3,TPE1,TPE2,TPE3,TPE4,TPE5,TPE6,TPE7,TPE8,TPE9,PEC1,PEC2,TPE10,TPE11,TPE12,TPE13,LFC1,LFC2,LFC3,LFC4,LFC5,LFC6,LFC7,LFC8,LFC9,LFC10,OCC1,OCC2,OCC3,OCC4,OCC5,OCC6,OCC7,OCC8,OCC9,OCC10,OCC11,OCC12,OCC13,EIC1,EIC2,EIC3,EIC4,EIC5,EIC6,EIC7,EIC8,EIC9,EIC10,EIC11,EIC12,EIC13,EIC14,EIC15,EIC16,OEDC1,OEDC2,OEDC3,OEDC4,OEDC5,OEDC6,OEDC7,EC1,EC2,EC3,EC4,EC5,EC6,EC7,EC8,SEC1,SEC2,SEC3,SEC4,SEC5,AFC1,AFC2,AFC3,AFC4,AFC5,AFC6,VC1,VC2,VC3,VC4,ANC1,ANC2,ANC3,ANC4,ANC5,ANC6,ANC7,ANC8,ANC9,ANC10,ANC11,ANC12,ANC13,ANC14,ANC15,POBC1,POBC2,LSC1,LSC2,LSC3,LSC4,VOC1,VOC2,VOC3,HC1,HC2,HC3,HC4,HC5,HC6,HC7,HC8,HC9,HC10,HC11,HC12,HC13,HC14,HC15,HC16,HC17,HC18,HC19,HC20,HC21,MHUC1,MHUC2,AC1,AC2,CARDPROM,NUMPROM,CARDPM12,NUMPRM12,RAMNTALL,NGIFTALL,CARDGIFT,MINRAMNT,MAXRAMNT,LASTGIFT,TIMELAG,AVGGIFT,CONTROLN,HPHONE_D,RFA_2F,CLUSTER2
0,0.814433,0.099585,0.000000,0.393939,0.181818,0.575758,0.020202,0.050505,0.000000,0.017882,0.021585,0.019066,0.000000,0.0,1.000000,0.474747,0.535354,1.000000,0.010101,0.000000,0.000000,0.000000,0.0,0.000000,0.000000,0.000000,0.000000,0.000000,0.0,0.000000,0.000000,0.000000,0.000000,0.476190,0.571429,0.619048,0.500000,0.595238,0.642857,0.280000,0.444444,0.363636,0.202020,0.090909,0.161616,0.191919,0.26,0.161616,0.121212,0.151515,0.181818,0.171717,0.313131,0.212121,0.131313,0.323232,0.121212,0.292929,0.232323,0.373737,0.404040,0.191919,0.060606,0.010101,0.666667,0.060606,0.164384,0.151515,0.267692,0.350000,0.808081,0.757576,0.029412,0.040404,0.030303,0.020202,0.060606,0.060606,0.000000,0.103000,0.124000,0.153846,0.153846,0.858586,0.151515,0.717172,0.292929,0.262626,0.343434,0.767677,0.676768,0.303030,0.838384,0.171717,0.040404,0.02,0.030303,0.111111,0.191919,0.010101,0.226667,0.535354,0.282828,0.000000,0.000000,0.000000,0.030303,0.080808,0.212121,0.363636,0.636364,0.000000,0.010101,0.424242,0.600000,0.611111,0.229508,0.075,0.040404,0.000000,0.151515,0.080808,0.020202,0.000000,0.040404,0.080808,0.141414,0.242424,0.636364,0.000000,0.330233,0.643587,0.159333,0.183333,0.198000,0.229333,0.080556,0.262626,0.252525,0.181818,0.181818,0.070707,0.040404,0.02,0.000000,0.010101,0.161616,0.252525,0.222222,0.202020,0.090909,0.050505,0.02,0.000000,0.020202,0.404040,0.050505,0.404040,0.090909,0.373737,0.636364,0.050505,0.797980,0.161616,0.000000,0.000000,0.000000,0.0,0.00,0.020202,0.020202,0.010101,0.333333,0.211111,0.302632,0.060606,0.575758,0.494949,0.626263,0.383838,0.606061,0.363636,0.525253,0.464646,0.676768,1.0,0.030303,0.101010,0.141414,0.040404,0.101010,0.080808,0.000000,0.018182,0.070707,0.020202,0.141414,0.151515,0.070707,0.070707,0.020202,0.169231,0.040404,0.353535,0.070707,0.02,0.030303,0.131313,0.010101,0.020202,0.040404,0.044776,0.040404,0.050505,0.040404,0.020202,0.020202,0.050505,0.000000,0.040404,0.868687,0.020202,0.000000,0.705882,0.222222,0.131313,0.262626,0.14141
4,0.189189,0.131313,0.040404,0.020619,0.171717,0.066667,0.194444,0.030303,0.0,0.000000,0.0,0.191919,0.393939,0.066667,0.181818,0.212121,0.575758,0.050505,0.012048,0.111111,0.032258,0.096386,0.000000,0.0,0.101010,0.000000,0.000000,0.010101,0.000000,0.000000,0.06,0.000000,0.0,0.010101,0.777778,0.979798,0.020202,0.000000,0.010101,0.939394,0.686869,0.222222,0.258065,0.365385,0.030303,0.131313,0.292929,0.555556,0.727273,0.282828,0.022472,0.000000,0.000000,0.020202,0.282828,0.484848,0.0,0.244681,0.383838,0.343434,0.090909,1.000000,0.969697,0.285714,0.4,0.060606,0.090909,0.210526,0.136126,0.315789,0.142857,0.006862,0.021186,0.073171,0.005,0.004004,0.010,0.000919,0.011729,0.115133,1.0,0.666667,0.967213
1,0.546392,0.000000,0.010101,0.252525,0.333333,0.323232,0.070707,0.090909,0.022989,0.045278,0.053522,0.046465,0.000000,0.0,1.000000,0.494949,0.515152,0.858586,0.141414,0.000000,0.000000,0.000000,0.0,0.000000,0.000000,0.000000,0.000000,0.000000,0.0,0.000000,0.000000,0.000000,0.000000,0.392857,0.500000,0.535714,0.416667,0.535714,0.571429,0.360000,0.353535,0.424242,0.242424,0.131313,0.212121,0.202020,0.32,0.141414,0.101010,0.060606,0.131313,0.151515,0.333333,0.232323,0.161616,0.252525,0.101010,0.232323,0.212121,0.313131,0.484848,0.272727,0.101010,0.040404,0.636364,0.070707,0.123288,0.202020,0.295385,0.385714,0.656566,0.656566,0.000000,0.000000,0.000000,0.000000,0.000000,0.000000,0.000000,0.062833,0.072000,0.153846,0.076923,0.848485,0.161616,0.909091,0.101010,0.252525,0.404040,0.777778,0.646465,0.333333,0.909091,0.101010,0.070707,0.04,0.060606,0.121212,0.171717,0.010101,0.226667,0.535354,0.151515,0.090909,0.080808,0.020202,0.010101,0.010101,0.050505,0.111111,0.333333,0.000000,0.020202,0.404040,0.588235,0.588889,0.245902,0.125,0.000000,0.000000,0.333333,0.090909,0.000000,0.000000,0.060606,0.000000,0.000000,0.030303,0.101010,0.000000,0.550388,0.624291,0.122667,0.151333,0.145333,0.172000,0.050092,0.434343,0.242424,0.161616,0.101010,0.060606,0.010101,0.00,0.000000,0.000000,0.313131,0.282828,0.191919,0.131313,0.070707,0.010101,0.00,0.000000,0.000000,0.363636,0.131313,0.191919,0.222222,0.242424,0.767677,0.010101,0.717172,0.232323,0.010101,0.010101,0.000000,0.0,0.00,0.020202,0.030303,0.171717,0.222222,0.333333,0.460526,0.161616,0.707071,0.545455,0.636364,0.464646,0.555556,0.434343,0.616162,0.484848,0.575758,1.0,0.090909,0.090909,0.060606,0.030303,0.060606,0.111111,0.000000,0.000000,0.080808,0.090909,0.202020,0.171717,0.040404,0.070707,0.080808,0.000000,0.131313,0.282828,0.020202,0.04,0.020202,0.141414,0.040404,0.030303,0.060606,0.000000,0.050505,0.090909,0.030303,0.020202,0.050505,0.060606,0.010101,0.090909,0.777778,0.010101,0.010101,0.705882,0.262626,0.232323,0.333333,0.09090
9,0.135135,0.030303,0.020202,0.000000,0.272727,0.000000,0.319444,0.040404,0.0,0.000000,0.0,0.121212,0.242424,0.033333,0.272727,0.262626,0.353535,0.111111,0.000000,0.070707,0.064516,0.036145,0.000000,0.0,0.060606,0.000000,0.000000,0.000000,0.000000,0.000000,0.04,0.000000,0.0,0.000000,0.828283,0.969697,0.020202,0.000000,0.020202,0.888889,0.646465,0.232323,0.419355,0.403846,0.030303,0.111111,0.262626,0.525253,0.686869,0.323232,0.000000,0.000000,0.020202,0.313131,0.262626,0.262626,0.0,0.159574,0.010101,0.989899,0.010101,0.989899,0.929293,0.380952,0.4,0.090909,0.080808,0.350877,0.282723,0.315789,0.181818,0.016047,0.050847,0.121951,0.005,0.003203,0.006,0.008272,0.011421,0.108323,1.0,1.000000,0.967213
2,0.624862,0.000000,0.000000,0.000000,0.000000,0.000000,0.000000,0.000000,0.000000,0.014235,0.021123,0.020111,1.000000,0.0,0.000000,0.474747,0.535354,1.000000,0.010101,0.000000,0.000000,0.010101,0.0,0.000000,0.000000,0.000000,0.000000,0.000000,0.0,0.000000,0.000000,0.000000,0.000000,0.761905,0.773810,0.773810,0.690476,0.726190,0.750000,0.093333,0.353535,0.303030,0.343434,0.030303,0.070707,0.060606,0.14,0.242424,0.373737,0.161616,0.161616,0.141414,0.222222,0.303030,0.181818,0.646465,0.171717,0.606061,0.262626,0.636364,0.121212,0.040404,0.010101,0.000000,0.727273,0.080808,0.164384,0.090909,0.213846,0.274286,0.101010,0.090909,0.000000,0.575758,0.575758,0.535354,0.030303,0.000000,0.030303,0.253833,0.241500,0.461538,0.461538,0.878788,0.131313,0.515152,0.494949,0.818182,0.060606,0.717172,0.666667,0.050505,0.797980,0.212121,0.010101,0.00,0.010101,0.121212,0.202020,0.040404,0.066667,0.303030,0.646465,0.018182,0.000000,0.000000,0.222222,0.525253,0.727273,0.797980,0.909091,0.000000,0.010101,0.141414,0.517647,0.533333,0.196721,0.050,0.202020,0.383838,0.333333,0.010101,0.000000,0.101010,0.020202,0.737374,0.848485,0.919192,0.959596,0.571047,0.206202,0.648127,0.183333,0.200000,0.214000,0.240667,0.113888,0.171717,0.252525,0.262626,0.191919,0.111111,0.000000,0.00,0.000000,0.020202,0.090909,0.262626,0.272727,0.212121,0.131313,0.000000,0.00,0.000000,0.030303,0.707071,0.010101,0.696970,0.010101,0.636364,0.383838,0.454545,0.848485,0.131313,0.000000,0.000000,0.000000,0.0,0.00,0.000000,0.030303,0.000000,0.000000,0.211111,0.263158,0.000000,0.828283,0.272727,0.292929,0.252525,0.292929,0.242424,1.000000,1.000000,0.000000,0.0,0.000000,0.030303,0.181818,0.060606,0.131313,0.141414,0.069767,0.036364,0.111111,0.030303,0.171717,0.030303,0.030303,0.060606,0.000000,0.000000,0.141414,0.080808,0.030303,0.04,0.000000,0.262626,0.060606,0.111111,0.080808,0.044776,0.080808,0.030303,0.060606,0.030303,0.111111,0.020202,0.000000,0.000000,0.787879,0.080808,0.000000,0.705882,0.040404,0.040404,0.535354,0.26262
6,0.081081,0.070707,0.030303,0.000000,0.050505,0.000000,0.055556,0.010101,0.0,0.000000,0.0,0.353535,0.696970,0.066667,0.060606,0.131313,0.797980,0.000000,0.012048,0.121212,0.032258,0.072289,0.021277,0.0,0.090909,0.145455,0.014706,0.010101,0.000000,0.000000,0.04,0.037037,0.0,0.050505,0.030303,1.000000,0.000000,0.000000,0.010101,1.000000,0.373737,0.030303,0.096774,0.096154,0.212121,0.535354,0.626263,0.949495,1.000000,0.010101,0.134831,0.096774,0.020202,0.040404,0.929293,0.010101,0.0,0.010638,0.979798,0.030303,0.838384,1.000000,1.000000,0.380952,0.2,0.070707,0.151515,0.438596,0.303665,0.315789,0.142857,0.004751,0.055085,0.170732,0.002,0.000400,0.005,0.003676,0.002861,0.226069,1.0,1.000000,0.196721
3,0.624862,0.000000,0.000000,0.282828,0.202020,0.383838,0.070707,0.040404,0.034483,0.021216,0.025835,0.023614,0.000000,0.0,1.000000,0.505051,0.505051,1.000000,0.000000,0.000000,0.000000,0.000000,0.0,0.000000,0.000000,0.000000,0.000000,0.000000,0.0,0.000000,0.000000,0.000000,0.000000,0.452381,0.583333,0.619048,0.476190,0.595238,0.630952,0.333333,0.363636,0.393939,0.252525,0.090909,0.171717,0.161616,0.32,0.161616,0.151515,0.121212,0.151515,0.131313,0.323232,0.242424,0.161616,0.373737,0.161616,0.353535,0.252525,0.393939,0.373737,0.222222,0.101010,0.040404,0.686869,0.060606,0.123288,0.171717,0.255385,0.357143,0.878788,0.868687,0.000000,0.020202,0.020202,0.000000,0.000000,0.000000,0.000000,0.025000,0.035167,0.076923,0.000000,0.777778,0.232323,0.808081,0.202020,0.030303,0.313131,0.737374,0.676768,0.282828,0.878788,0.131313,0.030303,0.04,0.020202,0.121212,0.181818,0.020202,0.266667,0.545455,0.262626,0.000000,0.000000,0.000000,0.000000,0.000000,0.010101,0.020202,0.070707,0.000000,0.020202,0.494949,0.647059,0.622222,0.213115,0.100,0.020202,0.000000,0.111111,0.202020,0.020202,0.000000,0.010101,0.010101,0.010101,0.040404,0.101010,0.000000,0.351938,0.813848,0.118667,0.148667,0.152667,0.180667,0.057744,0.434343,0.242424,0.131313,0.101010,0.080808,0.010101,0.00,0.000000,0.010101,0.303030,0.272727,0.171717,0.121212,0.111111,0.020202,0.00,0.000000,0.010101,0.393939,0.070707,0.404040,0.232323,0.252525,0.757576,0.010101,0.626263,0.141414,0.010101,0.010101,0.000000,0.0,0.00,0.050505,0.191919,0.020202,0.232323,0.122222,0.263158,0.060606,0.414141,0.565657,0.696970,0.434343,0.636364,0.424242,0.585859,0.484848,0.888889,1.0,0.030303,0.060606,0.040404,0.010101,0.090909,0.111111,0.023256,0.000000,0.121212,0.353535,0.080808,0.020202,0.070707,0.040404,0.363636,0.015385,0.050505,0.040404,0.040404,0.06,0.030303,0.141414,0.050505,0.000000,0.020202,0.000000,0.090909,0.101010,0.020202,0.020202,0.070707,0.040404,0.030303,0.303030,0.494949,0.050505,0.030303,0.705882,0.151515,0.101010,0.515152,0.13131
3,0.081081,0.040404,0.030303,0.010309,0.232323,0.033333,0.277778,0.040404,0.0,0.000000,0.0,0.141414,0.282828,0.000000,0.202020,0.292929,0.383838,0.010101,0.000000,0.090909,0.000000,0.240964,0.000000,0.0,0.101010,0.000000,0.000000,0.000000,0.000000,0.000000,0.02,0.000000,0.0,0.000000,0.737374,0.969697,0.000000,0.000000,0.040404,0.959596,0.707071,0.303030,0.451613,0.750000,0.000000,0.010101,0.090909,0.333333,0.464646,0.545455,0.000000,0.000000,0.121212,0.383838,0.141414,0.101010,0.0,0.276596,0.555556,0.363636,0.232323,0.949495,1.000000,0.333333,0.4,0.060606,0.090909,0.456140,0.340314,0.263158,0.168831,0.007126,0.072034,0.243902,0.002,0.001001,0.005,0.011949,0.003191,0.973063,0.0,1.000000,0.918033
4,0.624862,0.000000,0.010101,0.333333,0.292929,0.272727,0.030303,0.030303,0.022989,0.017811,0.021838,0.020789,1.000000,0.0,0.000000,0.474747,0.535354,0.969697,0.020202,0.010204,0.010101,0.010101,0.0,0.014925,0.000000,0.000000,0.000000,0.000000,0.0,0.010309,0.000000,0.000000,0.000000,0.345238,0.428571,0.464286,0.380952,0.476190,0.523810,0.333333,0.474747,0.353535,0.191919,0.171717,0.292929,0.232323,0.24,0.090909,0.060606,0.050505,0.222222,0.171717,0.313131,0.161616,0.141414,0.151515,0.080808,0.141414,0.252525,0.353535,0.404040,0.181818,0.050505,0.010101,0.626263,0.131313,0.082192,0.181818,0.263077,0.341429,0.363636,0.333333,0.014706,0.636364,0.626263,0.606061,0.000000,0.000000,0.000000,0.159667,0.169167,0.307692,0.307692,0.424242,0.585859,0.878788,0.131313,0.000000,0.383838,0.717172,0.585859,0.282828,0.858586,0.151515,0.101010,0.02,0.090909,0.131313,0.191919,0.040404,0.266667,0.646465,0.111111,0.000000,0.010101,0.000000,0.010101,0.050505,0.444444,0.808081,0.989899,0.010101,0.010101,0.353535,0.576471,0.555556,0.229508,0.100,0.202020,0.434343,0.000000,0.030303,0.040404,0.515152,0.000000,0.101010,0.444444,0.939394,1.000000,0.572650,0.280620,0.748014,0.192000,0.218667,0.223333,0.247333,0.087197,0.151515,0.232323,0.252525,0.222222,0.101010,0.040404,0.02,0.000000,0.000000,0.141414,0.161616,0.252525,0.252525,0.131313,0.060606,0.02,0.000000,0.000000,0.181818,0.010101,0.404040,0.060606,0.828283,0.181818,0.242424,0.828283,0.121212,0.020202,0.020202,0.000000,0.0,0.00,0.010101,0.030303,0.030303,0.555556,0.333333,0.434211,0.070707,0.646465,0.787879,0.878788,0.707071,0.838384,0.666667,0.696970,0.505051,1.000000,0.0,0.040404,0.111111,0.151515,0.020202,0.212121,0.191919,0.000000,0.018182,0.090909,0.000000,0.080808,0.020202,0.060606,0.060606,0.000000,0.000000,0.111111,0.080808,0.050505,0.04,0.050505,0.292929,0.050505,0.080808,0.020202,0.029851,0.070707,0.050505,0.080808,0.030303,0.030303,0.030303,0.020202,0.080808,0.787879,0.050505,0.010101,0.741176,0.040404,0.090909,0.323232,0.33333
3,0.081081,0.131313,0.060606,0.020619,0.222222,0.100000,0.263889,0.020202,0.0,0.010101,0.0,0.161616,0.333333,0.000000,0.292929,0.040404,0.272727,0.131313,0.000000,0.090909,0.000000,0.108434,0.000000,0.0,0.101010,0.018182,0.000000,0.000000,0.000000,0.000000,0.04,0.037037,0.0,0.030303,0.474747,0.969697,0.030303,0.000000,0.010101,1.000000,0.626263,0.101010,0.064516,0.192308,0.000000,0.494949,0.575758,0.959596,1.000000,0.000000,0.044944,0.016129,0.262626,0.000000,0.727273,0.010101,0.0,0.000000,1.000000,0.000000,0.989899,1.000000,0.969697,0.476190,0.8,0.020202,0.050505,0.350877,0.240838,0.263158,0.142857,0.007812,0.046610,0.170732,0.003,0.002002,0.011,0.002757,0.005972,0.261803,1.0,0.000000,0.524590
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
76324,0.845361,0.000000,0.000000,0.474747,0.191919,0.545455,0.101010,0.131313,0.011494,0.014134,0.015779,0.020224,1.000000,0.0,0.000000,0.404040,0.606061,0.969697,0.030303,0.000000,0.010101,0.010101,0.0,0.000000,0.000000,0.014925,0.000000,0.000000,0.0,0.000000,0.017544,0.000000,0.000000,0.476190,0.559524,0.607143,0.500000,0.583333,0.630952,0.240000,0.434343,0.373737,0.202020,0.111111,0.181818,0.171717,0.26,0.151515,0.151515,0.121212,0.181818,0.191919,0.252525,0.222222,0.161616,0.353535,0.232323,0.343434,0.464646,0.303030,0.242424,0.121212,0.030303,0.010101,0.515152,0.161616,0.191781,0.202020,0.176923,0.278571,0.434343,0.414141,0.000000,0.565657,0.565657,0.484848,0.000000,0.000000,0.000000,0.112833,0.117500,0.230769,0.307692,0.363636,0.646465,0.969697,0.040404,0.000000,0.212121,0.525253,0.404040,0.161616,0.737374,0.272727,0.060606,0.02,0.050505,0.121212,0.424242,0.020202,0.200000,0.535354,0.282828,0.000000,0.020202,0.000000,0.000000,0.000000,0.090909,0.353535,0.868687,0.000000,0.060606,0.323232,0.564706,0.533333,0.180328,0.100,0.272727,0.292929,0.000000,0.070707,0.090909,0.474747,0.000000,0.090909,0.323232,0.686869,0.989899,0.820513,0.497674,0.694665,0.174000,0.243333,0.193333,0.252667,0.089752,0.262626,0.212121,0.242424,0.131313,0.141414,0.020202,0.00,0.000000,0.000000,0.121212,0.191919,0.171717,0.222222,0.272727,0.030303,0.00,0.000000,0.000000,0.474747,0.000000,0.515152,0.070707,0.565657,0.444444,0.151515,0.777778,0.181818,0.030303,0.030303,0.000000,0.0,0.00,0.000000,0.020202,0.020202,0.070707,0.188889,0.236842,0.000000,0.646465,0.575758,0.696970,0.505051,0.676768,0.505051,0.787879,0.737374,0.353535,0.0,0.111111,0.232323,0.111111,0.030303,0.161616,0.252525,0.000000,0.036364,0.060606,0.010101,0.050505,0.030303,0.040404,0.030303,0.010101,0.015385,0.020202,0.080808,0.040404,0.06,0.070707,0.161616,0.090909,0.040404,0.040404,0.000000,0.141414,0.121212,0.090909,0.080808,0.101010,0.131313,0.010101,0.060606,0.595960,0.090909,0.010101,0.823529,0.030303,0.090909,0.242424,0.2
52525,0.108108,0.212121,0.141414,0.041237,0.212121,0.033333,0.208333,0.080808,0.0,0.000000,0.0,0.171717,0.474747,0.033333,0.191919,0.161616,0.545455,0.080808,0.000000,0.070707,0.161290,0.084337,0.000000,0.0,0.101010,0.054545,0.000000,0.010101,0.023256,0.000000,0.02,0.000000,0.0,0.010101,0.595960,0.989899,0.010101,0.000000,0.010101,0.888889,0.383838,0.131313,0.129032,0.461538,0.000000,0.000000,0.000000,0.050505,0.454545,0.555556,0.000000,0.032258,0.383838,0.000000,0.626263,0.000000,0.0,0.000000,1.000000,0.000000,1.000000,1.000000,1.000000,0.285714,0.4,0.060606,0.060606,0.403509,0.303665,0.315789,0.155844,0.011719,0.042373,0.073171,0.005,0.003003,0.015,0.007353,0.010000,0.574221,0.0,0.333333,0.196721
76325,0.793814,0.000000,0.000000,0.171717,0.333333,0.313131,0.050505,0.030303,0.011494,0.020952,0.024783,0.020563,1.000000,0.0,0.000000,0.505051,0.505051,0.959596,0.010101,0.000000,0.030303,0.060606,0.0,0.000000,0.010101,0.014925,0.021739,0.000000,0.0,0.030928,0.000000,0.000000,0.034884,0.428571,0.500000,0.511905,0.428571,0.523810,0.559524,0.320000,0.434343,0.353535,0.222222,0.090909,0.181818,0.292929,0.40,0.131313,0.070707,0.030303,0.181818,0.171717,0.303030,0.191919,0.161616,0.161616,0.040404,0.131313,0.121212,0.373737,0.515152,0.303030,0.101010,0.030303,0.676768,0.060606,0.054795,0.232323,0.312308,0.404286,1.000000,0.979798,0.000000,0.010101,0.000000,0.000000,0.000000,0.000000,0.000000,0.896667,0.848167,0.923077,0.769231,0.939394,0.070707,0.969697,0.040404,0.101010,0.383838,0.818182,0.747475,0.353535,0.888889,0.121212,0.030303,0.04,0.010101,0.131313,0.101010,0.070707,0.253333,0.636364,0.131313,0.000000,0.010101,0.000000,0.979798,0.989899,1.000000,1.000000,1.000000,0.919192,0.010101,0.777778,0.800000,0.755556,0.245902,0.100,0.010101,0.000000,0.000000,0.070707,0.000000,0.000000,0.000000,0.838384,0.858586,0.929293,0.929293,0.786325,0.100775,0.916005,0.466667,0.496000,0.513333,0.539333,0.187017,0.060606,0.050505,0.060606,0.181818,0.181818,0.131313,0.30,0.147541,0.090909,0.030303,0.050505,0.040404,0.202020,0.191919,0.151515,0.30,0.090909,0.101010,0.181818,0.060606,0.606061,0.020202,0.555556,0.454545,0.060606,0.808081,0.121212,0.030303,0.000000,0.054545,0.0,0.00,0.000000,0.050505,0.000000,0.434343,0.255556,0.421053,0.121212,0.757576,0.757576,0.848485,0.666667,0.828283,0.656566,0.676768,0.555556,0.515152,0.0,0.050505,0.262626,0.252525,0.030303,0.131313,0.121212,0.000000,0.036364,0.030303,0.010101,0.101010,0.010101,0.020202,0.010101,0.010101,0.015385,0.121212,0.181818,0.050505,0.04,0.090909,0.121212,0.060606,0.080808,0.010101,0.029851,0.070707,0.060606,0.080808,0.030303,0.050505,0.030303,0.010101,0.141414,0.737374,0.050505,0.000000,0.882353,0.020202,0.030303,0.171717,0.2
32323,0.270270,0.262626,0.181818,0.061856,0.181818,0.166667,0.180556,0.060606,0.0,0.000000,0.0,0.090909,0.171717,0.066667,0.333333,0.171717,0.313131,0.111111,0.000000,0.060606,0.064516,0.120482,0.021277,0.0,0.050505,0.109091,0.014706,0.020202,0.000000,0.038462,0.00,0.074074,0.0,0.070707,0.636364,0.919192,0.040404,0.020202,0.040404,1.000000,0.848485,0.393939,0.161290,0.500000,0.040404,0.262626,0.262626,0.434343,0.616162,0.393939,0.000000,0.000000,0.939394,0.000000,0.060606,0.000000,0.0,0.010638,1.000000,0.000000,0.949495,1.000000,1.000000,0.952381,0.6,0.030303,0.070707,0.421053,0.303665,0.315789,0.168831,0.009818,0.038136,0.097561,0.005,0.002002,0.012,0.007353,0.009326,0.847777,1.0,0.666667,0.016393
76326,0.556701,0.000000,0.000000,0.363636,0.333333,0.262626,0.070707,0.202020,0.011494,0.010405,0.011908,0.010705,0.282828,0.0,0.727273,0.464646,0.545455,0.888889,0.111111,0.000000,0.000000,0.020202,0.0,0.000000,0.000000,0.000000,0.000000,0.000000,0.0,0.000000,0.017544,0.000000,0.011628,0.452381,0.511905,0.535714,0.476190,0.559524,0.595238,0.253333,0.434343,0.373737,0.212121,0.090909,0.212121,0.232323,0.30,0.121212,0.090909,0.121212,0.171717,0.171717,0.333333,0.202020,0.131313,0.161616,0.040404,0.131313,0.202020,0.414141,0.393939,0.191919,0.040404,0.010101,0.575758,0.111111,0.219178,0.171717,0.266154,0.347143,0.595960,0.595960,0.014706,0.010101,0.000000,0.000000,0.101010,0.101010,0.000000,0.233000,0.252000,0.230769,0.230769,0.878788,0.131313,0.959596,0.050505,0.000000,0.323232,0.757576,0.636364,0.272727,0.777778,0.232323,0.050505,0.02,0.040404,0.141414,0.181818,0.050505,0.200000,0.555556,0.191919,0.018182,0.060606,0.030303,0.232323,0.424242,0.696970,0.888889,1.000000,0.030303,0.010101,0.505051,0.658824,0.633333,0.229508,0.100,0.010101,0.000000,0.404040,0.070707,0.010101,0.000000,0.060606,0.141414,0.282828,0.444444,0.797980,0.709402,0.544186,0.635641,0.284000,0.304667,0.314000,0.342667,0.122688,0.111111,0.161616,0.080808,0.262626,0.303030,0.020202,0.04,0.098361,0.000000,0.020202,0.131313,0.090909,0.323232,0.333333,0.030303,0.00,0.070707,0.000000,0.131313,0.020202,0.525253,0.050505,0.515152,0.494949,0.141414,0.737374,0.232323,0.000000,0.000000,0.000000,0.0,0.04,0.010101,0.010101,0.010101,0.646465,0.222222,0.302632,0.020202,0.818182,0.777778,0.888889,0.696970,0.888889,0.666667,0.898990,0.626263,1.000000,0.0,0.000000,0.282828,0.121212,0.070707,0.070707,0.141414,0.000000,0.054545,0.060606,0.000000,0.181818,0.020202,0.010101,0.010101,0.010101,0.000000,0.131313,0.101010,0.040404,0.00,0.000000,0.141414,0.050505,0.050505,0.000000,0.044776,0.121212,0.232323,0.050505,0.050505,0.070707,0.202020,0.010101,0.060606,0.535354,0.121212,0.000000,0.823529,0.050505,0.121212,0.232323,0.1
41414,0.243243,0.151515,0.202020,0.072165,0.181818,0.033333,0.222222,0.080808,0.0,0.000000,0.0,0.171717,0.363636,0.000000,0.333333,0.131313,0.262626,0.060606,0.000000,0.080808,0.000000,0.060241,0.000000,0.0,0.010101,0.000000,0.000000,0.020202,0.000000,0.019231,0.02,0.000000,0.0,0.020202,0.616162,0.929293,0.000000,0.000000,0.080808,0.959596,0.767677,0.252525,0.161290,0.307692,0.000000,0.272727,0.383838,0.747475,0.979798,0.030303,0.000000,0.000000,0.000000,0.292929,0.565657,0.101010,0.0,0.053191,0.050505,0.949495,0.040404,1.000000,0.969697,0.476190,0.4,0.060606,0.050505,0.122807,0.078534,0.315789,0.155844,0.004434,0.004237,0.048780,0.025,0.005005,0.030,0.004596,0.026248,0.092425,1.0,0.333333,0.262295
76327,0.752577,0.000000,0.010101,0.373737,0.252525,0.282828,0.131313,0.040404,0.034483,0.016352,0.020870,0.015281,1.000000,0.0,0.000000,0.494949,0.515152,0.838384,0.020202,0.000000,0.131313,0.070707,0.0,0.074627,0.050505,0.029851,0.021739,0.021277,0.0,0.041237,0.000000,0.012346,0.011628,0.476190,0.547619,0.571429,0.440476,0.547619,0.595238,0.320000,0.303030,0.404040,0.313131,0.121212,0.111111,0.212121,0.54,0.161616,0.101010,0.030303,0.111111,0.131313,0.303030,0.242424,0.222222,0.212121,0.020202,0.161616,0.060606,0.404040,0.545455,0.343434,0.121212,0.030303,0.676768,0.060606,0.054795,0.232323,0.336923,0.425714,1.000000,0.898990,0.000000,0.010101,0.000000,0.000000,0.000000,0.000000,0.000000,0.592667,0.598000,0.923077,0.846154,0.949495,0.060606,1.000000,0.010101,0.141414,0.404040,0.929293,0.808081,0.353535,0.959596,0.050505,0.050505,0.02,0.040404,0.050505,0.101010,0.020202,0.186667,0.555556,0.141414,0.000000,0.010101,0.000000,0.989899,1.000000,1.000000,1.000000,1.000000,0.808081,0.000000,0.888889,0.823529,0.788889,0.245902,0.100,0.000000,0.000000,0.000000,0.060606,0.000000,0.000000,0.000000,0.949495,0.949495,0.979798,0.979798,0.635150,0.020155,0.911464,0.490000,0.467333,0.558667,0.536667,0.215824,0.040404,0.040404,0.060606,0.050505,0.323232,0.161616,0.20,0.065574,0.181818,0.050505,0.020202,0.070707,0.080808,0.343434,0.161616,0.14,0.050505,0.161616,0.171717,0.040404,0.666667,0.030303,0.343434,0.666667,0.020202,0.878788,0.080808,0.010101,0.010101,0.000000,0.0,0.00,0.000000,0.040404,0.010101,0.585859,0.277778,0.407895,0.101010,0.737374,0.767677,0.828283,0.717172,0.818182,0.707071,0.737374,0.727273,0.505051,0.0,0.020202,0.262626,0.232323,0.030303,0.131313,0.202020,0.000000,0.018182,0.050505,0.010101,0.050505,0.000000,0.010101,0.020202,0.000000,0.000000,0.040404,0.161616,0.060606,0.08,0.050505,0.080808,0.090909,0.050505,0.030303,0.029851,0.080808,0.111111,0.101010,0.070707,0.131313,0.040404,0.030303,0.111111,0.646465,0.040404,0.000000,0.882353,0.010101,0.040404,0.181818,0.2
22222,0.243243,0.222222,0.232323,0.041237,0.272727,0.066667,0.208333,0.141414,0.0,0.010101,0.0,0.202020,0.373737,0.066667,0.252525,0.272727,0.282828,0.050505,0.000000,0.080808,0.000000,0.072289,0.021277,0.0,0.020202,0.090909,0.014706,0.010101,0.000000,0.019231,0.02,0.037037,0.0,0.090909,0.555556,0.898990,0.020202,0.070707,0.020202,1.000000,0.929293,0.424242,0.387097,0.423077,0.000000,0.000000,0.000000,0.444444,1.000000,0.000000,0.134831,0.000000,0.929293,0.000000,0.080808,0.000000,0.0,0.000000,1.000000,0.000000,1.000000,1.000000,1.000000,0.714286,0.6,0.080808,0.070707,0.719298,0.534031,0.578947,0.311688,0.029666,0.165254,0.439024,0.003,0.001401,0.008,0.002757,0.006072,0.010528,0.0,0.666667,0.016393


In [23]:
X_train_cat_ord = X_train_cat[ord_cols]
X_test_cat_ord = X_test_cat[ord_cols]

In [24]:
ordinal = OrdinalEncoder().fit(X_train_cat_ord)

X_train_cat_ord = pd.DataFrame(ordinal.transform(X_train_cat_ord), columns=X_train_cat_ord.columns)
X_test_cat_ord = pd.DataFrame(ordinal.transform(X_test_cat_ord), columns=X_test_cat_ord.columns)

In [25]:
X_train_cat_ord

Unnamed: 0,DOMAIN_B,INCOME,WEALTH1,WEALTH2
0,1.0,4.0,2.0,9.0
1,1.0,1.0,3.0,3.0
2,1.0,4.0,9.0,9.0
3,2.0,1.0,9.0,9.0
4,1.0,2.0,6.0,5.0
...,...,...,...,...
76324,1.0,2.0,9.0,7.0
76325,0.0,0.0,9.0,9.0
76326,0.0,6.0,8.0,9.0
76327,0.0,5.0,9.0,9.0


In [26]:
X_test_cat_ord

Unnamed: 0,DOMAIN_B,INCOME,WEALTH1,WEALTH2
0,1.0,5.0,3.0,9.0
1,1.0,3.0,8.0,9.0
2,0.0,5.0,9.0,9.0
3,1.0,4.0,9.0,9.0
4,1.0,4.0,5.0,4.0
...,...,...,...,...
19078,2.0,6.0,9.0,9.0
19079,1.0,1.0,2.0,3.0
19080,1.0,4.0,7.0,7.0
19081,1.0,4.0,9.0,4.0


In [27]:
X_train_cat_onehot = X_train_cat.drop(columns=ord_cols)
X_test_cat_onehot = X_test_cat.drop(columns=ord_cols)

In [28]:
X_train_cat_onehot.columns == X_test_cat_onehot.columns

array([ True,  True,  True,  True,  True,  True,  True,  True,  True,
        True,  True,  True,  True,  True,  True,  True,  True,  True,
        True,  True,  True,  True])

In [29]:
X_train_cat_onehot.dtypes

STATE           object
CLUSTER          int64
HOMEOWNR        object
GENDER          object
DATASRCE         int64
RFA_2R          object
RFA_2A          object
GEOCODE2        object
DOMAIN_A        object
ODATEW_YR        int64
ODATEW_MM        int64
DOB_YR           int64
DOB_MM           int64
MINRDATE_YR      int64
MINRDATE_MM      int64
MAXRDATE_YR      int64
MAXRDATE_MM      int64
LASTDATE_YR      int64
LASTDATE_MM      int64
FIRSTDATE_YR     int64
FIRSTDATE_MM     int64
TCODE            int64
dtype: object

In [30]:
onehot = OneHotEncoder(drop='first',handle_unknown='ignore').fit(X_train_cat_onehot)

X_train_cat_onehot_encoded = onehot.transform(X_train_cat_onehot)
X_test_cat_onehot_encoded = onehot.transform(X_test_cat_onehot)



In [31]:
X_train_cat_onehot = pd.DataFrame(X_train_cat_onehot_encoded.toarray(), columns=onehot.get_feature_names_out(X_train_cat_onehot.columns))
X_test_cat_onehot = pd.DataFrame(X_test_cat_onehot_encoded.toarray(), columns=onehot.get_feature_names_out(X_test_cat_onehot.columns))

In [32]:
(X_test_cat_ord.isnull().any()).unique()

array([False])

In [33]:
X_train = pd.concat([X_train_num,X_train_cat_ord,X_train_cat_onehot],axis=1)
X_test = pd.concat([X_test_num,X_test_cat_ord,X_test_cat_onehot],axis=1)

In [34]:
(X_test.isnull().any()).value_counts()

False    677
Name: count, dtype: int64

In [35]:
from sklearn.linear_model import LogisticRegression

classification = LogisticRegression(random_state=7, solver='saga',
                  multi_class='multinomial', max_iter=100).fit(X_train, y_train)

classification.score(X_test, y_test)



0.9474401299586019

In [42]:
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

In [43]:
pred = classification.predict(X_test)

In [44]:
from sklearn.metrics import confusion_matrix
confusion_matrix(y_test,pred)

array([[18080,     0],
       [ 1003,     0]])
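
The second column of the confusion matrix is empty, so the model predicts class 0 for every test row. The 0.947 accuracy is therefore just the share of non-donors in the test set. A minimal sketch that checks this from the counts above (the variable names are illustrative):

```python
# Counts copied from the confusion matrix above: rows = actual, columns = predicted
cm = [[18080, 0],
      [1003, 0]]

total = sum(sum(row) for row in cm)
accuracy = (cm[0][0] + cm[1][1]) / total
majority_share = sum(cm[0]) / total      # fraction of actual non-donors
minority_recall = cm[1][1] / sum(cm[1])  # fraction of actual donors the model finds

print(f"accuracy:        {accuracy:.4f}")
print(f"majority share:  {majority_share:.4f}")
print(f"minority recall: {minority_recall:.4f}")
```

Accuracy and majority share come out identical, which is why accuracy alone says little on data this imbalanced.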

In [45]:
from imblearn.over_sampling import SMOTE

sm = SMOTE(random_state=127)
X_train_SMOTE,y_train_SMOTE = sm.fit_resample(X_train,y_train)
X_train_SMOTE.shape

(144978, 677)
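
With its default settings SMOTE grows the minority class until it matches the majority class, so the 144978 resampled rows should split evenly between the two classes. Each synthetic row is an interpolation between a minority sample and one of its minority-class neighbours. The following is a simplified pure-Python sketch of that interpolation step only; it skips the nearest-neighbour search, it is not imblearn's implementation, and `interpolate` and the sample values are hypothetical:

```python
import random

def interpolate(sample, neighbour, rng):
    """Return a synthetic point on the segment between two minority samples."""
    lam = rng.random()  # random position in [0, 1) along the segment
    return [s + lam * (n - s) for s, n in zip(sample, neighbour)]

rng = random.Random(127)
x = [0.2, 0.5]          # hypothetical minority sample (two scaled features)
neighbour = [0.4, 0.1]  # hypothetical minority-class neighbour
synthetic = interpolate(x, neighbour, rng)
print(synthetic)

# Size check, assuming the default strategy of balancing both classes
n_resampled = 144978
rows_per_class = n_resampled // 2
print(rows_per_class)
```

In the notebook itself, `y_train_SMOTE.value_counts()` would confirm the balance directly.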

In [46]:
LR = LogisticRegression(random_state=7, solver='saga',
                  multi_class='multinomial')
LR.fit(X_train_SMOTE, y_train_SMOTE)
pred2 = LR.predict(X_test)

print("accuracy: ", LR.score(X_test, y_test))
# Score the SMOTE model's predictions (pred2), not the baseline's (pred)
print("precision: ", round(precision_score(y_test, pred2), 4))
print("recall: ", round(recall_score(y_test, pred2), 4))
print("f1: ", round(f1_score(y_test, pred2), 4))

accuracy:  0.6214431693130011
precision:  0.0689
recall:  0.4955
f1:  0.121


In [47]:
confusion_matrix(y_test,pred2)

array([[11362,  6718],
       [  506,   497]])

# Lab | Random Forests

For this lab, you will be using the .CSV files provided in the files_for_lab folder. These are cleaned versions of the learningSet data from the Case Study 'Healthcare for All'.
You may continue in the Jupyter Notebook you created yesterday. There is no need to fork and clone this Repo.

### Instructions

- Apply the Random Forest algorithm AFTER upsampling the data to deal with the imbalance.
- Use the feature selection techniques you have learned in class to decide whether you want to use all of the features (Variance Threshold, RFE, PCA, etc.).
- Re-run the Random Forest algorithm to determine whether the feature selection has improved the results.
- Discuss the output and its impact on the business scenario. Is the cost of a false positive equal to the cost of a false negative? How would you change your algorithm or data in order to maximize the return for the business?
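
One way to approach the false positive versus false negative question is to put a price on each cell of the confusion matrix. The sketch below does this for the counts from the SMOTE logistic regression above. The average donation and the cost per mailing are hypothetical placeholders rather than figures from the dataset, and `campaign_return` and `missed_revenue` are illustrative helpers:

```python
def campaign_return(tp, fp, avg_donation, cost_per_mailing):
    """Net return of mailing everyone the model flags as a donor."""
    mailed = tp + fp
    return tp * avg_donation - mailed * cost_per_mailing

def missed_revenue(fn, avg_donation, cost_per_mailing):
    """Net return forgone by not mailing actual donors (false negatives)."""
    return fn * (avg_donation - cost_per_mailing)

# Hypothetical business figures, not taken from the dataset
AVG_DONATION = 15.0
COST_PER_MAILING = 0.70

# Counts from the SMOTE logistic regression confusion matrix: [[TN, FP], [FN, TP]]
tn, fp, fn, tp = 11362, 6718, 506, 497

print("net return:     ", campaign_return(tp, fp, AVG_DONATION, COST_PER_MAILING))
print("missed revenue: ", missed_revenue(fn, AVG_DONATION, COST_PER_MAILING))
```

Under assumptions like these, a false negative (a missed donation) costs far more than a false positive (one wasted mailing), which would argue for favoring recall over precision.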

In [48]:
from sklearn.ensemble import RandomForestClassifier

clf = RandomForestClassifier(max_depth=10, # maximum depth of each tree
                             min_samples_split=20, # minimum rows a node needs before it can be split
                             min_samples_leaf=20, # minimum rows required in every leaf
                             max_samples=0.8, # fraction of training rows drawn for each tree
                             random_state=7)

clf.fit(X_train_SMOTE, y_train_SMOTE)

In [49]:
clf.score(X_train_SMOTE, y_train_SMOTE)

0.9578004938680351

In [50]:
clf.score(X_test, y_test)

0.918618665828224

In [54]:
pred3 = clf.predict(X_test)

print("precision: ",precision_score(y_test,pred3))
print("recall: ",recall_score(y_test,pred3))
print("f1: ",f1_score(y_test,pred3))

precision:  0.07822085889570553
recall:  0.05084745762711865
f1:  0.06163141993957704


In [55]:
confusion_matrix(y_test,pred3)

array([[17479,   601],
       [  952,    51]])

In [57]:
from sklearn.decomposition import PCA
pca = PCA(0.9).fit(X_train_SMOTE)
pca.explained_variance_ratio_

array([0.30411188, 0.16448487, 0.06768406, 0.03656555, 0.0221289 ,
       0.014751  , 0.01387058, 0.01240639, 0.01163122, 0.01029087,
       0.00975744, 0.00896683, 0.00797068, 0.0077314 , 0.00722463,
       0.00715134, 0.00658084, 0.00626459, 0.00611314, 0.00591341,
       0.00573722, 0.00550073, 0.00543544, 0.00532033, 0.00517472,
       0.00506153, 0.00488472, 0.00482276, 0.00475773, 0.00452512,
       0.00431353, 0.00426392, 0.00417205, 0.00399719, 0.00376594,
       0.0037058 , 0.00340498, 0.00323762, 0.00312458, 0.00307507,
       0.00302533, 0.00297358, 0.00292871, 0.00290961, 0.00285075,
       0.00273845, 0.00261572, 0.00252788, 0.00249277, 0.00247826,
       0.00236537, 0.00233202, 0.00227153, 0.00221311, 0.00215314,
       0.00205372, 0.00199869, 0.00193868, 0.00185954, 0.00178197,
       0.00177118, 0.00168626, 0.00165866, 0.00163601, 0.00162889,
       0.00160853, 0.00157468, 0.0015428 , 0.00150413, 0.0014556 ,
       0.00142833, 0.00141571, 0.00138477, 0.00137908, 0.00134
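
Passing a float such as `PCA(0.9)` keeps the smallest number of components whose cumulative explained variance reaches that fraction. A short sketch of the running total, using only the first ten ratios from the output above (the printed array is truncated, so the full sum is not reproduced here):

```python
from itertools import accumulate

# First ten ratios copied from the output above
ratios = [0.30411188, 0.16448487, 0.06768406, 0.03656555, 0.0221289,
          0.014751, 0.01387058, 0.01240639, 0.01163122, 0.01029087]

cumulative = list(accumulate(ratios))
for k, total in enumerate(cumulative, start=1):
    print(f"{k:2d} components: {total:.4f}")
```

In the notebook, `pca.n_components_` reports how many components were actually kept, and `pca.explained_variance_ratio_.cumsum()` gives the full running total.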

In [60]:
X_train_pca = pca.transform(X_train_SMOTE)
X_test_pca = pca.transform(X_test)

In [61]:
clf.fit(X_train_pca, y_train_SMOTE)

In [62]:
clf.score(X_train_pca, y_train_SMOTE)

0.8139096966436287

In [63]:
clf.score(X_test_pca, y_test)

0.6888853953780852

In [67]:
pred4 = clf.predict(X_test_pca)

print("precision: ",precision_score(y_test,pred4))
print("recall: ",recall_score(y_test,pred4))
print("f1: ",f1_score(y_test,pred4))

precision:  0.06960921144452198
recall:  0.39780658025922233
f1:  0.11848552338530066


In [68]:
confusion_matrix(y_test,pred4)

array([[12747,  5333],
       [  604,   399]])
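
To compare the three resampled models side by side, the metrics can be recomputed from the confusion matrices printed above. This is a small sketch; the model labels and the `summarize` helper are illustrative names:

```python
# Confusion matrices copied from the outputs above: [[TN, FP], [FN, TP]]
results = {
    "LogReg + SMOTE": [[11362, 6718], [506, 497]],
    "RF + SMOTE": [[17479, 601], [952, 51]],
    "RF + SMOTE + PCA": [[12747, 5333], [604, 399]],
}

def summarize(cm):
    """Return accuracy, precision and recall for the positive (donor) class."""
    (tn, fp), (fn, tp) = cm
    total = tn + fp + fn + tp
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return {"accuracy": (tn + tp) / total, "precision": precision, "recall": recall}

summary = {name: summarize(cm) for name, cm in results.items()}
for name, m in summary.items():
    print(f"{name:18s} accuracy={m['accuracy']:.3f} "
          f"precision={m['precision']:.3f} recall={m['recall']:.3f}")
```

The Random Forest without PCA has the highest accuracy but finds only about 5% of the donors, while the other two models give up accuracy to reach a much higher recall.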