In this notebook, we use `featuretools` to generate new features automatically across all of our datasets. `featuretools` is an open-source Python library for automated feature engineering: it applies aggregation and transformation primitives (such as sums and means) across related tables, so most manual feature construction can be skipped. That makes it a good fit for complex, multi-table datasets like ours.

The core technique in `featuretools` is Deep Feature Synthesis (DFS), which stacks primitives along the relationships between tables to build "deep" features. For instance, it can derive a feature like "average transaction amount per month" by combining simpler aggregations. Letting `featuretools` enumerate these candidates leaves us free to concentrate on feature selection and modelling.
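
To make the idea of stacking concrete, here is a minimal pandas sketch of how a depth-2 feature could be built by hand. The two small tables are made-up stand-ins for `previous` and `instalments`, and this is only an illustration of the concept, not how `featuretools` computes it internally.

```python
import pandas as pd

# Toy stand-ins for two linked tables (hypothetical data, not the notebook's files).
previous = pd.DataFrame({
    "SK_ID_PREV": [10, 11, 12],
    "SK_ID_CURR": [1, 1, 2],
})
instalments = pd.DataFrame({
    "SK_ID_PREV": [10, 10, 11, 12, 12],
    "AMT_INSTALMENT": [100.0, 50.0, 30.0, 20.0, 40.0],
})

# Depth 1: aggregate instalments up to their parent (one value per previous loan).
sum_per_loan = (
    instalments.groupby("SK_ID_PREV")["AMT_INSTALMENT"]
    .sum()
    .rename("SUM(instalments.AMT_INSTALMENT)")
)
previous = previous.join(sum_per_loan, on="SK_ID_PREV")

# Depth 2: aggregate the depth-1 feature up to the applicant level.
deep_feature = (
    previous.groupby("SK_ID_CURR")["SUM(instalments.AMT_INSTALMENT)"]
    .mean()
    .rename("MEAN(previous.SUM(instalments.AMT_INSTALMENT))")
)
print(deep_feature)
```

DFS automates this pattern for every applicable primitive and column, which is why the number of candidate features grows so quickly with depth.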

In [2]:
import pandas as pd
import numpy as np

# Suppress warnings from pandas
import warnings
warnings.filterwarnings('ignore')

# import the functions from the file
from notebook_setup import RootPath, config
from my_functions import pie_plot

# Automatically Reload Changes if there are frequent changes to my xx.py file
%load_ext autoreload
%autoreload 2

In [3]:
# featuretools for automated feature engineering
import featuretools as ft


In [4]:
from featuretools.selection import (
    remove_highly_correlated_features,
    remove_highly_null_features,
    remove_single_value_features,
)

# import the functions from the file
from my_functions import (
    convert_to_categorical, 
    encode_and_one_hot, 
    summarize_dataframe,
    get_top_correlations, 
    missing_values_table, 
    preprocess_and_reduce_features,
    merge_data_on_index
)

In [5]:
# Load the sampled tables and drop any "Unnamed" column (the saved index column)
def load_clean_csv(name):
    """Read a cleaned CSV and drop leftover 'Unnamed' index columns."""
    df = pd.read_csv(f'{config.CleanDataPath}{name}.csv')
    return df.loc[:, ~df.columns.str.contains('^Unnamed')]

app_df = load_clean_csv('app_df')
bureau_df = load_clean_csv('bureau_df')
bureau_balance_df = load_clean_csv('bureau_balance_df')
previous_df = load_clean_csv('previous_df')
cash_df = load_clean_csv('cash_df')
instalments_df = load_clean_csv('instalments_df')
credit_df = load_clean_csv('credit_df')
credit_df.head(5)

,SK_ID_PREV,SK_ID_CURR,MONTHS_BALANCE,AMT_BALANCE,AMT_CREDIT_LIMIT_ACTUAL,AMT_DRAWINGS_ATM_CURRENT,AMT_DRAWINGS_CURRENT,AMT_DRAWINGS_OTHER_CURRENT,AMT_DRAWINGS_POS_CURRENT,AMT_INST_MIN_REGULARITY,...,AMT_RECIVABLE,AMT_TOTAL_RECEIVABLE,CNT_DRAWINGS_ATM_CURRENT,CNT_DRAWINGS_CURRENT,CNT_DRAWINGS_OTHER_CURRENT,CNT_DRAWINGS_POS_CURRENT,CNT_INSTALMENT_MATURE_CUM,NAME_CONTRACT_STATUS,SK_DPD,SK_DPD_DEF
0,2740914,340339,-1,1.310354,0.5,10800.0,10800.0,0.0,0.0,0.88896,...,1.276882,1.276635,2.0,2.0,0.0,0.0,6.0,Active,0.0,0.0
1,1864742,303581,-1,0.0,-0.5,0.0,0.0,0.0,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,59.0,Active,0.0,0.0
2,1189095,231608,-4,4.505954,1.75,9000.0,9000.0,0.0,0.0,3.395033,...,4.5071,4.50623,1.0,1.0,0.0,0.0,22.0,Active,0.0,0.0
3,1683604,217113,-1,0.683794,-0.375,0.0,4412.25,0.0,4412.25,0.491147,...,0.686007,0.685874,0.0,4.0,0.0,4.0,6.0,Active,0.0,0.0
4,2252971,193247,-4,0.233098,2.25,0.0,0.0,0.0,0.0,0.333333,...,0.233095,0.23305,0.0,0.0,0.0,0.0,6.0,Active,0.0,0.0


### Automated Feature Engineering with Featuretools
To keep things simple, we use the default primitives for automated feature engineering. Featuretools chooses which primitives to apply from each column's data type and from the relationships between tables, so no hand-written aggregation code is needed. We cap the feature depth at 2 (`max_depth=2`), which captures aggregates of aggregates while keeping run time and feature count manageable.

Once DFS has finished, we prune the feature set by removing:

1. Features with a high share of missing values (`pct_null_threshold=0.90`)
2. Single-value features; because `count_nan_as_value=True`, NaN counts as a distinct value, so a column holding one value plus NaN is kept
3. Highly correlated features (`pct_corr_threshold=0.9`)

This leaves a smaller, less redundant feature matrix for modelling.

In [7]:
# Create the EntitySet with sampled data
es = ft.EntitySet(id="customer_data")
es = es.add_dataframe(dataframe_name='app', dataframe=app_df, index='SK_ID_CURR')
es = es.add_dataframe(dataframe_name='bureau', dataframe=bureau_df, index='SK_ID_BUREAU')
es = es.add_dataframe(dataframe_name='previous', dataframe=previous_df, index='SK_ID_PREV')
es = es.add_dataframe(dataframe_name='bureau_balance', dataframe=bureau_balance_df, make_index=True, index='bureaubalance_index')
es = es.add_dataframe(dataframe_name='cash', dataframe=cash_df, make_index=True, index='cash_index')
es = es.add_dataframe(dataframe_name='instalments', dataframe=instalments_df, make_index=True, index='instalments_index')
es = es.add_dataframe(dataframe_name='credit', dataframe=credit_df, make_index=True, index='credit_index')

# Define relationships in the EntitySet
es = es.add_relationship('app', 'SK_ID_CURR', 'bureau', 'SK_ID_CURR')
es = es.add_relationship('app', 'SK_ID_CURR', 'previous', 'SK_ID_CURR')
es = es.add_relationship('bureau', 'SK_ID_BUREAU', 'bureau_balance', 'SK_ID_BUREAU')
es = es.add_relationship('previous', 'SK_ID_PREV', 'cash', 'SK_ID_PREV')
es = es.add_relationship('previous', 'SK_ID_PREV', 'instalments', 'SK_ID_PREV')
es = es.add_relationship('previous', 'SK_ID_PREV', 'credit', 'SK_ID_PREV')


In [8]:
# Deep Feature Synthesis (DFS) with default primitives
fm, features = ft.dfs(entityset=es, target_dataframe_name='app',
                      max_depth=2, features_only=False, verbose=True)

print(fm.shape)
fm.head(10)

Built 2088 features
Elapsed: 36:48 | Progress: 100%|██████████
(30751, 2088)


SK_ID_CURR,TARGET,COUNT(bureau),MAX(bureau.AMT_ANNUITY),MAX(bureau.AMT_CREDIT_MAX_OVERDUE),MAX(bureau.AMT_CREDIT_SUM),MAX(bureau.AMT_CREDIT_SUM_DEBT),MAX(bureau.AMT_CREDIT_SUM_LIMIT),MAX(bureau.AMT_CREDIT_SUM_OVERDUE),MAX(bureau.CNT_CREDIT_PROLONG),MAX(bureau.CREDIT_DAY_OVERDUE),...,SUM(credit.previous.DAYS_LAST_DUE),SUM(credit.previous.DAYS_LAST_DUE_1ST_VERSION),SUM(credit.previous.DAYS_TERMINATION),SUM(credit.previous.HOUR_APPR_PROCESS_START),SUM(credit.previous.NFLAG_INSURED_ON_APPROVAL),SUM(credit.previous.NFLAG_LAST_APPL_IN_DAY),SUM(credit.previous.RATE_DOWN_PAYMENT),SUM(credit.previous.RATE_INTEREST_PRIMARY),SUM(credit.previous.RATE_INTEREST_PRIVILEGED),SUM(credit.previous.SELLERPLACE_AREA)
307474,0,5,,0.0,3.116422,0.0,0.0,0.0,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
412537,0,4,0.033333,0.0,0.317765,0.639861,176852.565,0.0,2.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
149084,0,1,,,0.588603,0.0,0.0,0.0,0.0,0.0,...,11495.037873,10354.627451,11634.342577,702.0,0.0,0.0,0.0,0.0,0.0,-1.835294
364692,0,0,,,,,,,,,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
155423,0,5,0.0,,1.072048,2.653891,0.0,0.0,0.0,0.0,...,15.592264,22833.281046,25655.216966,1118.0,0.0,0.0,0.0,0.0,0.0,2401.929412
165164,0,12,18.730333,0.0,2.755305,12.38995,3068.775,0.0,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
427814,0,6,,0.0,2.321964,17.055449,0.0,0.0,1.0,0.0,...,2.014504,2655.03268,2983.164763,110.0,0.0,0.0,0.0,0.0,0.0,93.764706
248940,0,5,,25803.945,9.43597,33.960908,0.0,0.0,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
218738,0,12,,11493.765,1.762233,5.257485,0.0,0.0,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
285243,0,0,,,,,,,,,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
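
Because `cash`, `instalments` and `credit` also carry an `SK_ID_CURR` column, DFS treats that identifier as an ordinary numeric column, and the final column listing contains aggregates such as `MAX(cash.SK_ID_CURR)` that are unlikely to carry signal. One possible way to drop them after the fact is a name-based filter. The sketch below uses a hypothetical miniature feature matrix and an assumed regular expression; `ft.dfs` also accepts an `ignore_columns` argument that should prevent such features from being generated in the first place, but that route is not shown here.

```python
import pandas as pd

# Hypothetical miniature feature matrix with column names of the kind DFS produced.
fm_demo = pd.DataFrame({
    "COUNT(cash)": [3, 1],
    "MAX(cash.SK_ID_CURR)": [307474, 412537],
    "MEAN(previous.STD(credit.SK_ID_CURR))": [0.0, 0.0],
    "MEAN(cash.SK_DPD)": [0.0, 1.5],
})

# Flag any feature whose innermost argument is an identifier column.
# The pattern is an assumption based on this dataset's ID naming (SK_ID_*).
id_pattern = r"\.SK_ID_(?:CURR|PREV|BUREAU)\)"
is_id_aggregate = fm_demo.columns.str.contains(id_pattern, regex=True)

fm_no_ids = fm_demo.loc[:, ~is_id_aggregate]
print(list(fm_no_ids.columns))
```

A filter like this only affects the DataFrame; if the `features` list is needed later, it would have to be filtered by name in the same way.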


In [9]:
features[-20:]

[<Feature: STD(credit.previous.SELLERPLACE_AREA)>,
 <Feature: SUM(credit.previous.AMT_ANNUITY)>,
 <Feature: SUM(credit.previous.AMT_APPLICATION)>,
 <Feature: SUM(credit.previous.AMT_CREDIT)>,
 <Feature: SUM(credit.previous.AMT_DOWN_PAYMENT)>,
 <Feature: SUM(credit.previous.AMT_GOODS_PRICE)>,
 <Feature: SUM(credit.previous.CNT_PAYMENT)>,
 <Feature: SUM(credit.previous.DAYS_DECISION)>,
 <Feature: SUM(credit.previous.DAYS_FIRST_DRAWING)>,
 <Feature: SUM(credit.previous.DAYS_FIRST_DUE)>,
 <Feature: SUM(credit.previous.DAYS_LAST_DUE)>,
 <Feature: SUM(credit.previous.DAYS_LAST_DUE_1ST_VERSION)>,
 <Feature: SUM(credit.previous.DAYS_TERMINATION)>,
 <Feature: SUM(credit.previous.HOUR_APPR_PROCESS_START)>,
 <Feature: SUM(credit.previous.NFLAG_INSURED_ON_APPROVAL)>,
 <Feature: SUM(credit.previous.NFLAG_LAST_APPL_IN_DAY)>,
 <Feature: SUM(credit.previous.RATE_DOWN_PAYMENT)>,
 <Feature: SUM(credit.previous.RATE_INTEREST_PRIMARY)>,
 <Feature: SUM(credit.previous.RATE_INTEREST_PRIVILEGED)>,
 <Feature:

#### Remove Highly Null Features

In [11]:
fm2 = remove_highly_null_features(fm, pct_null_threshold=0.90)
print(fm2.shape)
#fm2.head(5)

(30751, 1867)
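
As a rough illustration of what this step does, the same kind of filter can be written directly in pandas. The sketch uses made-up data and is not the library's implementation; in particular, how `featuretools` treats a column sitting exactly on the threshold is not checked here.

```python
import numpy as np
import pandas as pd

# Hypothetical 20-row frame with different shares of missing values.
demo = pd.DataFrame({
    "mostly_null": [np.nan] * 19 + [1.0],        # 95% missing
    "some_null": [np.nan] * 10 + [1.0] * 10,     # 50% missing
    "complete": range(20),                       # 0% missing
})

# Fraction of missing values per column.
null_fraction = demo.isnull().mean()

# Keep only columns whose missing share is below the threshold.
threshold = 0.90
kept = demo.loc[:, null_fraction < threshold]
print(null_fraction)
print(list(kept.columns))
```

Inspecting `null_fraction` before filtering is also a cheap way to see how sensitive the result is to the chosen threshold.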


#### Remove Single Value Features

In [13]:
fm3, features3 = remove_single_value_features(
    fm2, features=features, count_nan_as_value=True
)
print(fm3.shape)
#fm3.head()

(30751, 1851)
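
The effect of `count_nan_as_value=True` can be illustrated with `DataFrame.nunique`, whose `dropna` flag plays a similar role. This is a sketch on made-up data that mimics the behaviour, not the library's own code.

```python
import numpy as np
import pandas as pd

# Hypothetical columns: a true constant, a constant with a gap, and a varying column.
demo = pd.DataFrame({
    "constant": [1, 1, 1, 1],
    "constant_with_nan": [1.0, np.nan, 1.0, 1.0],
    "varied": [1, 2, 1, 3],
})

# dropna=False counts NaN as its own value (analogous to count_nan_as_value=True).
unique_with_nan = demo.nunique(dropna=False)
# dropna=True ignores NaN (analogous to count_nan_as_value=False).
unique_without_nan = demo.nunique(dropna=True)

kept_nan_counted = demo.loc[:, unique_with_nan > 1]
kept_nan_ignored = demo.loc[:, unique_without_nan > 1]
print(list(kept_nan_counted.columns))
print(list(kept_nan_ignored.columns))
```

With NaN counted as a value, `constant_with_nan` survives because its missingness pattern may itself be informative; with NaN ignored, it is dropped together with `constant`.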


#### Remove Highly Correlated Features

In [15]:
fm4, features4 = remove_highly_correlated_features(
    fm3, features=features3, pct_corr_threshold=0.9)
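
For intuition, a correlation filter of this kind can be sketched in pandas as follows. The data are made up, and the rule used here (drop the later column of each highly correlated pair) is a simplifying assumption; `featuretools` applies its own rule for deciding which feature of a pair to remove.

```python
import numpy as np
import pandas as pd

# Hypothetical numeric features: "a_scaled" is a linear function of "a".
a = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
demo = pd.DataFrame({
    "a": a,
    "a_scaled": 2.0 * a + 1.0,
    "b": [1.0, -1.0, 1.0, -1.0, 1.0, -1.0],
})

# Absolute pairwise Pearson correlations.
corr = demo.corr().abs()

# Keep only the upper triangle so each pair is inspected once.
upper = corr.where(np.triu(np.ones(corr.shape, dtype=bool), k=1))

# Drop a column if it is highly correlated with any column before it.
threshold = 0.9
to_drop = [col for col in upper.columns if (upper[col] > threshold).any()]
reduced = demo.drop(columns=to_drop)
print(to_drop)
print(list(reduced.columns))
```

Only numeric columns enter the correlation matrix, so categorical features such as the `MODE(...)` columns pass through a filter like this untouched.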


In [16]:
pd.options.display.max_columns = 2000
print(fm4.shape)
fm4.head()

(30751, 774)


Unnamed: 0_level_0,TARGET,COUNT(bureau),MAX(bureau.AMT_ANNUITY),MAX(bureau.AMT_CREDIT_MAX_OVERDUE),MAX(bureau.AMT_CREDIT_SUM),MAX(bureau.AMT_CREDIT_SUM_DEBT),MAX(bureau.AMT_CREDIT_SUM_LIMIT),MAX(bureau.AMT_CREDIT_SUM_OVERDUE),MAX(bureau.CNT_CREDIT_PROLONG),MAX(bureau.CREDIT_DAY_OVERDUE),MAX(bureau.DAYS_CREDIT),MAX(bureau.DAYS_CREDIT_ENDDATE),MAX(bureau.DAYS_CREDIT_UPDATE),MAX(bureau.DAYS_ENDDATE_FACT),MEAN(bureau.AMT_CREDIT_MAX_OVERDUE),MEAN(bureau.AMT_CREDIT_SUM),MEAN(bureau.AMT_CREDIT_SUM_DEBT),MEAN(bureau.AMT_CREDIT_SUM_LIMIT),MEAN(bureau.CNT_CREDIT_PROLONG),MEAN(bureau.CREDIT_DAY_OVERDUE),MEAN(bureau.DAYS_CREDIT),MEAN(bureau.DAYS_CREDIT_ENDDATE),MEAN(bureau.DAYS_CREDIT_UPDATE),MEAN(bureau.DAYS_ENDDATE_FACT),MIN(bureau.AMT_ANNUITY),MIN(bureau.AMT_CREDIT_MAX_OVERDUE),MIN(bureau.AMT_CREDIT_SUM),MIN(bureau.AMT_CREDIT_SUM_DEBT),MIN(bureau.AMT_CREDIT_SUM_LIMIT),MIN(bureau.AMT_CREDIT_SUM_OVERDUE),MIN(bureau.CNT_CREDIT_PROLONG),MIN(bureau.CREDIT_DAY_OVERDUE),MIN(bureau.DAYS_CREDIT),MIN(bureau.DAYS_CREDIT_ENDDATE),MIN(bureau.DAYS_CREDIT_UPDATE),MIN(bureau.DAYS_ENDDATE_FACT),MODE(bureau.CREDIT_ACTIVE),MODE(bureau.CREDIT_CURRENCY),MODE(bureau.CREDIT_TYPE),NUM_UNIQUE(bureau.CREDIT_ACTIVE),NUM_UNIQUE(bureau.CREDIT_CURRENCY),NUM_UNIQUE(bureau.CREDIT_TYPE),SKEW(bureau.AMT_ANNUITY),SKEW(bureau.AMT_CREDIT_MAX_OVERDUE),SKEW(bureau.AMT_CREDIT_SUM),SKEW(bureau.AMT_CREDIT_SUM_DEBT),SKEW(bureau.AMT_CREDIT_SUM_LIMIT),SKEW(bureau.AMT_CREDIT_SUM_OVERDUE),SKEW(bureau.CNT_CREDIT_PROLONG),SKEW(bureau.DAYS_CREDIT),SKEW(bureau.DAYS_CREDIT_ENDDATE),SKEW(bureau.DAYS_CREDIT_UPDATE),SKEW(bureau.DAYS_ENDDATE_FACT),STD(bureau.DAYS_CREDIT),STD(bureau.DAYS_ENDDATE_FACT),SUM(bureau.AMT_CREDIT_SUM),SUM(bureau.DAYS_CREDIT_ENDDATE),SUM(bureau.DAYS_CREDIT_UPDATE),COUNT(bureau_balance),MAX(bureau_balance.MONTHS_BALANCE),MEAN(bureau_balance.MONTHS_BALANCE),MODE(bureau_balance.STATUS),NUM_UNIQUE(bureau_balance.STATUS),SKEW(bureau_balance.MONTHS_BALANCE),COUNT(previous),MAX(previous.AMT_ANNUITY),MAX(previous
.AMT_APPLICATION),MAX(previous.AMT_DOWN_PAYMENT),MAX(previous.CNT_PAYMENT),MAX(previous.DAYS_DECISION),MAX(previous.DAYS_FIRST_DRAWING),MAX(previous.DAYS_FIRST_DUE),MAX(previous.DAYS_LAST_DUE),MAX(previous.DAYS_LAST_DUE_1ST_VERSION),MAX(previous.HOUR_APPR_PROCESS_START),MAX(previous.NFLAG_INSURED_ON_APPROVAL),MAX(previous.NFLAG_LAST_APPL_IN_DAY),MAX(previous.RATE_DOWN_PAYMENT),MAX(previous.SELLERPLACE_AREA),MEAN(previous.AMT_ANNUITY),MEAN(previous.AMT_APPLICATION),MEAN(previous.AMT_DOWN_PAYMENT),MEAN(previous.CNT_PAYMENT),MEAN(previous.DAYS_DECISION),MEAN(previous.DAYS_FIRST_DRAWING),MEAN(previous.DAYS_FIRST_DUE),MEAN(previous.DAYS_LAST_DUE),MEAN(previous.DAYS_LAST_DUE_1ST_VERSION),MEAN(previous.HOUR_APPR_PROCESS_START),MEAN(previous.NFLAG_INSURED_ON_APPROVAL),MEAN(previous.NFLAG_LAST_APPL_IN_DAY),MEAN(previous.RATE_DOWN_PAYMENT),MEAN(previous.SELLERPLACE_AREA),MIN(previous.AMT_ANNUITY),MIN(previous.AMT_APPLICATION),MIN(previous.AMT_DOWN_PAYMENT),MIN(previous.AMT_GOODS_PRICE),MIN(previous.CNT_PAYMENT),MIN(previous.DAYS_DECISION),MIN(previous.DAYS_FIRST_DRAWING),MIN(previous.DAYS_FIRST_DUE),MIN(previous.DAYS_LAST_DUE),MIN(previous.HOUR_APPR_PROCESS_START),MIN(previous.NFLAG_INSURED_ON_APPROVAL),MIN(previous.NFLAG_LAST_APPL_IN_DAY),MIN(previous.RATE_DOWN_PAYMENT),MIN(previous.SELLERPLACE_AREA),MODE(previous.CHANNEL_TYPE),MODE(previous.CODE_REJECT_REASON),MODE(previous.NAME_CASH_LOAN_PURPOSE),MODE(previous.NAME_CLIENT_TYPE),MODE(previous.NAME_CONTRACT_STATUS),MODE(previous.NAME_CONTRACT_TYPE),MODE(previous.NAME_GOODS_CATEGORY),MODE(previous.NAME_PAYMENT_TYPE),MODE(previous.NAME_PORTFOLIO),MODE(previous.NAME_PRODUCT_TYPE),MODE(previous.NAME_SELLER_INDUSTRY),MODE(previous.NAME_TYPE_SUITE),MODE(previous.NAME_YIELD_GROUP),MODE(previous.PRODUCT_COMBINATION),MODE(previous.WEEKDAY_APPR_PROCESS_START),NUM_UNIQUE(previous.CHANNEL_TYPE),NUM_UNIQUE(previous.CODE_REJECT_REASON),NUM_UNIQUE(previous.NAME_CASH_LOAN_PURPOSE),NUM_UNIQUE(previous.NAME_CLIENT_TYPE),NUM_UNIQUE(previous.NA
ME_CONTRACT_STATUS),NUM_UNIQUE(previous.NAME_CONTRACT_TYPE),NUM_UNIQUE(previous.NAME_GOODS_CATEGORY),NUM_UNIQUE(previous.NAME_PAYMENT_TYPE),NUM_UNIQUE(previous.NAME_PRODUCT_TYPE),NUM_UNIQUE(previous.NAME_SELLER_INDUSTRY),NUM_UNIQUE(previous.NAME_TYPE_SUITE),NUM_UNIQUE(previous.NAME_YIELD_GROUP),NUM_UNIQUE(previous.PRODUCT_COMBINATION),NUM_UNIQUE(previous.WEEKDAY_APPR_PROCESS_START),PERCENT_TRUE(previous.FLAG_LAST_APPL_PER_CONTRACT),SKEW(previous.AMT_ANNUITY),SKEW(previous.AMT_APPLICATION),SKEW(previous.AMT_CREDIT),SKEW(previous.AMT_DOWN_PAYMENT),SKEW(previous.AMT_GOODS_PRICE),SKEW(previous.CNT_PAYMENT),SKEW(previous.DAYS_DECISION),SKEW(previous.DAYS_FIRST_DUE),SKEW(previous.DAYS_LAST_DUE),SKEW(previous.DAYS_LAST_DUE_1ST_VERSION),SKEW(previous.DAYS_TERMINATION),SKEW(previous.HOUR_APPR_PROCESS_START),SKEW(previous.NFLAG_INSURED_ON_APPROVAL),SKEW(previous.RATE_DOWN_PAYMENT),SKEW(previous.SELLERPLACE_AREA),STD(previous.CNT_PAYMENT),STD(previous.DAYS_DECISION),STD(previous.HOUR_APPR_PROCESS_START),STD(previous.NFLAG_INSURED_ON_APPROVAL),STD(previous.RATE_DOWN_PAYMENT),SUM(previous.AMT_ANNUITY),SUM(previous.AMT_APPLICATION),SUM(previous.CNT_PAYMENT),SUM(previous.DAYS_DECISION),SUM(previous.DAYS_LAST_DUE),SUM(previous.NFLAG_INSURED_ON_APPROVAL),SUM(previous.NFLAG_LAST_APPL_IN_DAY),SUM(previous.RATE_DOWN_PAYMENT),SUM(previous.RATE_INTEREST_PRIMARY),SUM(previous.RATE_INTEREST_PRIVILEGED),SUM(previous.SELLERPLACE_AREA),COUNT(cash),MAX(cash.CNT_INSTALMENT),MAX(cash.MONTHS_BALANCE),MAX(cash.SK_DPD),MAX(cash.SK_DPD_DEF),MAX(cash.SK_ID_CURR),MEAN(cash.CNT_INSTALMENT),MEAN(cash.MONTHS_BALANCE),MEAN(cash.SK_DPD),MEAN(cash.SK_DPD_DEF),MIN(cash.CNT_INSTALMENT),MIN(cash.CNT_INSTALMENT_FUTURE),MIN(cash.SK_DPD),MIN(cash.SK_DPD_DEF),MODE(cash.NAME_CONTRACT_STATUS),NUM_UNIQUE(cash.NAME_CONTRACT_STATUS),SKEW(cash.CNT_INSTALMENT),SKEW(cash.CNT_INSTALMENT_FUTURE),SKEW(cash.MONTHS_BALANCE),SKEW(cash.SK_DPD),SKEW(cash.SK_DPD_DEF),SKEW(cash.SK_ID_CURR),STD(cash.CNT_INSTALMENT),STD(cash.MONTHS_B
ALANCE),STD(cash.SK_ID_CURR),SUM(cash.CNT_INSTALMENT),SUM(cash.MONTHS_BALANCE),SUM(cash.SK_DPD_DEF),SUM(cash.SK_ID_CURR),COUNT(instalments),MAX(instalments.AMT_INSTALMENT),MAX(instalments.DAYS_ENTRY_PAYMENT),MAX(instalments.NUM_INSTALMENT_NUMBER),MAX(instalments.NUM_INSTALMENT_VERSION),MEAN(instalments.AMT_INSTALMENT),MEAN(instalments.NUM_INSTALMENT_NUMBER),MEAN(instalments.NUM_INSTALMENT_VERSION),MIN(instalments.AMT_INSTALMENT),MIN(instalments.NUM_INSTALMENT_NUMBER),SKEW(instalments.AMT_INSTALMENT),SKEW(instalments.DAYS_ENTRY_PAYMENT),SKEW(instalments.NUM_INSTALMENT_NUMBER),SKEW(instalments.NUM_INSTALMENT_VERSION),SKEW(instalments.SK_ID_CURR),STD(instalments.SK_ID_CURR),SUM(instalments.AMT_INSTALMENT),SUM(instalments.NUM_INSTALMENT_VERSION),SUM(instalments.SK_ID_CURR),COUNT(credit),MAX(credit.AMT_BALANCE),MAX(credit.AMT_CREDIT_LIMIT_ACTUAL),MAX(credit.AMT_DRAWINGS_ATM_CURRENT),MAX(credit.AMT_DRAWINGS_CURRENT),MAX(credit.AMT_DRAWINGS_OTHER_CURRENT),MAX(credit.AMT_DRAWINGS_POS_CURRENT),MAX(credit.AMT_PAYMENT_CURRENT),MAX(credit.CNT_DRAWINGS_ATM_CURRENT),MAX(credit.CNT_DRAWINGS_CURRENT),MAX(credit.CNT_DRAWINGS_OTHER_CURRENT),MAX(credit.CNT_INSTALMENT_MATURE_CUM),MAX(credit.MONTHS_BALANCE),MAX(credit.SK_DPD),MAX(credit.SK_DPD_DEF),MEAN(credit.AMT_BALANCE),MEAN(credit.AMT_DRAWINGS_ATM_CURRENT),MEAN(credit.AMT_DRAWINGS_CURRENT),MEAN(credit.AMT_DRAWINGS_OTHER_CURRENT),MEAN(credit.AMT_DRAWINGS_POS_CURRENT),MEAN(credit.AMT_PAYMENT_CURRENT),MEAN(credit.CNT_DRAWINGS_ATM_CURRENT),MEAN(credit.CNT_DRAWINGS_CURRENT),MEAN(credit.CNT_DRAWINGS_OTHER_CURRENT),MEAN(credit.SK_DPD_DEF),MIN(credit.AMT_BALANCE),MIN(credit.AMT_CREDIT_LIMIT_ACTUAL),MIN(credit.AMT_DRAWINGS_ATM_CURRENT),MIN(credit.AMT_DRAWINGS_CURRENT),MIN(credit.AMT_DRAWINGS_OTHER_CURRENT),MIN(credit.AMT_DRAWINGS_POS_CURRENT),MIN(credit.AMT_INST_MIN_REGULARITY),MIN(credit.AMT_PAYMENT_CURRENT),MIN(credit.AMT_PAYMENT_TOTAL_CURRENT),MIN(credit.CNT_DRAWINGS_ATM_CURRENT),MIN(credit.CNT_DRAWINGS_CURRENT),MIN(credit.CNT_DRAWINGS_OT
HER_CURRENT),MIN(credit.CNT_DRAWINGS_POS_CURRENT),MIN(credit.CNT_INSTALMENT_MATURE_CUM),MIN(credit.SK_DPD),MIN(credit.SK_DPD_DEF),MODE(credit.NAME_CONTRACT_STATUS),NUM_UNIQUE(credit.NAME_CONTRACT_STATUS),SKEW(credit.AMT_BALANCE),SKEW(credit.AMT_CREDIT_LIMIT_ACTUAL),SKEW(credit.AMT_DRAWINGS_ATM_CURRENT),SKEW(credit.AMT_DRAWINGS_CURRENT),SKEW(credit.AMT_DRAWINGS_OTHER_CURRENT),SKEW(credit.AMT_DRAWINGS_POS_CURRENT),SKEW(credit.AMT_INST_MIN_REGULARITY),SKEW(credit.AMT_PAYMENT_CURRENT),SKEW(credit.CNT_INSTALMENT_MATURE_CUM),SKEW(credit.MONTHS_BALANCE),SKEW(credit.SK_DPD),SKEW(credit.SK_DPD_DEF),SKEW(credit.SK_ID_CURR),STD(credit.AMT_CREDIT_LIMIT_ACTUAL),STD(credit.AMT_DRAWINGS_ATM_CURRENT),STD(credit.CNT_DRAWINGS_ATM_CURRENT),STD(credit.SK_ID_CURR),SUM(credit.AMT_BALANCE),SUM(credit.AMT_CREDIT_LIMIT_ACTUAL),SUM(credit.AMT_DRAWINGS_ATM_CURRENT),SUM(credit.AMT_DRAWINGS_CURRENT),SUM(credit.AMT_DRAWINGS_OTHER_CURRENT),SUM(credit.AMT_DRAWINGS_POS_CURRENT),SUM(credit.CNT_DRAWINGS_ATM_CURRENT),SUM(credit.CNT_DRAWINGS_CURRENT),SUM(credit.CNT_DRAWINGS_OTHER_CURRENT),SUM(credit.SK_DPD_DEF),MAX(bureau.COUNT(bureau_balance)),MAX(bureau.NUM_UNIQUE(bureau_balance.STATUS)),MAX(bureau.SKEW(bureau_balance.MONTHS_BALANCE)),MAX(bureau.SUM(bureau_balance.MONTHS_BALANCE)),MEAN(bureau.MAX(bureau_balance.MONTHS_BALANCE)),MEAN(bureau.NUM_UNIQUE(bureau_balance.STATUS)),MEAN(bureau.SKEW(bureau_balance.MONTHS_BALANCE)),MIN(bureau.COUNT(bureau_balance)),MIN(bureau.MAX(bureau_balance.MONTHS_BALANCE)),MIN(bureau.MEAN(bureau_balance.MONTHS_BALANCE)),MIN(bureau.NUM_UNIQUE(bureau_balance.STATUS)),MIN(bureau.SKEW(bureau_balance.MONTHS_BALANCE)),MODE(bureau.MODE(bureau_balance.STATUS)),NUM_UNIQUE(bureau.MODE(bureau_balance.STATUS)),SKEW(bureau.COUNT(bureau_balance)),SKEW(bureau.MAX(bureau_balance.MONTHS_BALANCE)),SKEW(bureau.MEAN(bureau_balance.MONTHS_BALANCE)),SKEW(bureau.NUM_UNIQUE(bureau_balance.STATUS)),SKEW(bureau.SUM(bureau_balance.MONTHS_BALANCE)),STD(bureau.MEAN(bureau_balance.MONTHS_BALANCE)),STD
(bureau.NUM_UNIQUE(bureau_balance.STATUS)),STD(bureau.SKEW(bureau_balance.MONTHS_BALANCE)),SUM(bureau.MAX(bureau_balance.MONTHS_BALANCE)),SUM(bureau.MEAN(bureau_balance.MONTHS_BALANCE)),MODE(bureau_balance.bureau.CREDIT_ACTIVE),MODE(bureau_balance.bureau.CREDIT_CURRENCY),MODE(bureau_balance.bureau.CREDIT_TYPE),SKEW(bureau_balance.bureau.AMT_ANNUITY),SKEW(bureau_balance.bureau.AMT_CREDIT_MAX_OVERDUE),SKEW(bureau_balance.bureau.AMT_CREDIT_SUM),SKEW(bureau_balance.bureau.AMT_CREDIT_SUM_DEBT),SKEW(bureau_balance.bureau.AMT_CREDIT_SUM_LIMIT),SKEW(bureau_balance.bureau.AMT_CREDIT_SUM_OVERDUE),SKEW(bureau_balance.bureau.DAYS_CREDIT),SKEW(bureau_balance.bureau.DAYS_CREDIT_ENDDATE),SKEW(bureau_balance.bureau.DAYS_CREDIT_UPDATE),SKEW(bureau_balance.bureau.DAYS_ENDDATE_FACT),STD(bureau_balance.bureau.DAYS_CREDIT),SUM(bureau_balance.bureau.AMT_CREDIT_SUM),SUM(bureau_balance.bureau.AMT_CREDIT_SUM_DEBT),SUM(bureau_balance.bureau.AMT_CREDIT_SUM_LIMIT),SUM(bureau_balance.bureau.CNT_CREDIT_PROLONG),SUM(bureau_balance.bureau.CREDIT_DAY_OVERDUE),SUM(bureau_balance.bureau.DAYS_CREDIT_ENDDATE),SUM(bureau_balance.bureau.DAYS_CREDIT_UPDATE),MAX(previous.COUNT(cash)),MAX(previous.MEAN(cash.SK_DPD_DEF)),MAX(previous.MEAN(instalments.AMT_INSTALMENT)),MAX(previous.MIN(cash.CNT_INSTALMENT)),MAX(previous.MIN(cash.CNT_INSTALMENT_FUTURE)),MAX(previous.MIN(cash.SK_DPD)),MAX(previous.MIN(cash.SK_DPD_DEF)),MAX(previous.MIN(credit.AMT_DRAWINGS_OTHER_CURRENT)),MAX(previous.MIN(credit.CNT_DRAWINGS_OTHER_CURRENT)),MAX(previous.MIN(credit.SK_DPD)),MAX(previous.MIN(credit.SK_DPD_DEF)),MAX(previous.MIN(instalments.AMT_INSTALMENT)),MAX(previous.MIN(instalments.NUM_INSTALMENT_NUMBER)),MAX(previous.MIN(instalments.NUM_INSTALMENT_VERSION)),MAX(previous.SKEW(cash.CNT_INSTALMENT)),MAX(previous.SKEW(cash.CNT_INSTALMENT_FUTURE)),MAX(previous.SKEW(cash.MONTHS_BALANCE)),MAX(previous.SKEW(cash.SK_ID_CURR)),MAX(previous.SKEW(credit.MONTHS_BALANCE)),MAX(previous.SKEW(credit.SK_ID_CURR)),MAX(previous.SKEW(instalments.AM
T_INSTALMENT)),MAX(previous.SKEW(instalments.DAYS_ENTRY_PAYMENT)),MAX(previous.SKEW(instalments.DAYS_INSTALMENT)),MAX(previous.SKEW(instalments.NUM_INSTALMENT_NUMBER)),MAX(previous.SKEW(instalments.NUM_INSTALMENT_VERSION)),MAX(previous.SKEW(instalments.SK_ID_CURR)),MAX(previous.STD(cash.CNT_INSTALMENT)),MAX(previous.STD(cash.SK_ID_CURR)),MAX(previous.STD(credit.SK_ID_CURR)),MAX(previous.STD(instalments.AMT_INSTALMENT)),MAX(previous.STD(instalments.SK_ID_CURR)),MAX(previous.SUM(cash.MONTHS_BALANCE)),MAX(previous.SUM(cash.SK_ID_CURR)),MAX(previous.SUM(credit.AMT_CREDIT_LIMIT_ACTUAL)),MAX(previous.SUM(credit.MONTHS_BALANCE)),MAX(previous.SUM(instalments.AMT_INSTALMENT)),MAX(previous.SUM(instalments.DAYS_ENTRY_PAYMENT)),MAX(previous.SUM(instalments.NUM_INSTALMENT_VERSION)),MAX(previous.SUM(instalments.SK_ID_CURR)),MEAN(previous.COUNT(cash)),MEAN(previous.COUNT(credit)),MEAN(previous.COUNT(instalments)),MEAN(previous.MAX(instalments.NUM_INSTALMENT_NUMBER)),MEAN(previous.MAX(instalments.NUM_INSTALMENT_VERSION)),MEAN(previous.MEAN(instalments.AMT_INSTALMENT)),MEAN(previous.MEAN(instalments.NUM_INSTALMENT_NUMBER)),MEAN(previous.MIN(cash.CNT_INSTALMENT)),MEAN(previous.MIN(cash.CNT_INSTALMENT_FUTURE)),MEAN(previous.MIN(cash.SK_DPD)),MEAN(previous.MIN(cash.SK_DPD_DEF)),MEAN(previous.MIN(credit.AMT_DRAWINGS_OTHER_CURRENT)),MEAN(previous.MIN(credit.CNT_DRAWINGS_OTHER_CURRENT)),MEAN(previous.MIN(credit.SK_DPD)),MEAN(previous.MIN(credit.SK_DPD_DEF)),MEAN(previous.MIN(instalments.AMT_INSTALMENT)),MEAN(previous.MIN(instalments.NUM_INSTALMENT_VERSION)),MEAN(previous.NUM_UNIQUE(cash.NAME_CONTRACT_STATUS)),MEAN(previous.SKEW(cash.CNT_INSTALMENT)),MEAN(previous.SKEW(cash.CNT_INSTALMENT_FUTURE)),MEAN(previous.SKEW(cash.MONTHS_BALANCE)),MEAN(previous.SKEW(cash.SK_DPD)),MEAN(previous.SKEW(cash.SK_DPD_DEF)),MEAN(previous.SKEW(cash.SK_ID_CURR)),MEAN(previous.SKEW(credit.MONTHS_BALANCE)),MEAN(previous.SKEW(credit.SK_ID_CURR)),MEAN(previous.SKEW(instalments.AMT_INSTALMENT)),MEAN(previous.SKEW(
instalments.DAYS_ENTRY_PAYMENT)),MEAN(previous.SKEW(instalments.DAYS_INSTALMENT)),MEAN(previous.SKEW(instalments.NUM_INSTALMENT_NUMBER)),MEAN(previous.SKEW(instalments.NUM_INSTALMENT_VERSION)),MEAN(previous.SKEW(instalments.SK_ID_CURR)),MEAN(previous.STD(cash.CNT_INSTALMENT)),MEAN(previous.STD(cash.CNT_INSTALMENT_FUTURE)),MEAN(previous.STD(cash.MONTHS_BALANCE)),MEAN(previous.STD(cash.SK_ID_CURR)),MEAN(previous.STD(credit.SK_ID_CURR)),MEAN(previous.STD(instalments.DAYS_ENTRY_PAYMENT)),MEAN(previous.STD(instalments.SK_ID_CURR)),MEAN(previous.SUM(cash.CNT_INSTALMENT)),MEAN(previous.SUM(cash.MONTHS_BALANCE)),MEAN(previous.SUM(cash.SK_DPD)),MEAN(previous.SUM(cash.SK_DPD_DEF)),MEAN(previous.SUM(cash.SK_ID_CURR)),MEAN(previous.SUM(credit.AMT_BALANCE)),MEAN(previous.SUM(credit.AMT_CREDIT_LIMIT_ACTUAL)),MEAN(previous.SUM(credit.AMT_DRAWINGS_ATM_CURRENT)),MEAN(previous.SUM(credit.AMT_DRAWINGS_CURRENT)),MEAN(previous.SUM(credit.AMT_DRAWINGS_OTHER_CURRENT)),MEAN(previous.SUM(credit.AMT_DRAWINGS_POS_CURRENT)),MEAN(previous.SUM(credit.CNT_DRAWINGS_ATM_CURRENT)),MEAN(previous.SUM(credit.CNT_DRAWINGS_CURRENT)),MEAN(previous.SUM(credit.CNT_DRAWINGS_OTHER_CURRENT)),MEAN(previous.SUM(credit.CNT_INSTALMENT_MATURE_CUM)),MEAN(previous.SUM(credit.SK_DPD)),MEAN(previous.SUM(credit.SK_DPD_DEF)),MEAN(previous.SUM(instalments.AMT_INSTALMENT)),MEAN(previous.SUM(instalments.DAYS_ENTRY_PAYMENT)),MEAN(previous.SUM(instalments.NUM_INSTALMENT_NUMBER)),MEAN(previous.SUM(instalments.NUM_INSTALMENT_VERSION)),MEAN(previous.SUM(instalments.SK_ID_CURR)),MIN(previous.COUNT(cash)),MIN(previous.COUNT(instalments)),MIN(previous.MAX(cash.CNT_INSTALMENT)),MIN(previous.MAX(cash.SK_DPD_DEF)),MIN(previous.MAX(instalments.AMT_INSTALMENT)),MIN(previous.MAX(instalments.NUM_INSTALMENT_NUMBER)),MIN(previous.MAX(instalments.NUM_INSTALMENT_VERSION)),MIN(previous.MEAN(cash.SK_DPD_DEF)),MIN(previous.MEAN(instalments.AMT_INSTALMENT)),MIN(previous.MEAN(instalments.NUM_INSTALMENT_NUMBER)),MIN(previous.NUM_UNIQUE(cash.NAME_CO
NTRACT_STATUS)),MIN(previous.SKEW(cash.CNT_INSTALMENT)),MIN(previous.SKEW(cash.CNT_INSTALMENT_FUTURE)),MIN(previous.SKEW(cash.MONTHS_BALANCE)),MIN(previous.SKEW(cash.SK_DPD)),MIN(previous.SKEW(cash.SK_DPD_DEF)),MIN(previous.SKEW(cash.SK_ID_CURR)),MIN(previous.SKEW(credit.SK_ID_CURR)),MIN(previous.SKEW(instalments.AMT_INSTALMENT)),MIN(previous.SKEW(instalments.DAYS_ENTRY_PAYMENT)),MIN(previous.SKEW(instalments.DAYS_INSTALMENT)),MIN(previous.SKEW(instalments.NUM_INSTALMENT_NUMBER)),MIN(previous.SKEW(instalments.NUM_INSTALMENT_VERSION)),MIN(previous.SKEW(instalments.SK_ID_CURR)),MIN(previous.STD(cash.CNT_INSTALMENT)),MIN(previous.STD(cash.CNT_INSTALMENT_FUTURE)),MIN(previous.STD(cash.MONTHS_BALANCE)),MIN(previous.STD(cash.SK_ID_CURR)),MIN(previous.STD(credit.SK_ID_CURR)),MIN(previous.STD(instalments.DAYS_ENTRY_PAYMENT)),MIN(previous.STD(instalments.NUM_INSTALMENT_VERSION)),MIN(previous.STD(instalments.SK_ID_CURR)),MIN(previous.SUM(cash.CNT_INSTALMENT)),MIN(previous.SUM(cash.CNT_INSTALMENT_FUTURE)),MIN(previous.SUM(cash.MONTHS_BALANCE)),MIN(previous.SUM(cash.SK_DPD)),MIN(previous.SUM(cash.SK_DPD_DEF)),MIN(previous.SUM(credit.AMT_BALANCE)),MIN(previous.SUM(credit.AMT_CREDIT_LIMIT_ACTUAL)),MIN(previous.SUM(credit.AMT_DRAWINGS_ATM_CURRENT)),MIN(previous.SUM(credit.AMT_DRAWINGS_CURRENT)),MIN(previous.SUM(credit.AMT_DRAWINGS_OTHER_CURRENT)),MIN(previous.SUM(credit.AMT_DRAWINGS_POS_CURRENT)),MIN(previous.SUM(credit.AMT_PAYMENT_CURRENT)),MIN(previous.SUM(credit.CNT_DRAWINGS_ATM_CURRENT)),MIN(previous.SUM(credit.CNT_DRAWINGS_CURRENT)),MIN(previous.SUM(credit.CNT_DRAWINGS_OTHER_CURRENT)),MIN(previous.SUM(credit.CNT_INSTALMENT_MATURE_CUM)),MIN(previous.SUM(credit.SK_DPD)),MIN(previous.SUM(credit.SK_DPD_DEF)),MIN(previous.SUM(instalments.AMT_INSTALMENT)),MIN(previous.SUM(instalments.DAYS_ENTRY_PAYMENT)),MIN(previous.SUM(instalments.NUM_INSTALMENT_NUMBER)),MODE(previous.MODE(cash.NAME_CONTRACT_STATUS)),MODE(previous.MODE(credit.NAME_CONTRACT_STATUS)),NUM_UNIQUE(previous.MODE(cash.N
AME_CONTRACT_STATUS)),SKEW(previous.COUNT(cash)),SKEW(previous.COUNT(instalments)),SKEW(previous.MAX(cash.CNT_INSTALMENT)),SKEW(previous.MAX(cash.MONTHS_BALANCE)),SKEW(previous.MAX(cash.SK_ID_CURR)),SKEW(previous.MAX(instalments.AMT_INSTALMENT)),SKEW(previous.MAX(instalments.NUM_INSTALMENT_NUMBER)),SKEW(previous.MAX(instalments.NUM_INSTALMENT_VERSION)),SKEW(previous.MAX(instalments.SK_ID_CURR)),SKEW(previous.MEAN(cash.CNT_INSTALMENT_FUTURE)),SKEW(previous.MEAN(cash.SK_ID_CURR)),SKEW(previous.MEAN(instalments.AMT_INSTALMENT)),SKEW(previous.MEAN(instalments.NUM_INSTALMENT_VERSION)),SKEW(previous.MEAN(instalments.SK_ID_CURR)),SKEW(previous.MIN(cash.CNT_INSTALMENT)),SKEW(previous.MIN(cash.CNT_INSTALMENT_FUTURE)),SKEW(previous.MIN(cash.SK_DPD)),SKEW(previous.MIN(cash.SK_DPD_DEF)),SKEW(previous.MIN(cash.SK_ID_CURR)),SKEW(previous.MIN(instalments.AMT_INSTALMENT)),SKEW(previous.MIN(instalments.AMT_PAYMENT)),SKEW(previous.MIN(instalments.NUM_INSTALMENT_VERSION)),SKEW(previous.MIN(instalments.SK_ID_CURR)),SKEW(previous.NUM_UNIQUE(cash.NAME_CONTRACT_STATUS)),SKEW(previous.STD(cash.CNT_INSTALMENT)),SKEW(previous.STD(cash.CNT_INSTALMENT_FUTURE)),SKEW(previous.STD(cash.MONTHS_BALANCE)),SKEW(previous.STD(cash.SK_ID_CURR)),SKEW(previous.STD(instalments.AMT_INSTALMENT)),SKEW(previous.STD(instalments.AMT_PAYMENT)),SKEW(previous.STD(instalments.DAYS_ENTRY_PAYMENT)),SKEW(previous.STD(instalments.NUM_INSTALMENT_VERSION)),SKEW(previous.STD(instalments.SK_ID_CURR)),SKEW(previous.SUM(cash.CNT_INSTALMENT)),SKEW(previous.SUM(cash.CNT_INSTALMENT_FUTURE)),SKEW(previous.SUM(cash.MONTHS_BALANCE)),SKEW(previous.SUM(credit.AMT_CREDIT_LIMIT_ACTUAL)),SKEW(previous.SUM(credit.AMT_DRAWINGS_OTHER_CURRENT)),SKEW(previous.SUM(credit.AMT_DRAWINGS_POS_CURRENT)),SKEW(previous.SUM(credit.AMT_PAYMENT_CURRENT)),SKEW(previous.SUM(credit.SK_DPD)),SKEW(previous.SUM(credit.SK_DPD_DEF)),SKEW(previous.SUM(instalments.AMT_INSTALMENT)),SKEW(previous.SUM(instalments.DAYS_ENTRY_PAYMENT)),SKEW(previous.SUM(instalments.NU
M_INSTALMENT_NUMBER)),SKEW(previous.SUM(instalments.NUM_INSTALMENT_VERSION)),STD(previous.COUNT(cash)),STD(previous.MAX(cash.SK_ID_CURR)),STD(previous.MAX(instalments.SK_ID_CURR)),STD(previous.MEAN(cash.SK_ID_CURR)),STD(previous.MEAN(instalments.SK_ID_CURR)),STD(previous.MIN(cash.SK_DPD)),STD(previous.MIN(cash.SK_DPD_DEF)),STD(previous.MIN(cash.SK_ID_CURR)),STD(previous.MIN(instalments.NUM_INSTALMENT_VERSION)),STD(previous.MIN(instalments.SK_ID_CURR)),STD(previous.NUM_UNIQUE(cash.NAME_CONTRACT_STATUS)),STD(previous.SKEW(cash.CNT_INSTALMENT_FUTURE)),STD(previous.SKEW(cash.MONTHS_BALANCE)),STD(previous.SKEW(cash.SK_ID_CURR)),STD(previous.SKEW(instalments.AMT_INSTALMENT)),STD(previous.SKEW(instalments.AMT_PAYMENT)),STD(previous.SKEW(instalments.DAYS_ENTRY_PAYMENT)),STD(previous.SKEW(instalments.DAYS_INSTALMENT)),STD(previous.SKEW(instalments.NUM_INSTALMENT_VERSION)),STD(previous.SKEW(instalments.SK_ID_CURR)),STD(previous.SUM(cash.SK_ID_CURR)),STD(previous.SUM(credit.AMT_CREDIT_LIMIT_ACTUAL)),STD(previous.SUM(instalments.NUM_INSTALMENT_VERSION)),SUM(previous.MAX(cash.CNT_INSTALMENT)),SUM(previous.MAX(cash.MONTHS_BALANCE)),SUM(previous.MAX(cash.SK_ID_CURR)),SUM(previous.MAX(instalments.NUM_INSTALMENT_VERSION)),SUM(previous.MEAN(instalments.NUM_INSTALMENT_NUMBER)),SUM(previous.MIN(cash.CNT_INSTALMENT)),SUM(previous.MIN(cash.CNT_INSTALMENT_FUTURE)),SUM(previous.MIN(instalments.NUM_INSTALMENT_NUMBER)),SUM(previous.SKEW(cash.CNT_INSTALMENT)),SUM(previous.SKEW(cash.CNT_INSTALMENT_FUTURE)),SUM(previous.SKEW(cash.MONTHS_BALANCE)),SUM(previous.SKEW(instalments.AMT_INSTALMENT)),SUM(previous.SKEW(instalments.DAYS_ENTRY_PAYMENT)),SUM(previous.SKEW(instalments.DAYS_INSTALMENT)),SUM(previous.SKEW(instalments.NUM_INSTALMENT_VERSION)),SUM(previous.STD(cash.CNT_INSTALMENT_FUTURE)),MAX(cash.previous.AMT_ANNUITY),MAX(cash.previous.AMT_APPLICATION),MAX(cash.previous.DAYS_FIRST_DRAWING),MAX(cash.previous.DAYS_FIRST_DUE),MAX(cash.previous.DAYS_LAST_DUE),MAX(cash.previous.DAYS_LAST_DUE_1ST_VE
RSION),MAX(cash.previous.NFLAG_LAST_APPL_IN_DAY),MEAN(cash.previous.AMT_ANNUITY),MEAN(cash.previous.AMT_APPLICATION),MEAN(cash.previous.DAYS_FIRST_DRAWING),MEAN(cash.previous.DAYS_FIRST_DUE),MEAN(cash.previous.DAYS_LAST_DUE),MEAN(cash.previous.DAYS_LAST_DUE_1ST_VERSION),MEAN(cash.previous.NFLAG_LAST_APPL_IN_DAY),MEAN(cash.previous.SELLERPLACE_AREA),MIN(cash.previous.AMT_ANNUITY),MIN(cash.previous.AMT_APPLICATION),MIN(cash.previous.DAYS_FIRST_DRAWING),MIN(cash.previous.HOUR_APPR_PROCESS_START),MIN(cash.previous.NFLAG_INSURED_ON_APPROVAL),MIN(cash.previous.NFLAG_LAST_APPL_IN_DAY),MIN(cash.previous.SELLERPLACE_AREA),MODE(cash.previous.CHANNEL_TYPE),MODE(cash.previous.CODE_REJECT_REASON),MODE(cash.previous.NAME_CASH_LOAN_PURPOSE),MODE(cash.previous.NAME_CLIENT_TYPE),MODE(cash.previous.NAME_CONTRACT_STATUS),MODE(cash.previous.NAME_CONTRACT_TYPE),MODE(cash.previous.NAME_GOODS_CATEGORY),MODE(cash.previous.NAME_PAYMENT_TYPE),MODE(cash.previous.NAME_PORTFOLIO),MODE(cash.previous.NAME_PRODUCT_TYPE),MODE(cash.previous.NAME_SELLER_INDUSTRY),MODE(cash.previous.NAME_TYPE_SUITE),MODE(cash.previous.NAME_YIELD_GROUP),MODE(cash.previous.PRODUCT_COMBINATION),MODE(cash.previous.WEEKDAY_APPR_PROCESS_START),NUM_UNIQUE(cash.previous.CHANNEL_TYPE),NUM_UNIQUE(cash.previous.CODE_REJECT_REASON),NUM_UNIQUE(cash.previous.NAME_CASH_LOAN_PURPOSE),NUM_UNIQUE(cash.previous.NAME_CLIENT_TYPE),NUM_UNIQUE(cash.previous.NAME_CONTRACT_STATUS),NUM_UNIQUE(cash.previous.NAME_GOODS_CATEGORY),NUM_UNIQUE(cash.previous.NAME_PAYMENT_TYPE),NUM_UNIQUE(cash.previous.NAME_SELLER_INDUSTRY),NUM_UNIQUE(cash.previous.NAME_YIELD_GROUP),NUM_UNIQUE(cash.previous.PRODUCT_COMBINATION),NUM_UNIQUE(cash.previous.WEEKDAY_APPR_PROCESS_START),PERCENT_TRUE(cash.previous.FLAG_LAST_APPL_PER_CONTRACT),SKEW(cash.previous.AMT_ANNUITY),SKEW(cash.previous.AMT_APPLICATION),SKEW(cash.previous.AMT_DOWN_PAYMENT),SKEW(cash.previous.CNT_PAYMENT),SKEW(cash.previous.DAYS_DECISION),SKEW(cash.previous.DAYS_FIRST_DRAWING),SKEW(cash.previous.DAYS_FIR
ST_DUE),SKEW(cash.previous.DAYS_LAST_DUE),SKEW(cash.previous.DAYS_LAST_DUE_1ST_VERSION),SKEW(cash.previous.HOUR_APPR_PROCESS_START),SKEW(cash.previous.NFLAG_INSURED_ON_APPROVAL),SKEW(cash.previous.NFLAG_LAST_APPL_IN_DAY),SKEW(cash.previous.RATE_DOWN_PAYMENT),SKEW(cash.previous.SELLERPLACE_AREA),STD(cash.previous.AMT_ANNUITY),STD(cash.previous.AMT_DOWN_PAYMENT),STD(cash.previous.DAYS_FIRST_DRAWING),STD(cash.previous.DAYS_LAST_DUE),STD(cash.previous.HOUR_APPR_PROCESS_START),STD(cash.previous.NFLAG_INSURED_ON_APPROVAL),STD(cash.previous.RATE_DOWN_PAYMENT),SUM(cash.previous.AMT_ANNUITY),SUM(cash.previous.AMT_APPLICATION),SUM(cash.previous.AMT_DOWN_PAYMENT),SUM(cash.previous.DAYS_FIRST_DUE),SUM(cash.previous.DAYS_LAST_DUE),SUM(cash.previous.DAYS_LAST_DUE_1ST_VERSION),SUM(cash.previous.NFLAG_INSURED_ON_APPROVAL),SUM(cash.previous.RATE_DOWN_PAYMENT),SUM(cash.previous.RATE_INTEREST_PRIVILEGED),SUM(cash.previous.SELLERPLACE_AREA),MAX(instalments.previous.DAYS_FIRST_DUE),MAX(instalments.previous.DAYS_LAST_DUE),MAX(instalments.previous.NFLAG_LAST_APPL_IN_DAY),MEAN(instalments.previous.CNT_PAYMENT),MEAN(instalments.previous.DAYS_FIRST_DUE),MEAN(instalments.previous.DAYS_LAST_DUE),MEAN(instalments.previous.DAYS_TERMINATION),MIN(instalments.previous.CNT_PAYMENT),MODE(instalments.previous.CHANNEL_TYPE),MODE(instalments.previous.CODE_REJECT_REASON),MODE(instalments.previous.NAME_CASH_LOAN_PURPOSE),MODE(instalments.previous.NAME_CLIENT_TYPE),MODE(instalments.previous.NAME_CONTRACT_STATUS),MODE(instalments.previous.NAME_CONTRACT_TYPE),MODE(instalments.previous.NAME_GOODS_CATEGORY),MODE(instalments.previous.NAME_PAYMENT_TYPE),MODE(instalments.previous.NAME_PORTFOLIO),MODE(instalments.previous.NAME_PRODUCT_TYPE),MODE(instalments.previous.NAME_SELLER_INDUSTRY),MODE(instalments.previous.NAME_TYPE_SUITE),MODE(instalments.previous.NAME_YIELD_GROUP),MODE(instalments.previous.PRODUCT_COMBINATION),MODE(instalments.previous.WEEKDAY_APPR_PROCESS_START),NUM_UNIQUE(instalments.previous.CODE_REJEC
T_REASON),NUM_UNIQUE(instalments.previous.NAME_CONTRACT_STATUS),NUM_UNIQUE(instalments.previous.NAME_CONTRACT_TYPE),NUM_UNIQUE(instalments.previous.NAME_PAYMENT_TYPE),NUM_UNIQUE(instalments.previous.NAME_PRODUCT_TYPE),PERCENT_TRUE(instalments.previous.FLAG_LAST_APPL_PER_CONTRACT),SKEW(instalments.previous.AMT_ANNUITY),SKEW(instalments.previous.AMT_APPLICATION),SKEW(instalments.previous.AMT_CREDIT),SKEW(instalments.previous.AMT_GOODS_PRICE),SKEW(instalments.previous.CNT_PAYMENT),SKEW(instalments.previous.DAYS_DECISION),SKEW(instalments.previous.DAYS_FIRST_DRAWING),SKEW(instalments.previous.DAYS_LAST_DUE),SKEW(instalments.previous.DAYS_LAST_DUE_1ST_VERSION),SKEW(instalments.previous.DAYS_TERMINATION),SKEW(instalments.previous.HOUR_APPR_PROCESS_START),SKEW(instalments.previous.NFLAG_INSURED_ON_APPROVAL),SKEW(instalments.previous.SELLERPLACE_AREA),STD(instalments.previous.DAYS_LAST_DUE),SUM(instalments.previous.AMT_ANNUITY),SUM(instalments.previous.CNT_PAYMENT),SUM(instalments.previous.DAYS_FIRST_DUE),SUM(instalments.previous.DAYS_LAST_DUE),SUM(instalments.previous.DAYS_TERMINATION),SUM(instalments.previous.NFLAG_LAST_APPL_IN_DAY),SUM(instalments.previous.RATE_DOWN_PAYMENT),MAX(credit.previous.AMT_APPLICATION),MAX(credit.previous.DAYS_LAST_DUE),MAX(credit.previous.DAYS_TERMINATION),MAX(credit.previous.HOUR_APPR_PROCESS_START),MAX(credit.previous.NFLAG_INSURED_ON_APPROVAL),MAX(credit.previous.NFLAG_LAST_APPL_IN_DAY),MAX(credit.previous.SELLERPLACE_AREA),MODE(credit.previous.CHANNEL_TYPE),MODE(credit.previous.CODE_REJECT_REASON),MODE(credit.previous.NAME_CASH_LOAN_PURPOSE),MODE(credit.previous.NAME_CLIENT_TYPE),MODE(credit.previous.NAME_CONTRACT_STATUS),MODE(credit.previous.NAME_CONTRACT_TYPE),MODE(credit.previous.NAME_GOODS_CATEGORY),MODE(credit.previous.NAME_PAYMENT_TYPE),MODE(credit.previous.NAME_PORTFOLIO),MODE(credit.previous.NAME_PRODUCT_TYPE),MODE(credit.previous.NAME_SELLER_INDUSTRY),MODE(credit.previous.NAME_YIELD_GROUP),MODE(credit.previous.PRODUCT_COMBINATION),
MODE(credit.previous.WEEKDAY_APPR_PROCESS_START),NUM_UNIQUE(credit.previous.CHANNEL_TYPE),NUM_UNIQUE(credit.previous.CODE_REJECT_REASON),NUM_UNIQUE(credit.previous.NAME_CASH_LOAN_PURPOSE),NUM_UNIQUE(credit.previous.NAME_CLIENT_TYPE),NUM_UNIQUE(credit.previous.NAME_CONTRACT_STATUS),NUM_UNIQUE(credit.previous.NAME_CONTRACT_TYPE),NUM_UNIQUE(credit.previous.NAME_GOODS_CATEGORY),NUM_UNIQUE(credit.previous.NAME_PAYMENT_TYPE),NUM_UNIQUE(credit.previous.NAME_PORTFOLIO),NUM_UNIQUE(credit.previous.NAME_PRODUCT_TYPE),NUM_UNIQUE(credit.previous.NAME_SELLER_INDUSTRY),NUM_UNIQUE(credit.previous.NAME_TYPE_SUITE),NUM_UNIQUE(credit.previous.NAME_YIELD_GROUP),NUM_UNIQUE(credit.previous.PRODUCT_COMBINATION),NUM_UNIQUE(credit.previous.WEEKDAY_APPR_PROCESS_START),PERCENT_TRUE(credit.previous.FLAG_LAST_APPL_PER_CONTRACT),SKEW(credit.previous.AMT_ANNUITY),SKEW(credit.previous.AMT_APPLICATION),SKEW(credit.previous.AMT_GOODS_PRICE),SKEW(credit.previous.CNT_PAYMENT),SKEW(credit.previous.DAYS_DECISION),SKEW(credit.previous.DAYS_FIRST_DRAWING),SKEW(credit.previous.DAYS_LAST_DUE),SKEW(credit.previous.DAYS_LAST_DUE_1ST_VERSION),SKEW(credit.previous.HOUR_APPR_PROCESS_START),SKEW(credit.previous.NFLAG_INSURED_ON_APPROVAL),SKEW(credit.previous.NFLAG_LAST_APPL_IN_DAY),SKEW(credit.previous.SELLERPLACE_AREA),STD(credit.previous.AMT_ANNUITY),STD(credit.previous.AMT_APPLICATION),STD(credit.previous.AMT_GOODS_PRICE),STD(credit.previous.CNT_PAYMENT),STD(credit.previous.DAYS_DECISION),STD(credit.previous.DAYS_FIRST_DRAWING),STD(credit.previous.DAYS_FIRST_DUE),STD(credit.previous.DAYS_LAST_DUE),STD(credit.previous.DAYS_LAST_DUE_1ST_VERSION),STD(credit.previous.DAYS_TERMINATION),STD(credit.previous.HOUR_APPR_PROCESS_START),STD(credit.previous.NFLAG_INSURED_ON_APPROVAL),STD(credit.previous.NFLAG_LAST_APPL_IN_DAY),STD(credit.previous.SELLERPLACE_AREA),SUM(credit.previous.AMT_ANNUITY),SUM(credit.previous.AMT_APPLICATION),SUM(credit.previous.AMT_CREDIT),SUM(credit.previous.AMT_DOWN_PAYMENT),SUM(credit.previous.A
MT_GOODS_PRICE),SUM(credit.previous.DAYS_FIRST_DUE),SUM(credit.previous.DAYS_LAST_DUE),SUM(credit.previous.DAYS_TERMINATION),SUM(credit.previous.SELLERPLACE_AREA)
SK_ID_CURR
307474,0,5,,0.0,3.116422,0.0,0.0,0.0,0.0,0.0,-191.0,0.239951,0.355913,-99.0,0.0,0.466834,0.0,0.0,0.0,0.0,-908.6,-0.00705,-0.102181,-498.8,,0.0,-0.404469,0.0,0.0,0.0,0.0,0.0,-2203.0,-0.434756,-0.64868,-970.0,Closed,currency 1,Consumer credit,1.0,1.0,1.0,,,2.176823,0.0,0.0,0.0,0.0,-1.360959,-1.451868,-0.451498,-0.426609,800.751709,366.714739,2.33417,-0.03525,-0.510907,0,,,,,,2,-0.168764,-0.041398,-0.228814,-0.222222,-19,0,299.564648,294.744561,0.419027,16,1,0,-0.601128,6.223529,-0.244675,-0.13789,-0.228814,-0.277778,-105.0,0.0,150.05401,147.5834,0.334423,15.0,0.5,0.0,-0.601128,3.329412,-0.320586,-0.234383,-0.228814,-0.430156,-0.333333,-191,0,0.543372,0.42224,14,0,0,-0.601128,0.435294,Country-wide,XAP,XAP,New,Approved,Consumer loans,Furniture,Cash through the bank,POS,XNA,Furniture,Unaccompanied,low_normal,POS household without interest,SUNDAY,2,1,1,2,1,1,1,1,1,1,1,2,2,2,1.0,,,,,,,,,,,,,,,,0.078567,121.622366,1.414214,0.707107,0.0,-0.48935,-0.275781,-0.555556,-210.0,295.166801,1.0,0.0,-1.202257,0.0,0.0,6.658824,7,-0.285714,-1,0,0,307474,-0.408163,-3.142857,0.0,0.0,-0.428571,-0.545455,0,0,Active,2,2.645751,0.367277,-0.288195,0.0,0.0,0,0.053995,1.9518,0,-2.857143,-22.0,0.0,2152318.0,6,-0.191197,-18,-0.181818,0,-0.191202,-0.409091,0.0,-0.191225,-0.636364,-2.44949,0.434577,0.0,0.0,0,0,-1.147209,0.0,1844844.0,0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,,,0.0,,,,0.0,,,,,,0.0,0.0,,,,0.0,,,,0.0,0.0,,,,,,,,,,,,,,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,6,0.0,-0.191202,-0.285714,0.090909,0,0,,,,,-0.191225,-0.636364,0,0.0,0.0,0.0,0,,,-2.44949,0.434577,0.0,0.0,0.0,0,0.0,0,,1.1e-05,0,-1,1844844,0.0,0,0.0,0,0,1844844,3.5,0.0,3.0,-0.181818,0.0,-0.191202,-0.409091,-0.357143,-0.227273,0,0,,,,,-0.191225,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0,,,-2.44949,0.434577,0.0,0.0,0.0,0,0.0,0.170075,1.870829,0,,66.369923,0,-1.428571,-11.0,0.0,0.0,1076159.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-0.573605,-327.5,-1.227273,0.0,922422.0,1,0,-0.428571,0,
-0.191197,-0.181818,0,0.0,-0.191202,-0.409091,1,0.0,0.0,0.0,0.0,0.0,0,,-2.44949,0.434577,0.0,0.0,0.0,0,0.0,0.170075,1.870829,0,,66.369923,0.0,0,-2.571429,-1.909091,-21,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,-1.147209,-655,-2.454545,Active,,2,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,3.535534,0.0,,0.0,,0.0,0.0,0.0,,,0.0,,,,,,,,,,1087085.0,0.0,0.0,-0.714286,-2.0,614948.0,0.0,-0.409091,-0.714286,-0.454545,-0.636364,0.0,0.0,0.0,-2.44949,0.434577,0.0,0.0,0.170075,-0.168764,-0.041398,0,299.564648,294.744561,0.419027,0,-0.298897,-0.206814,0,43.260697,42.468286,0.273991,0.0,5.396639,-0.320586,-0.234383,0,14,0,0,0.435294,Stone,XAP,XAP,New,Approved,Consumer loans,Furniture,Cash through the bank,POS,XNA,Furniture,Unaccompanied,middle,POS industry with interest,SUNDAY,2,1,1,2,1,1,1,1,2,2,2,1,2.645751,2.645751,0.0,2.645751,2.645751,0,2.645751,2.645751,2.645751,2.645751,-2.645751,0.0,0.0,-2.645751,0.057384,0.0,0,111.243381,0.755929,0.377964,0.0,-2.092281,-1.447695,-1.601695,302.824877,297.278002,1.917938,6.0,-4.207898,0.0,37.776471,0.543372,0.42224,0,-0.333333,0.543372,0.42224,0.39478,-0.333333,Stone,XAP,XAP,New,Approved,Consumer loans,Furniture,Cash through the bank,POS,XNA,Furniture,Unaccompanied,middle,POS industry with interest,SUNDAY,1,1,1,1,1,1,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.923517,-2.0,3.260229,2.533441,2.368679,0.0,-3.60677,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
412537,0,4,0.033333,0.0,0.317765,0.639861,176852.565,0.0,2.0,0.0,-813.0,17.13791,0.41791,-1345.0,0.0,-0.039028,0.159965,44213.14125,0.5,0.0,-1584.75,3.694341,-0.858496,-1607.333333,0.0,0.0,-0.288009,0.0,0.0,0.0,0.0,0.0,-2035.0,-0.917749,-1.413318,-1821.0,Closed,currency 1,Consumer credit,2.0,1.0,2.0,2.0,,0.85503,2.0,2.0,0.0,2.0,1.372617,1.998907,1.835486,0.869337,547.035876,241.702986,-0.156111,14.777365,-3.433984,210,0.0,-28.061905,C,3.0,-0.261615,2,-0.241806,0.090837,0.850452,0.0,-172,0,0.255319,0.283642,0.124909,10,0,0,0.399038,0.741176,-0.241806,-0.176915,0.850452,0.0,-362.5,0.0,0.255319,0.283642,0.124909,10.0,0.0,0.0,0.399038,0.347059,-0.241806,-0.444667,0.850452,-0.138372,0.0,-553,0,0.255319,0.283642,10,0,0,0.399038,-0.047059,Country-wide,XAP,XAP,New,Approved,Cash loans,Mobile,Cash through the bank,POS,XNA,Connectivity,,XNA,Cash,FRIDAY,2,1,2,2,2,2,2,2,1,2,0,2,2,2,1.0,,,,,,,,,,,,,,,,,269.407684,0.0,,,-0.241806,-0.35383,0.0,-725.0,0.283642,0.0,0.0,0.399038,-1.431798,0.637949,0.694118,13,0.0,-6,0,0,412537,0.0,-12.0,0.0,0.0,0.0,-0.636364,0,0,Active,2,0.0,0.0,0.0,0.0,0.0,0,0.0,3.89444,0,0.0,-156.0,0.0,5362981.0,14,-0.106073,-186,0.363636,0,-0.106121,-0.084416,0.0,-0.106124,-0.636364,3.741657,-0.349893,-0.304246,0.0,0,0,-1.485691,0.0,5775518.0,0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,67.0,3.0,0.0,-351.0,0.0,2.75,0.0,27.0,0.0,-33.0,2.0,0.0,C,1.0,-1.416795,0.0,1.416795,-2.0,1.070441,8.995369,0.5,0.0,0.0,-103.0,Closed,currency 1,Consumer 
credit,0.782038,0.0,1.022807,2.235304,2.235304,0.0,1.491622,2.233785,2.02413,0.52767,389.92582,-22.295292,17.276235,4775019.255,54.0,0.0,316.596784,-226.030999,13,0.0,-0.106121,0.0,-0.636364,0,0,,,,,-0.106124,-0.636364,0,0.0,0.0,0.0,0,,,3.741657,-0.349893,-0.304246,-0.304246,0.0,0,0.0,0,,1.4e-05,0,0,5362981,0.0,0,0.0,0,0,5775518,6.5,0.0,7.0,0.363636,0.0,-0.106121,-0.084416,0.0,-0.636364,0,0,,,,,-0.106124,0.0,2.0,0.0,0.0,0.0,0.0,0.0,0,,,3.741657,-0.349893,-0.304246,-0.304246,0.0,0,0.0,0.35404,3.89444,0,,111.16061,0,0.0,-78.0,0.0,0.0,2681490.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-0.742846,-2378.0,-0.590909,0.0,2887759.0,0,0,0.0,0,-0.106073,0.363636,0,0.0,-0.106121,-0.084416,2,0.0,0.0,0.0,0.0,0.0,0,,3.741657,-0.349893,-0.304246,-0.304246,0.0,0,0.0,0.35404,3.89444,0,,111.16061,0.0,0,0.0,-1.181818,-156,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,-1.485691,-4756,-1.181818,Active,,1,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,9.192388,,,,,,,,,,,,,,,,,,,,3792200.0,0.0,0.0,0.0,-6.0,412537.0,0.0,-0.084416,0.0,-0.636364,-0.636364,0.0,0.0,0.0,3.741657,-0.349893,-0.304246,0.0,0.35404,-0.241806,0.090837,0,0.255319,0.283642,0.124909,0,-0.241806,0.090837,0,0.255319,0.283642,0.124909,0.0,0.741176,-0.241806,0.090837,0,10,0,0,0.741176,Country-wide,XAP,XAP,New,Approved,Consumer loans,Mobile,Cash through the bank,POS,XNA,Connectivity,,middle,POS mobile with interest,TUESDAY,1,1,1,1,1,1,1,1,1,1,1,1,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,-3.143472,1.180884,11.055876,3.319149,3.687349,1.62382,0.0,5.187488,8.29334,9.635294,0.255319,0.283642,0,0.0,0.255319,0.283642,0.257749,0.0,Country-wide,XAP,XAP,New,Approved,Consumer loans,Mobile,Cash through the bank,POS,XNA,Connectivity,,middle,POS mobile with interest,TUESDAY,1,1,1,1,1,1,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-3.385278,0.0,3.574468,3.970991,3.608483,0.0,5.586526,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
149084,0,1,,,0.588603,0.0,0.0,0.0,0.0,0.0,-1553.0,-0.074212,0.301952,-457.0,,0.588603,0.0,0.0,0.0,0.0,-1553.0,-0.074212,0.301952,-457.0,,,0.588603,0.0,0.0,0.0,0.0,0.0,-1553.0,-0.074212,0.301952,-457.0,Closed,currency 1,Consumer credit,1.0,1.0,1.0,,,,,,,,,,,,,,0.588603,-0.074212,0.301952,0,,,,,,3,0.800161,0.333244,-0.228814,0.666667,-1242,0,299.564648,294.744561,265.503268,18,1,0,-0.601128,1.376471,0.012694,-0.091904,-0.228814,0.0,-1363.666667,0.0,99.544463,98.060704,88.244977,16.333333,0.333333,0.0,-0.601128,0.427451,-0.399665,-0.444667,-0.228814,-0.367268,-0.666667,-1607,0,-0.614566,-0.282836,13,0,0,-0.601128,-0.047059,Credit and cash offices,XAP,XAP,Repeater,Approved,Cash loans,XNA,Cash through the bank,Cards,x-sell,XNA,Unaccompanied,XNA,Card X-Sell,MONDAY,2,1,2,2,1,3,2,2,2,2,1,3,3,2,1.0,1.726244,0.79887,1.518348,,,0.0,-1.732051,1.732045,1.732051,1.732051,1.732051,-1.732051,1.732051,,1.732051,0.666667,210.732848,2.886751,0.57735,,0.038081,-0.275711,0.0,-4091.0,294.182111,1.0,0.0,-0.601128,0.0,0.0,1.282353,36,0.857143,-29,0,0,149084,0.571429,-38.5,0.0,0.0,0.0,-0.636364,0,0,Active,2,-0.738229,0.514643,-0.459169,0.0,0.0,0,0.409793,6.720969,0,20.571429,-1386.0,0.0,5367024.0,36,-0.236367,-889,1.454545,1,-0.263911,0.227273,0.027778,-0.290651,-0.636364,0.708283,-0.458617,0.459169,6.0,0,0,-9.500791,1.0,5367024.0,39,0.0,1.75,,0.0,,,,,0.0,,0.0,-2.0,0.0,0.0,0.0,,0.0,,,,,0.0,,0.0,0.0,-0.5,,0.0,,,0.0,,0.0,,0.0,,,0.0,0.0,0.0,Active,1.0,0.0,-0.735711,,0.0,,,0.0,,0.0,0.0,0.0,0.0,0.0,1.074526,,,0.0,0.0,39.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,,,0.0,,,,0.0,,,,,,0.0,,,,,,,,,0.0,0.0,,,,,,,,,,,,,,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,24,0.0,-0.237379,0.857143,-0.636364,0,0,,,0.0,0.0,-0.248519,-0.636364,0,0.0,0.062819,0.0,0,0.0,0.0,-3.464102,0.005038,0.0,0.0,3.464102,0,0.0,0,0.0,0.003508,0,0,3578016,39.0,0,0.0,0,1,3578016,12.0,13.0,12.0,0.909091,0.5,-0.257278,0.136364,0.428571,-0.636364,0,0,,,0.0,0.0,-0.269585,0.0,2.0,0.0,0.031409,0.0,0.0,0.0,0,0.0,0.0,-4.181541,0.002913,0.0,0.0,1.732051,0,0.0,
0.491643,5.33831,0,0.0,162.215305,0,6.857143,-462.0,0.0,0.0,1789008.0,0.0,13.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-3.16693,-14132.0,2.727273,0.333333,1789008.0,0,0,0.0,0,-0.276591,0.363636,0,0.0,-0.277177,-0.136364,2,0.0,0.0,0.0,0.0,0.0,0,0.0,-4.898979,0.000787,0.0,0.0,0.0,0,0.0,0.327777,3.605551,0,0.0,109.524973,0.0,0,0.0,-1.636364,-972,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,-6.652238,-29723,-1.636364,Active,Active,1,0.0,0.0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,1.732051,1.601488,-0.438539,1.732051,0.0,0.0,0.0,0.0,0.0,-0.425362,-0.435979,1.597097,1.732051,12.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.04442,0.0,0.0,1.014612,1.014612,0.003006,0.0,2.44949,0.0,1789008.0,22.51666,0.57735,0.857143,-58.0,298168.0,1.0,0.272727,0.857143,-1.272727,-1.272727,0.0,0.062819,0.0,-8.363081,0.005825,0.0,3.464102,0.983286,-0.362415,0.333244,0,-0.316694,-0.279613,-0.382716,0,-0.387248,0.1674,0,-0.515276,-0.281762,-0.384653,0.0,0.901961,-0.399665,-0.164288,0,13,0,0,-0.047059,Regional / Local,XAP,XAP,New,Approved,Consumer loans,Audio/Video,Cash through the bank,POS,XNA,Consumer electronics,Unaccompanied,low_action,POS household without interest,SUNDAY,2,1,2,2,1,2,1,2,2,2,2,1,0.738229,-0.738229,0.0,-0.738229,0.738229,0,0.738229,0.738229,0.738229,0.738229,0.738229,0.0,0.0,-0.738229,0.017808,0.0,0,0.001541,2.390457,0.478091,0.0,-13.940935,6.026411,-5.491525,-18.549918,-10.143433,-13.847495,12.0,-14.427079,0.0,32.470588,-0.316694,-0.279613,0,0.444444,-0.515276,-0.281762,-0.314845,0.0,Regional / Local,XAP,XAP,New,Approved,Consumer loans,Audio/Video,Cash through the bank,POS,XNA,Consumer electronics,Unaccompanied,low_action,POS household without interest,SUNDAY,1,1,2,1,2,1,0.738229,-0.738229,-0.738229,-0.738229,-0.738229,0.738229,0.0,0.738229,0.738229,0.0,0.738229,0.738229,-0.738229,0.001541,-13.940935,16.0,-18.549918,-10.143433,-11.334421,0.0,-14.427079,-0.444667,294.744561,298.316476,18.0,0.0,0.0,-0.047059,Credit and cash offices,XAP,XAP,Repeater,Approved,Revolving 
loans,XNA,XNA,Cards,x-sell,XNA,XNA,Card X-Sell,MONDAY,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,0.0,0.0,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,31.206291,-17.342018,75.681611,0.0,0.0,11683.021277,11495.037873,11634.342577,-1.835294
364692,0,0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,0.0,0.0,0.0,0,,,,,,7,2.648188,4.321776,1.460452,1.333333,-408,0,0.366612,294.744561,0.63907,18,1,0,0.32546,0.141176,0.910624,1.121602,0.334746,0.222222,-667.571429,0.0,0.123568,73.686946,0.095134,12.714286,0.25,0.0,-0.14899,0.052101,-0.119867,-0.126773,-0.228814,-0.33361,-0.333333,-1132,0,-0.175941,-0.334408,9,0,0,-0.601128,-0.047059,Country-wide,XAP,XAP,Repeater,Approved,Consumer loans,Mobile,Cash through the bank,POS,XNA,Connectivity,Unaccompanied,high,POS mobile with interest,SATURDAY,3,3,2,2,2,2,3,2,3,2,1,1,3,4,1.0,1.295815,2.038661,2.016061,1.410663,2.038661,1.476876,-0.669915,-0.552653,1.999979,0.376704,1.999979,0.946518,2.0,0.006315,-0.211074,0.57735,300.009365,3.40168,0.5,0.522402,6.374367,7.851216,1.555556,-4673.0,294.747784,1.0,0.0,-0.595959,0.0,0.0,0.364706,45,1.714286,-1,0,0,364692,0.621262,-15.444444,0.0,0.0,-0.571429,-0.636364,0,0,Active,4,0.28247,0.43453,-0.653564,0.0,0.0,0,0.947035,10.362861,0,26.714286,-695.0,0.0,16411140.0,40,9.055535,-21,0.909091,1,2.047142,-0.109091,0.05,0.025552,-0.636364,2.031646,-0.851125,0.79927,4.292145,0,0,81.885681,2.0,14587680.0,0,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,,,,,,,,,,,,,,,,,,,,,,,0.0,0.0,,,,,,,,,,,,,,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,18,0.0,3.014675,1.714286,1.090909,0,0,,,,,3.014675,-0.636364,0,0.0,0.0,0.0,0,,,2.828427,1.658524,0.0,0.0,2.828427,0,0.095238,0,,2.675855,0,0,6564456,0.0,0,54.264155,0,1,6564456,6.428571,0.0,5.714286,0.181818,0.5,1.619848,-0.227273,0.178571,-0.204545,0,0,,,,,1.299733,0.0,2.25,-1.309017,-0.51193,0.0,0.0,0.0,0,,,0.416537,0.48992,0.0,0.0,1.207107,0,0.039781,0.331846,3.388506,0,,92.41947,0,3.816327,-99.285714,0.0,0.0,2344449.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,11.697954,-2431.0,-0.623377,0.285714,2083954.0,0,0,-0.428571,0,0.671697,-0.363636,0,0.0,0.359894,-0.5,1,-3.0,-1.032659,0.0,0.0,0.0,0,,-3.162278,0.011975,0.0,0.0,0.0,0,0.0,0.209288,2.160247,0,,51.1101
75,0.0,0,-2.285714,-2.0,-238,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0.0,-5263,-2.545455,Active,,1,0.614289,1.035491,1.794946,-0.668361,0.0,1.627293,0.940661,0.0,0.0,1.874393,0.0,0.120262,0.854563,0.0,1.821731,2.0,0.0,0.0,0.0,0.848518,0.848518,0.0,0.0,-0.854563,0.360589,0.77073,1.332309,0.0,1.658808,1.658808,1.473347,0.280765,0.0,2.613479,2.617653,-0.225886,0.0,0.0,0.0,0.0,0.0,0.0,2.071416,-0.014672,0.652254,1.229634,6.900656,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.957427,0.591169,0.0,0.0,2.664957,2.664957,0.789015,0.0,1.434291,0.0,2516614.0,0.0,0.48795,1.142857,-53.0,1458768.0,2.0,-0.909091,0.714286,-0.818182,-2.545455,-5.236068,-2.047718,0.0,1.666149,1.959679,0.0,4.828427,1.327382,2.648188,4.321776,0,0.366612,294.744561,0.63907,0,1.404306,1.996665,0,0.172068,117.92866,0.229194,0.0,0.019085,-0.119867,-0.126773,0,11,0,0,-0.047059,Country-wide,XAP,XNA,Repeater,Approved,Cash loans,XNA,Cash through the bank,Cash,x-sell,XNA,Unaccompanied,high,Cash X-Sell: high,SATURDAY,3,1,2,2,1,3,2,2,1,2,3,1,0.01905,0.334353,-0.498374,0.361022,-0.991052,0,-0.850786,0.422456,-0.259197,0.597638,0.422463,0.0,-0.498374,0.561899,1.105801,0.563981,0,146.001082,2.918592,0.495434,0.022385,63.193783,89.849943,18.418079,7.743044,5306.789686,10.313725,18.0,5.545924,0.0,0.858824,0.366612,294.744561,0,0.538889,0.196154,132.686382,134.278222,-0.333333,Credit and cash offices,XAP,XNA,Repeater,Approved,Cash loans,XNA,Cash through the bank,Cash,x-sell,XNA,Unaccompanied,high,Cash X-Sell: high,SATURDAY,1,1,2,2,2,1,-0.147465,0.137461,0.137504,0.137461,0.162649,-1.266611,0.0,0.20892,-0.411396,0.20892,0.863566,0.208925,0.756647,148.45469,61.841331,21.555556,7.846154,5307.455278,5371.128874,0.0,4.377953,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
155423,0,5,0.0,,1.072048,2.653891,0.0,0.0,0.0,0.0,-377.0,1.104515,0.453502,-420.0,,0.30973,0.922255,0.0,0.0,0.0,-1321.4,0.052938,-0.143513,-1230.333333,0.0,,-0.043352,0.0,0.0,0.0,0.0,0.0,-2870.0,-1.095857,-0.755454,-2109.0,Closed,currency 1,Consumer credit,2.0,1.0,2.0,,,1.936462,0.772377,0.0,0.0,0.0,-0.913626,-0.042391,-0.270792,-0.360863,1022.646224,846.570926,1.54865,0.264688,-0.717566,204,0.0,-33.612745,C,3.0,-0.678273,5,1.659166,4.882535,-0.228814,1.333333,-364,0,0.4018,294.744561,265.503268,16,1,0,-0.601128,27.929412,0.191106,0.730491,-0.228814,0.083333,-1261.4,-91958.25,-0.509411,73.454472,66.255265,12.4,0.5,0.0,-0.601128,7.541176,-0.416058,-0.444667,-0.228814,-0.386436,-0.666667,-2592,-367833,-1.396072,-1.195004,11,0,0,-0.601128,-0.047059,Credit and cash offices,XAP,XAP,Repeater,Approved,Cash loans,XNA,Cash through the bank,POS,XNA,XNA,Family,XNA,Card Street,FRIDAY,3,1,2,1,2,3,3,2,2,4,2,4,5,3,1.0,1.931201,2.216528,2.152257,,1.731837,1.443059,-0.594238,0.016577,1.999891,1.999775,-4.9e-05,1.530961,0.0,,1.734007,0.876652,1124.401085,2.19089,0.57735,0.0,0.764423,3.652456,0.333333,-6307.0,293.817889,2.0,0.0,-1.202257,0.0,0.0,37.705882,31,1.714286,-1,5,5,155423,0.56682,-34.806452,0.193548,0.193548,-0.428571,-0.636364,0,0,Active,2,0.386046,0.446475,-0.432959,5.266617,5.266617,0,0.941474,31.683244,0,17.571429,-1079.0,6.0,4818113.0,124,15.690886,-5,7.636364,0,0.236179,2.640029,-0.758065,-0.703277,-0.636364,8.179173,-0.120931,0.349199,1.219998,0,0,29.286165,-94.0,19272452.0,86,2.740036,0.75,144000.0,144000.0,134415.0,0.0,25.070217,2.0,2.0,1.0,74.0,-1.0,1.0,1.0,1.566333,3282.352941,4807.151163,1581.352941,0.0,0.756681,0.070588,0.081395,0.011765,0.05814,0.0,-0.75,0.0,0.0,0.0,0.0,0.0,-0.506981,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,Active,2.0,-0.326797,-0.471546,6.938667,5.463719,9.219544,0.0,-0.429391,8.601863,-0.109367,0.0,3.843842,3.843842,0.0,0.504909,17406.116717,0.300327,0.0,134.704601,20.25,279000.0,413415.0,134415.0,0.0,6.0,7.0,1.0,5.0,80.0,3.0,0.0,-78.0,-3.0,2.2,0.0,13.
0,-15.0,-54.5,2.0,0.0,C,2.0,0.511836,-2.236068,-1.242917,2.236068,-1.647379,19.790781,0.447214,0.0,-15.0,-114.5,Closed,currency 1,Consumer credit,0.0,,1.458402,2.179362,0.0,0.0,0.189886,0.942422,0.944656,0.301945,889.227572,68.082221,65.81871,0.0,0.0,0.0,-91.641311,-92.189437,12,0.416667,1.946666,1.714286,1.636364,0,0,0.0,0.0,0.0,0.0,1.946666,-0.636364,0,0.0,0.0,0.0,0,0.0,0.0,9.013448,0.028597,0.0,0.0,0.0,0,0.0,0,0.0,1.664149,0,0,1865076,20.25,0,23.359997,0,0,14609762,6.2,17.2,24.8,2.045455,-0.25,0.434305,0.715426,0.428571,0.151515,0,0,0.0,0.0,0.0,0.0,0.231517,-0.25,1.333333,0.0,-0.025968,0.0,2.309401,2.309401,0,0.0,0.0,0.774964,-0.010855,-0.021513,-0.010289,0.0,0,0.0,0.287705,3.123783,0,0.0,252.062332,0,3.514286,-215.8,1.2,1.2,963622.6,26.94092,4.05,55800.0,82683.0,26883.0,0.0,1.2,1.4,0.2,703.0,1.0,1.0,5.857233,-29408.4,65.472727,-18.8,3854490.0,0,0,-0.428571,0,-0.294294,-0.181818,-1,0.0,-0.294834,-0.409091,1,0.0,-0.077904,0.0,0.0,0.0,0,0.0,-3.464102,-0.082609,-0.086054,-0.041155,0.0,0,0.0,0.196386,2.160247,0,0.0,54.83308,0.0,0,-3.0,-2.545455,-882,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,-3.538009,-115659,-2.454545,Active,Active,1,-0.164369,2.143929,1.457863,-1.474847,0.0,1.901324,1.97117,-2.0,0.0,1.625462,0.0,1.838896,-2.0,0.0,1.457863,1.722329,0.0,0.0,0.0,1.658373,1.658373,-2.0,0.0,1.732051,0.0,-1.693419,-1.732051,0.0,1.999994,1.999994,1.960987,0.0,0.0,2.130123,2.191094,-2.133473,2.236068,2.236068,0.0,2.236068,2.236068,2.236068,1.329994,-1.998041,2.23587,-2.236068,6.016644,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.5,0.0,0.57735,0.044978,0.0,0.0,5.681548,5.683285,0.049972,0.043027,0.0,0.0,935124.8,9.056075,42.038078,1.285714,-83.0,466269.0,-1.0,2.861702,1.285714,0.454545,-2.545455,0.0,-0.077904,0.0,3.099857,-0.043421,-0.086054,0.0,0.863116,1.659166,4.882535,0,0.4018,294.744561,0.777778,0,0.445386,1.783127,0,-0.281083,113.651738,-0.178228,0.0,2.318786,-0.416058,-0.185653,0,11,0,0,-0.047059,Country-wide,XAP,XAP,Repeater,Approved,Consumer loans,Mobile,Cash through the 
bank,POS,XNA,Connectivity,Family,high,Cash X-Sell: low,SATURDAY,3,1,2,1,1,3,1,3,3,3,2,1,0.45582,0.487428,0.0,0.386046,-0.457344,0,-0.457565,0.487463,-0.130822,0.487499,-1.379233,0.0,0.0,1.35227,0.985528,0.0,0,146.297781,2.475688,0.425024,0.0,13.806953,55.276937,-4.347458,-8.713584,3523.203868,-5.525054,24.0,-11.421437,0.0,71.882353,0.4018,294.744561,0,-0.392473,-1.13022,28.549674,254.895096,-0.666667,Stone,XAP,XAP,Repeater,Approved,Revolving loans,XNA,XNA,Cards,x-sell,Construction,Family,XNA,Card Street,WEDNESDAY,1,1,3,2,2,1,2.729938,2.743113,2.673684,0.429992,2.282591,2.025473,1.219998,2.761128,-1.22005,-2.039396,0.840191,1.570405,-1.294075,87.487149,-16.120491,-48.666667,-140.1473,3540.159549,31606.991843,0.0,-10.820309,-0.444667,0.181305,298.316476,13.0,0.0,0.0,27.929412,Stone,XAP,XAP,Repeater,Approved,Revolving loans,XNA,XNA,Cards,x-sell,Construction,XNA,Card Street,WEDNESDAY,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,0.0,1.0,1.0,1.0,1.0,0.0,0.0,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-27.525549,-38.241373,24.482498,0.0,0.0,-120.062193,15.592264,25655.216966,2401.929412


In [17]:
summarize_dataframe(fm4)


Data Total Number of Each Type:
 float64     487
Int64       209
int64         7
category      4
category      4
category      4
category      4
category      4
category      4
category      4
category      4
category      4
category      4
category      4
category      4
category      4
category      4
category      3
category      2
category      2
category      2
category      2
category      2
category      2
Name: count, dtype: int64

Float Types Count:
 MAX(bureau.AMT_ANNUITY)                   4346
MAX(bureau.AMT_CREDIT_MAX_OVERDUE)        5847
MAX(bureau.AMT_CREDIT_SUM)               10014
MAX(bureau.AMT_CREDIT_SUM_DEBT)          17748
MAX(bureau.AMT_CREDIT_SUM_LIMIT)          3844
                                         ...  
SUM(credit.previous.AMT_GOODS_PRICE)       587
SUM(credit.previous.DAYS_FIRST_DUE)       3641
SUM(credit.previous.DAYS_LAST_DUE)        2082
SUM(credit.previous.DAYS_TERMINATION)     1102
SUM(credit.previous.SELLERPLACE_AREA)     2050
Length: 487, dtype

In [18]:
columns_to_check = ['SK_ID_BUREAU', 'SK_ID_CURR', 'SK_ID_PREV', 'TARGET']

# Check if each column exists in the DataFrame and print the result
for column in columns_to_check:
    if column in fm4.columns:
        print(f"Column '{column}' is in the DataFrame.")
    else:
        print(f"Column '{column}' is NOT in the DataFrame.")

Column 'SK_ID_BUREAU' is NOT in the DataFrame.
Column 'SK_ID_CURR' is NOT in the DataFrame.
Column 'SK_ID_PREV' is NOT in the DataFrame.
Column 'TARGET' is in the DataFrame.


In [19]:
# Work on a copy and separate the label from the features
fm5 = fm4.copy()
target = fm5['TARGET']
fm5 = fm5.drop(columns='TARGET')
# Convert 'object' columns to categorical features
fm5 = convert_to_categorical(fm5, min_unique=2, max_unique=20)

Converted columns to categorical: []
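
The `convert_to_categorical` helper is not shown in this section. Here is a minimal sketch of what such a helper might look like, assuming it casts `object` columns whose number of distinct values falls between `min_unique` and `max_unique`. Only the name and parameters come from the call above; the body and the demo data are assumptions.

```python
import pandas as pd


def convert_to_categorical(df, min_unique=2, max_unique=20):
    """Hypothetical sketch: cast low-cardinality object columns to 'category'."""
    df = df.copy()
    converted = []
    for col in df.select_dtypes(include="object").columns:
        n_unique = df[col].nunique(dropna=True)
        if min_unique <= n_unique <= max_unique:
            df[col] = df[col].astype("category")
            converted.append(col)
    print(f"Converted columns to categorical: {converted}")
    return df


# Made-up demo data, not taken from the notebook
demo = pd.DataFrame({
    "status": pd.Series(["Active", "Closed", "Active"], dtype=object),
    "amount": [1.0, 2.5, 3.0],  # numeric, left untouched
})
demo = convert_to_categorical(demo)
```

Under this assumption, the empty list printed above would mean that `fm5` had no `object` columns with 2 to 20 distinct values left to convert.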


In [20]:
# perform one-hot encoding for categorical features
fm5 = encode_and_one_hot(fm5)

DataFrame shape before encoding: (30751, 773)
0 columns were label encoded.
DataFrame shape after encoding: (30751, 1258)
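
`encode_and_one_hot` is also defined elsewhere. Judging only from its printed messages, it may label-encode some columns and one-hot encode the rest. The sketch below is a guess at that behaviour, not the notebook's actual code: the rule "label-encode `object` columns with at most two values" is an assumption, and the demo data is made up.

```python
import pandas as pd


def encode_and_one_hot(df):
    """Hypothetical sketch: label-encode binary object columns, one-hot the rest."""
    df = df.copy()
    print(f"DataFrame shape before encoding: {df.shape}")
    label_encoded = 0
    for col in df.select_dtypes(include="object").columns:
        if df[col].nunique(dropna=True) <= 2:
            # factorize maps each distinct value to an integer code (NaN becomes -1)
            df[col] = pd.factorize(df[col])[0]
            label_encoded += 1
    print(f"{label_encoded} columns were label encoded.")
    # One-hot encode whatever object/category columns remain
    to_encode = list(df.select_dtypes(include=["object", "category"]).columns)
    if to_encode:
        df = pd.get_dummies(df, columns=to_encode)
    print(f"DataFrame shape after encoding: {df.shape}")
    return df


demo = pd.DataFrame({
    "contract": pd.Categorical(["Cash loans", "Consumer loans", "Revolving loans"]),
    "amount": [1.0, 2.5, 3.0],
})
encoded = encode_and_one_hot(demo)
```

In the notebook's output the column count grows from 773 to 1258, a net increase of 485 columns from the encoding step.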


In [26]:
# Load df3, df4 and their *_new_features variants, indexed by SK_ID_CURR,
# and drop any stray 'Unnamed: N' columns
df3 = pd.read_csv(f'{config.CleanDataPath}df3.csv', index_col='SK_ID_CURR')
df3 = df3.loc[:, ~df3.columns.str.contains('^Unnamed')]
df3_new_features = pd.read_csv(f'{config.CleanDataPath}df3_new_features.csv', index_col='SK_ID_CURR')
df3_new_features = df3_new_features.loc[:, ~df3_new_features.columns.str.contains('^Unnamed')]

df4 = pd.read_csv(f'{config.CleanDataPath}df4.csv', index_col='SK_ID_CURR')
df4 = df4.loc[:, ~df4.columns.str.contains('^Unnamed')]
df4_new_features = pd.read_csv(f'{config.CleanDataPath}df4_new_features.csv', index_col='SK_ID_CURR')
df4_new_features = df4_new_features.loc[:, ~df4_new_features.columns.str.contains('^Unnamed')]
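
The `^Unnamed` filter presumably guards against leftover index columns. pandas names a column `Unnamed: 0` when a CSV was written with an unlabeled index and is then read back without `index_col`. A small illustration with made-up data:

```python
import io
import pandas as pd

df = pd.DataFrame({"a": [1, 2]}, index=[100, 200])

# Writing an unnamed index leaves the first header cell empty ...
buf = io.StringIO()
df.to_csv(buf)
buf.seek(0)

# ... so reading it back without index_col yields an 'Unnamed: 0' column.
roundtrip = pd.read_csv(buf)

# The same filter as in the cell above removes it again.
cleaned = roundtrip.loc[:, ~roundtrip.columns.str.contains('^Unnamed')]
```

Writing with `index_label='SK_ID_CURR'` and reading with `index_col='SK_ID_CURR'`, as this notebook does, avoids the problem for newly saved files.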


In [28]:
# Merge data together
fm5_df4 = merge_data_on_index(fm5, df4, how='inner')
fm5_df4_new_features = merge_data_on_index(fm5, df4_new_features, how='inner')

First data input shape: (30751, 1258)
Second data input shape: (30751, 161)
Combined data shape: (30751, 1419)
First data input shape: (30751, 1258)
Second data input shape: (30751, 214)
Combined data shape: (30751, 1472)
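
`merge_data_on_index` is another helper defined outside this section. A minimal sketch, assuming it simply joins two frames on their shared index and reports the shapes (the body is an assumption; the printed messages mirror the output above):

```python
import pandas as pd


def merge_data_on_index(left, right, how="inner"):
    """Hypothetical sketch: join two frames on their index and report shapes."""
    print(f"First data input shape: {left.shape}")
    print(f"Second data input shape: {right.shape}")
    combined = left.merge(right, left_index=True, right_index=True, how=how)
    print(f"Combined data shape: {combined.shape}")
    return combined


# Made-up frames: only index values 20 and 30 appear in both
a = pd.DataFrame({"x": [1, 2, 3]}, index=[10, 20, 30])
b = pd.DataFrame({"y": [4, 5]}, index=[20, 30])
merged = merge_data_on_index(a, b, how="inner")
```

In the output above the row count stays at 30751 and the column counts add up (1258 + 161 = 1419 and 1258 + 214 = 1472), which is consistent with every `SK_ID_CURR` appearing in both inputs.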


To improve computational efficiency and reduce noise, we’ll use Factor Analysis of Mixed Data (FAMD) to reduce the dataset’s dimensionality. We’ll keep adding components until they capture 70% of the cumulative variance, then use those components for model experimentation.

FAMD suits this dataset because it handles numerical and categorical variables together. The leading components retain most of the variance, so redundant or noisy features are filtered out and the resulting dataset is smaller and cheaper to train on.
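
To make the "mixed data" part concrete, here is a rough sketch of the usual FAMD preprocessing: numeric columns are standardized, categorical columns are one-hot encoded and each indicator is scaled by one over the square root of its frequency, and an ordinary PCA is run on the result. This is not the notebook's `preprocess_and_reduce_features` (which is not shown); `famd_matrix`, `explained_variance_ratio`, and the demo data are hypothetical names and values used only for illustration.

```python
import numpy as np
import pandas as pd


def famd_matrix(df):
    """Build the matrix on which FAMD runs an ordinary PCA (illustrative only)."""
    num = df.select_dtypes(include="number")
    cat = df.select_dtypes(exclude="number")
    # Numeric columns: zero mean, unit variance
    num_z = (num - num.mean()) / num.std(ddof=0)
    # Categorical columns: one-hot, scale each indicator by 1/sqrt(frequency), then center
    dummies = pd.get_dummies(cat).astype(float)
    scaled = dummies / np.sqrt(dummies.mean())
    scaled = scaled - scaled.mean()
    return pd.concat([num_z, scaled], axis=1)


def explained_variance_ratio(matrix):
    # Squared singular values are proportional to the variance along each component
    s = np.linalg.svd(matrix.to_numpy(), compute_uv=False)
    return s**2 / np.sum(s**2)


demo = pd.DataFrame({
    "amount": [1.0, 2.0, 3.0, 4.0],
    "kind": pd.Categorical(["a", "b", "a", "b"]),
})
m = famd_matrix(demo)
ratio = explained_variance_ratio(m)
```

With this scaling, a categorical variable with K levels contributes K - 1 units of variance in total, so no single high-cardinality column dominates the numeric ones.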

As with any dimensionality reduction method, FAMD comes with trade-offs. With a 70% variance threshold we discard about 30% of the variance, so some information is lost. The components are also combinations of many original features, which makes them harder to interpret. For model experimentation, we accept these drawbacks in exchange for a smaller, faster dataset.
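
The 70% rule described above can be sketched as follows: given the explained-variance ratio of each component, pick the smallest number of components whose cumulative share reaches the target. `components_for_target` is a hypothetical helper and the ratios are made-up values, not results from this notebook.

```python
import numpy as np


def components_for_target(explained_ratio, target_variance=0.70):
    """Smallest k whose cumulative explained variance reaches the target."""
    cumulative = np.cumsum(explained_ratio)
    # searchsorted finds the first position where cumulative >= target
    k = int(np.searchsorted(cumulative, target_variance, side="left")) + 1
    # If the target is never reached, fall back to all components
    return min(k, len(cumulative)), cumulative


ratios = np.array([0.40, 0.20, 0.15, 0.10, 0.10, 0.05])  # made-up values
k, cumulative = components_for_target(ratios, target_variance=0.70)
print(f"Number of Components: {k}")
print(f"Cumulative Variance: {cumulative[k - 1]}")
```

The cell output below suggests the notebook's helper instead starts from `n_components=120` and adds one component at a time until the target is met. That is the same stopping rule, applied incrementally.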


In [43]:
import time

# Record start time
start_time = time.time()
reduced_df = preprocess_and_reduce_features(fm5_df4, n_components=120, target_variance=0.70) 
# Record end time
end_time = time.time()

# Calculate and print the elapsed time
elapsed_time = end_time - start_time
print(f"Elapsed time: {elapsed_time:.2f} seconds")


Number of Components: 120
Cumulative Variance: 0.6273510224734017
Number of Components: 121
Cumulative Variance: 0.6296745975252099
Number of Components: 122
Cumulative Variance: 0.6319613449832183
Number of Components: 123
Cumulative Variance: 0.6340994295835857
Number of Components: 124
Cumulative Variance: 0.6359731590012577
Number of Components: 125
Cumulative Variance: 0.6387124750466693
Number of Components: 126
Cumulative Variance: 0.6411434534318146
Number of Components: 127
Cumulative Variance: 0.6431347808238193
Number of Components: 128
Cumulative Variance: 0.6452237970355311
Number of Components: 129
Cumulative Variance: 0.6475538249060243
Number of Components: 130
Cumulative Variance: 0.649654452602891
Number of Components: 131
Cumulative Variance: 0.6519035047971888
Number of Components: 132
Cumulative Variance: 0.6537936668121904
Number of Components: 133
Cumulative Variance: 0.6560990941880118
Number of Components: 134
Cumulative Variance: 0.658379114423537
Number of Co

In [44]:

start_time = time.time()
reduced_df_new_features = preprocess_and_reduce_features(fm5_df4_new_features, n_components=120, target_variance=0.70)
end_time = time.time()

elapsed_time = end_time - start_time
print(f"Elapsed time: {elapsed_time:.2f} seconds")


Number of Components: 120
Cumulative Variance: 0.6161041029129454
Number of Components: 121
Cumulative Variance: 0.6179054491771163
Number of Components: 122
Cumulative Variance: 0.6210253134796486
Number of Components: 123
Cumulative Variance: 0.6228331021986655
Number of Components: 124
Cumulative Variance: 0.6249560165683875
Number of Components: 125
Cumulative Variance: 0.6271415961498457
Number of Components: 126
Cumulative Variance: 0.6297655022767376
Number of Components: 127
Cumulative Variance: 0.631906429756533
Number of Components: 128
Cumulative Variance: 0.6337998708016509
Number of Components: 129
Cumulative Variance: 0.6359125101222657
Number of Components: 130
Cumulative Variance: 0.6383061166713427
Number of Components: 131
Cumulative Variance: 0.640682410910005
Number of Components: 132
Cumulative Variance: 0.6423867533169529
Number of Components: 133
Cumulative Variance: 0.6445124239891681
Number of Components: 134
Cumulative Variance: 0.6467248308149817
Number of Co

In [49]:
# Merge the reduced components with the 60 top-correlated features (df3, df3_new_features)
df3_reduced_df = merge_data_on_index(df3, reduced_df, how='inner')
df3_reduced_df_new_features = merge_data_on_index(df3_new_features, reduced_df_new_features, how='inner')

First data input shape: (30751, 60)
Second data input shape: (30751, 154)
Combined data shape: (30751, 214)
First data input shape: (30751, 60)
Second data input shape: (30751, 160)
Combined data shape: (30751, 220)


In [51]:
# Saving the data for model implementation
df3_reduced_df.to_csv(f'{config.CleanDataPath}df3_reduced_df.csv', index=True, index_label='SK_ID_CURR')
df3_reduced_df_new_features.to_csv(f'{config.CleanDataPath}df3_reduced_df_new_features.csv', index=True, index_label='SK_ID_CURR')
