<div style=" border-bottom: 8px solid #e3f56c; overflow: hidden; border-radius: 10px; height: 95%; width: 100%; display: flex;">
  <div style="height: 100%; width: 100%; background-color: #3800BB; float: left; text-align: center; display: flex; justify-content: left; align-items: center; font-size: 40px; ">
    <b><span style="color: #FFFFFF; padding: 20px 20px;">Santander Customer Satisfaction: Basic Preprocessing</span></b>
  </div>
</div>

<div class="alert" style="background-color: #FFFFFF; border-left: 8px solid #B12111; padding: 14px; border-radius: 8px; font-size: 14px; color: #000000;">

### **This notebook performs the initial data preprocessing for the [Santander Customer Satisfaction competition](https://www.kaggle.com/c/santander-customer-satisfaction/overview).**

**The main steps include:**

<div class="alert alert-danger">

**Workflow Overview**
</div>


1. **Import Libraries and Setup**: Load required Python libraries and configure the environment.
2. **Load Data**: Read the raw training and test datasets from CSV files.
3. **Initial Cleaning**: Remove constant columns and duplicate rows from the datasets.
4. **Feature Intersection**: Ensure that both train and test datasets have the same set of features.
5. **Low-Variance Feature Removal**: Identify and drop features whose variance falls below a chosen threshold, since near-constant features add little value to modeling.
6. **Missing Value Analysis**: Check for missing values in the cleaned training data.
7. **Correlation Analysis**: Identify highly correlated features that may be redundant.
8. **Save Cleaned Data**: Store the cleaned datasets for further analysis and modeling.
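
Steps 6 and 7 can be sketched on a toy frame as follows. This is a minimal illustration, not the notebook's own implementation: the helper names `missing_value_summary` and `highly_correlated_pairs`, the toy data, and the 0.95 threshold are all assumptions made for the example.

```python
import numpy as np
import pandas as pd


def missing_value_summary(df: pd.DataFrame) -> pd.Series:
    """Count missing values per column, keeping only columns that have any."""
    counts = df.isna().sum()
    return counts[counts > 0].sort_values(ascending=False)


def highly_correlated_pairs(df: pd.DataFrame, threshold: float = 0.95) -> list:
    """List (col_a, col_b, |corr|) for pairs whose absolute Pearson correlation exceeds threshold."""
    corr = df.corr().abs()
    # Keep only the strict upper triangle so each pair is reported once.
    mask = np.triu(np.ones(corr.shape, dtype=bool), k=1)
    pairs = corr.where(mask).stack()
    # NaN entries (masked cells) compare as False and are dropped here.
    pairs = pairs[pairs > threshold]
    return [(a, b, float(v)) for (a, b), v in pairs.items()]


# Hypothetical toy data: "b" is an exact multiple of "a"; "d" has one missing value.
toy = pd.DataFrame({
    "a": [1.0, 2.0, 3.0, 4.0],
    "b": [2.0, 4.0, 6.0, 8.0],
    "c": [1.0, -1.0, 1.0, -1.0],
    "d": [1.0, np.nan, 3.0, 4.0],
})

missing = missing_value_summary(toy)
pairs = highly_correlated_pairs(toy[["a", "b", "c"]])
print(missing)
print(pairs)
```

Reporting only the upper triangle avoids listing each pair twice and skips the trivial self-correlations on the diagonal.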


<div class="alert alert-danger">

**Key Functions and Steps**
</div>

- `data_utils.load_data`: Loads CSV data into a pandas DataFrame.
- `data_utils.initial_cleaning`: Removes constant columns and duplicate rows.
- `data_utils.intersect_features`: Ensures feature consistency between train and test sets.
- `VarianceThreshold`: Used to remove low-variance features.
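
As a rough sketch of how `VarianceThreshold` can be applied, the example below fits the selector on the training frame only and then keeps the same columns in the test frame. The helper name `drop_low_variance`, the toy data, and the `0.01` threshold are assumptions for illustration; the notebook's actual threshold is not stated in this section.

```python
import pandas as pd
from sklearn.feature_selection import VarianceThreshold


def drop_low_variance(train: pd.DataFrame, test: pd.DataFrame, threshold: float = 0.01):
    """Keep only the columns whose training-set variance exceeds threshold."""
    selector = VarianceThreshold(threshold=threshold)
    selector.fit(train)  # fit on train only, then reuse the mask for test
    keep = train.columns[selector.get_support()]
    return train[keep], test[keep]


# Hypothetical toy frames with one constant and one near-constant column.
toy_train = pd.DataFrame({
    "constant": [1.0, 1.0, 1.0, 1.0],
    "almost_constant": [0.0, 0.0, 0.0, 0.1],
    "informative": [1.0, 5.0, 2.0, 8.0],
})
toy_test = pd.DataFrame({
    "constant": [1.0, 1.0],
    "almost_constant": [0.0, 0.0],
    "informative": [3.0, 7.0],
})

reduced_train, reduced_test = drop_low_variance(toy_train, toy_test)
print(list(reduced_train.columns))
```

Fitting on the training data alone and reusing the resulting column mask keeps both frames aligned and avoids using test-set statistics during selection.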


<div class="alert alert-danger">

**Outputs**
</div>

- Cleaned train and test datasets saved to the `data/processed/` directory.
- Summary statistics and information about features removed or flagged for further analysis.
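
Writing the cleaned frames to disk might look like the sketch below. The helper `save_processed` and the file name `train_clean.csv` are hypothetical, and a temporary directory stands in for the project's `data/processed/` folder so the example is self-contained.

```python
import tempfile
from pathlib import Path

import pandas as pd


def save_processed(df: pd.DataFrame, directory, filename: str) -> Path:
    """Write df as CSV under directory, creating the directory if needed."""
    out_dir = Path(directory)
    out_dir.mkdir(parents=True, exist_ok=True)
    out_path = out_dir / filename
    # index=False avoids an extra unnamed index column when the file is re-read.
    df.to_csv(out_path, index=False)
    return out_path


with tempfile.TemporaryDirectory() as tmp:
    toy = pd.DataFrame({"ID": [1, 2], "var15": [23, 34]})
    saved = save_processed(toy, Path(tmp) / "data" / "processed", "train_clean.csv")
    reloaded = pd.read_csv(saved)
    round_trip_ok = reloaded.equals(toy)

print(round_trip_ok)
```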

</div>


---

<div class="alert alert-warning">

**Note:** This notebook is intended as a starting point for feature engineering and model development. Further steps may include advanced feature selection, handling missing values, and building predictive models.
</div>

<div class="alert" style="background-color: #FEF9E7; border-left: 8px solid #D4AC0D; padding: 14px; border-radius: 8px; font-size: 14px; color: #000000;">

### **Import Libraries and Setup**

</div>

In [1]:
import os
import sys
import warnings

import numpy as np
import pandas as pd
from sklearn.feature_selection import VarianceThreshold

# Add the parent directory to the Python path so the project modules resolve
sys.path.append(os.path.abspath('..'))

from classes import Paths
import src.data_utils as data_utils
from src.data_utils import setup_logging

# Show floats with thousands separators and two decimals; display every column
pd.set_option("display.float_format", "{:,.2f}".format)
pd.set_option("display.max_columns", None)

# Silence all warnings to keep the notebook output readable
warnings.filterwarnings("ignore")

# Configure logging for the data_utils helpers
setup_logging()

<div class="alert" style="background-color: #FEF9E7; border-left: 8px solid #D4AC0D; padding: 14px; border-radius: 8px; font-size: 14px; color: #000000;">

### **Load Data**

</div>

In [2]:
paths = Paths()
train_data = data_utils.load_data(paths.get_raw_data_path("train.csv"))
test_data = data_utils.load_data(paths.get_raw_data_path("test.csv"))


display(train_data.head())
print("Train shape: {}".format(train_data.shape))
display(test_data.head())
print("Test shape: {}".format(test_data.shape))

Unnamed: 0,ID,var3,var15,imp_ent_var16_ult1,imp_op_var39_comer_ult1,imp_op_var39_comer_ult3,imp_op_var40_comer_ult1,imp_op_var40_comer_ult3,imp_op_var40_efect_ult1,imp_op_var40_efect_ult3,imp_op_var40_ult1,imp_op_var41_comer_ult1,imp_op_var41_comer_ult3,imp_op_var41_efect_ult1,imp_op_var41_efect_ult3,imp_op_var41_ult1,imp_op_var39_efect_ult1,imp_op_var39_efect_ult3,imp_op_var39_ult1,imp_sal_var16_ult1,ind_var1_0,ind_var1,ind_var2_0,ind_var2,ind_var5_0,ind_var5,ind_var6_0,ind_var6,ind_var8_0,ind_var8,ind_var12_0,ind_var12,ind_var13_0,ind_var13_corto_0,ind_var13_corto,ind_var13_largo_0,ind_var13_largo,ind_var13_medio_0,ind_var13_medio,ind_var13,ind_var14_0,ind_var14,ind_var17_0,ind_var17,ind_var18_0,ind_var18,ind_var19,ind_var20_0,ind_var20,ind_var24_0,ind_var24,ind_var25_cte,ind_var26_0,ind_var26_cte,ind_var26,ind_var25_0,ind_var25,ind_var27_0,ind_var28_0,ind_var28,ind_var27,ind_var29_0,ind_var29,ind_var30_0,ind_var30,ind_var31_0,ind_var31,ind_var32_cte,ind_var32_0,ind_var32,ind_var33_0,ind_var33,ind_var34_0,ind_var34,ind_var37_cte,ind_var37_0,ind_var37,ind_var39_0,ind_var40_0,ind_var40,ind_var41_0,ind_var41,ind_var39,ind_var44_0,ind_var44,ind_var46_0,ind_var46,num_var1_0,num_var1,num_var4,num_var5_0,num_var5,num_var6_0,num_var6,num_var8_0,num_var8,num_var12_0,num_var12,num_var13_0,num_var13_corto_0,num_var13_corto,num_var13_largo_0,num_var13_largo,num_var13_medio_0,num_var13_medio,num_var13,num_var14_0,num_var14,num_var17_0,num_var17,num_var18_0,num_var18,num_var20_0,num_var20,num_var24_0,num_var24,num_var26_0,num_var26,num_var25_0,num_var25,num_op_var40_hace2,num_op_var40_hace3,num_op_var40_ult1,num_op_var40_ult3,num_op_var41_hace2,num_op_var41_hace3,num_op_var41_ult1,num_op_var41_ult3,num_op_var39_hace2,num_op_var39_hace3,num_op_var39_ult1,num_op_var39_ult3,num_var27_0,num_var28_0,num_var28,num_var27,num_var29_0,num_var29,num_var30_0,num_var30,num_var31_0,num_var31,num_var32_0,num_var32,num_var33_0,num_var33,num_var34_0,num_var34,num_var35,num_var37_med_ult2,num_v
ar37_0,num_var37,num_var39_0,num_var40_0,num_var40,num_var41_0,num_var41,num_var39,num_var42_0,num_var42,num_var44_0,num_var44,num_var46_0,num_var46,saldo_var1,saldo_var5,saldo_var6,saldo_var8,saldo_var12,saldo_var13_corto,saldo_var13_largo,saldo_var13_medio,saldo_var13,saldo_var14,saldo_var17,saldo_var18,saldo_var20,saldo_var24,saldo_var26,saldo_var25,saldo_var28,saldo_var27,saldo_var29,saldo_var30,saldo_var31,saldo_var32,saldo_var33,saldo_var34,saldo_var37,saldo_var40,saldo_var41,saldo_var42,saldo_var44,saldo_var46,var36,delta_imp_amort_var18_1y3,delta_imp_amort_var34_1y3,delta_imp_aport_var13_1y3,delta_imp_aport_var17_1y3,delta_imp_aport_var33_1y3,delta_imp_compra_var44_1y3,delta_imp_reemb_var13_1y3,delta_imp_reemb_var17_1y3,delta_imp_reemb_var33_1y3,delta_imp_trasp_var17_in_1y3,delta_imp_trasp_var17_out_1y3,delta_imp_trasp_var33_in_1y3,delta_imp_trasp_var33_out_1y3,delta_imp_venta_var44_1y3,delta_num_aport_var13_1y3,delta_num_aport_var17_1y3,delta_num_aport_var33_1y3,delta_num_compra_var44_1y3,delta_num_reemb_var13_1y3,delta_num_reemb_var17_1y3,delta_num_reemb_var33_1y3,delta_num_trasp_var17_in_1y3,delta_num_trasp_var17_out_1y3,delta_num_trasp_var33_in_1y3,delta_num_trasp_var33_out_1y3,delta_num_venta_var44_1y3,imp_amort_var18_hace3,imp_amort_var18_ult1,imp_amort_var34_hace3,imp_amort_var34_ult1,imp_aport_var13_hace3,imp_aport_var13_ult1,imp_aport_var17_hace3,imp_aport_var17_ult1,imp_aport_var33_hace3,imp_aport_var33_ult1,imp_var7_emit_ult1,imp_var7_recib_ult1,imp_compra_var44_hace3,imp_compra_var44_ult1,imp_reemb_var13_hace3,imp_reemb_var13_ult1,imp_reemb_var17_hace3,imp_reemb_var17_ult1,imp_reemb_var33_hace3,imp_reemb_var33_ult1,imp_var43_emit_ult1,imp_trans_var37_ult1,imp_trasp_var17_in_hace3,imp_trasp_var17_in_ult1,imp_trasp_var17_out_hace3,imp_trasp_var17_out_ult1,imp_trasp_var33_in_hace3,imp_trasp_var33_in_ult1,imp_trasp_var33_out_hace3,imp_trasp_var33_out_ult1,imp_venta_var44_hace3,imp_venta_var44_ult1,ind_var7_emit_ult1,ind_var7_recib_ult1,ind_var10_ult1
,ind_var10cte_ult1,ind_var9_cte_ult1,ind_var9_ult1,ind_var43_emit_ult1,ind_var43_recib_ult1,var21,num_var2_0_ult1,num_var2_ult1,num_aport_var13_hace3,num_aport_var13_ult1,num_aport_var17_hace3,num_aport_var17_ult1,num_aport_var33_hace3,num_aport_var33_ult1,num_var7_emit_ult1,num_var7_recib_ult1,num_compra_var44_hace3,num_compra_var44_ult1,num_ent_var16_ult1,num_var22_hace2,num_var22_hace3,num_var22_ult1,num_var22_ult3,num_med_var22_ult3,num_med_var45_ult3,num_meses_var5_ult3,num_meses_var8_ult3,num_meses_var12_ult3,num_meses_var13_corto_ult3,num_meses_var13_largo_ult3,num_meses_var13_medio_ult3,num_meses_var17_ult3,num_meses_var29_ult3,num_meses_var33_ult3,num_meses_var39_vig_ult3,num_meses_var44_ult3,num_op_var39_comer_ult1,num_op_var39_comer_ult3,num_op_var40_comer_ult1,num_op_var40_comer_ult3,num_op_var40_efect_ult1,num_op_var40_efect_ult3,num_op_var41_comer_ult1,num_op_var41_comer_ult3,num_op_var41_efect_ult1,num_op_var41_efect_ult3,num_op_var39_efect_ult1,num_op_var39_efect_ult3,num_reemb_var13_hace3,num_reemb_var13_ult1,num_reemb_var17_hace3,num_reemb_var17_ult1,num_reemb_var33_hace3,num_reemb_var33_ult1,num_sal_var16_ult1,num_var43_emit_ult1,num_var43_recib_ult1,num_trasp_var11_ult1,num_trasp_var17_in_hace3,num_trasp_var17_in_ult1,num_trasp_var17_out_hace3,num_trasp_var17_out_ult1,num_trasp_var33_in_hace3,num_trasp_var33_in_ult1,num_trasp_var33_out_hace3,num_trasp_var33_out_ult1,num_venta_var44_hace3,num_venta_var44_ult1,num_var45_hace2,num_var45_hace3,num_var45_ult1,num_var45_ult3,saldo_var2_ult1,saldo_medio_var5_hace2,saldo_medio_var5_hace3,saldo_medio_var5_ult1,saldo_medio_var5_ult3,saldo_medio_var8_hace2,saldo_medio_var8_hace3,saldo_medio_var8_ult1,saldo_medio_var8_ult3,saldo_medio_var12_hace2,saldo_medio_var12_hace3,saldo_medio_var12_ult1,saldo_medio_var12_ult3,saldo_medio_var13_corto_hace2,saldo_medio_var13_corto_hace3,saldo_medio_var13_corto_ult1,saldo_medio_var13_corto_ult3,saldo_medio_var13_largo_hace2,saldo_medio_var13_largo_hace3,saldo_medio_var13_
largo_ult1,saldo_medio_var13_largo_ult3,saldo_medio_var13_medio_hace2,saldo_medio_var13_medio_hace3,saldo_medio_var13_medio_ult1,saldo_medio_var13_medio_ult3,saldo_medio_var17_hace2,saldo_medio_var17_hace3,saldo_medio_var17_ult1,saldo_medio_var17_ult3,saldo_medio_var29_hace2,saldo_medio_var29_hace3,saldo_medio_var29_ult1,saldo_medio_var29_ult3,saldo_medio_var33_hace2,saldo_medio_var33_hace3,saldo_medio_var33_ult1,saldo_medio_var33_ult3,saldo_medio_var44_hace2,saldo_medio_var44_hace3,saldo_medio_var44_ult1,saldo_medio_var44_ult3,var38,TARGET
0,1,2,23,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,0,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,0,0,0,0,0,0,0,0,0,0,0,0,0,3,0,0,3,0,0,3,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0,0.0,0.0,0,99,0,0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0,0.0,0,0.0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0.0,0.0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,2,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,39205.17,0
1,3,2,34,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,1,0,0,0,0,0,0,0,1,1,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,1,3,0,0,0,0,0,0,0,3,3,3,0,0,0,0,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,6,3,0,0,0,0,0,0,0,0,3,0,0,0,3,0,0,3,0,0,3,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,300.0,0.0,0,300.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0,0,0.0,300.0,0.0,0.0,0.0,0,0.0,0.0,0,0.0,0.0,0,3,0,0,-1.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,-1.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0,0.0,0,0.0,300.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0.0,0.0,0,0,0,0,0,0,0,0,0,0,0,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,3,0,0,0,0,0,2,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.0,88.89,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,300.0,122.22,300.0,240.75,0.0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,49278.03,0
2,4,2,23,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,1,3,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,3,0,0,0,0,0,0,0,0,3,0,0,0,3,0,0,3,0,0,3,3,0,0,0,0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0,0,0.0,3.0,0.0,0.0,0.0,0,0.0,0.0,0,3.0,0.0,0,99,0,0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0,0.0,0,0.0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0.0,0.0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3.0,0.18,3.0,2.07,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,67333.77,0
3,8,2,37,0.0,195.0,195.0,0.0,0.0,0.0,0.0,0.0,195.0,195.0,0.0,0.0,195.0,0.0,0.0,195.0,0.0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,1,1,1,1,0,0,1,0,0,0,0,0,0,0,0,3,3,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,9,9,0,0,9,9,0,0,0,0,0,0,3,3,0,0,0,0,0,0,0,0,9,6,6,6,3,0,0,3,0,0,3,3,0,0,0,0,0.0,70.62,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0,0,0.0,70.62,0.0,0.0,0.0,0,34.95,0.0,0,70.62,0.0,0,2,0,0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0,0.0,0,0.0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0.0,0.0,0,0,1,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,0,0,3,0,15,2,0,0,0,0,0,0,0,0,1,0,9,9,0,0,0,0,9,9,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,27,3,18,48,0,186.09,0.0,91.56,138.84,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,64007.97,0
4,10,2,39,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,1,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,1,3,0,0,0,0,0,3,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,6,3,0,0,0,0,0,0,0,0,3,0,0,0,3,0,0,3,0,0,6,3,0,0,0,0,0.0,0.0,0.0,0.0,135003.0,0.0,0.0,0,0.0,0.0,0.0,0,0.0,135003.0,0.0,0.0,0,0,0.0,135003.0,0.0,0.0,0.0,0,0.0,0.0,0,135003.0,0.0,0,1,0,0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0,0.0,0,0.0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,135003.0,270003.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0.0,0.0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,0,6,9,3,0,3,0,1,0,0,0,0,0,0,2,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,6,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3.0,0.3,40501.08,13501.47,0.0,0.0,0.0,0.0,0.0,0.0,85501.89,85501.89,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,117310.98,0


Train shape: (76020, 371)


Unnamed: 0,ID,var3,var15,imp_ent_var16_ult1,imp_op_var39_comer_ult1,imp_op_var39_comer_ult3,imp_op_var40_comer_ult1,imp_op_var40_comer_ult3,imp_op_var40_efect_ult1,imp_op_var40_efect_ult3,imp_op_var40_ult1,imp_op_var41_comer_ult1,imp_op_var41_comer_ult3,imp_op_var41_efect_ult1,imp_op_var41_efect_ult3,imp_op_var41_ult1,imp_op_var39_efect_ult1,imp_op_var39_efect_ult3,imp_op_var39_ult1,imp_sal_var16_ult1,ind_var1_0,ind_var1,ind_var2_0,ind_var2,ind_var5_0,ind_var5,ind_var6_0,ind_var6,ind_var8_0,ind_var8,ind_var12_0,ind_var12,ind_var13_0,ind_var13_corto_0,ind_var13_corto,ind_var13_largo_0,ind_var13_largo,ind_var13_medio_0,ind_var13_medio,ind_var13,ind_var14_0,ind_var14,ind_var17_0,ind_var17,ind_var18_0,ind_var18,ind_var19,ind_var20_0,ind_var20,ind_var24_0,ind_var24,ind_var25_cte,ind_var26_0,ind_var26_cte,ind_var26,ind_var25_0,ind_var25,ind_var27_0,ind_var28_0,ind_var28,ind_var27,ind_var29_0,ind_var29,ind_var30_0,ind_var30,ind_var31_0,ind_var31,ind_var32_cte,ind_var32_0,ind_var32,ind_var33_0,ind_var33,ind_var34_0,ind_var34,ind_var37_cte,ind_var37_0,ind_var37,ind_var39_0,ind_var40_0,ind_var40,ind_var41_0,ind_var41,ind_var39,ind_var44_0,ind_var44,ind_var46_0,ind_var46,num_var1_0,num_var1,num_var4,num_var5_0,num_var5,num_var6_0,num_var6,num_var8_0,num_var8,num_var12_0,num_var12,num_var13_0,num_var13_corto_0,num_var13_corto,num_var13_largo_0,num_var13_largo,num_var13_medio_0,num_var13_medio,num_var13,num_var14_0,num_var14,num_var17_0,num_var17,num_var18_0,num_var18,num_var20_0,num_var20,num_var24_0,num_var24,num_var26_0,num_var26,num_var25_0,num_var25,num_op_var40_hace2,num_op_var40_hace3,num_op_var40_ult1,num_op_var40_ult3,num_op_var41_hace2,num_op_var41_hace3,num_op_var41_ult1,num_op_var41_ult3,num_op_var39_hace2,num_op_var39_hace3,num_op_var39_ult1,num_op_var39_ult3,num_var27_0,num_var28_0,num_var28,num_var27,num_var29_0,num_var29,num_var30_0,num_var30,num_var31_0,num_var31,num_var32_0,num_var32,num_var33_0,num_var33,num_var34_0,num_var34,num_var35,num_var37_med_ult2,num_v
ar37_0,num_var37,num_var39_0,num_var40_0,num_var40,num_var41_0,num_var41,num_var39,num_var42_0,num_var42,num_var44_0,num_var44,num_var46_0,num_var46,saldo_var1,saldo_var5,saldo_var6,saldo_var8,saldo_var12,saldo_var13_corto,saldo_var13_largo,saldo_var13_medio,saldo_var13,saldo_var14,saldo_var17,saldo_var18,saldo_var20,saldo_var24,saldo_var26,saldo_var25,saldo_var28,saldo_var27,saldo_var29,saldo_var30,saldo_var31,saldo_var32,saldo_var33,saldo_var34,saldo_var37,saldo_var40,saldo_var41,saldo_var42,saldo_var44,saldo_var46,var36,delta_imp_amort_var18_1y3,delta_imp_amort_var34_1y3,delta_imp_aport_var13_1y3,delta_imp_aport_var17_1y3,delta_imp_aport_var33_1y3,delta_imp_compra_var44_1y3,delta_imp_reemb_var13_1y3,delta_imp_reemb_var17_1y3,delta_imp_reemb_var33_1y3,delta_imp_trasp_var17_in_1y3,delta_imp_trasp_var17_out_1y3,delta_imp_trasp_var33_in_1y3,delta_imp_trasp_var33_out_1y3,delta_imp_venta_var44_1y3,delta_num_aport_var13_1y3,delta_num_aport_var17_1y3,delta_num_aport_var33_1y3,delta_num_compra_var44_1y3,delta_num_reemb_var13_1y3,delta_num_reemb_var17_1y3,delta_num_reemb_var33_1y3,delta_num_trasp_var17_in_1y3,delta_num_trasp_var17_out_1y3,delta_num_trasp_var33_in_1y3,delta_num_trasp_var33_out_1y3,delta_num_venta_var44_1y3,imp_amort_var18_hace3,imp_amort_var18_ult1,imp_amort_var34_hace3,imp_amort_var34_ult1,imp_aport_var13_hace3,imp_aport_var13_ult1,imp_aport_var17_hace3,imp_aport_var17_ult1,imp_aport_var33_hace3,imp_aport_var33_ult1,imp_var7_emit_ult1,imp_var7_recib_ult1,imp_compra_var44_hace3,imp_compra_var44_ult1,imp_reemb_var13_hace3,imp_reemb_var13_ult1,imp_reemb_var17_hace3,imp_reemb_var17_ult1,imp_reemb_var33_hace3,imp_reemb_var33_ult1,imp_var43_emit_ult1,imp_trans_var37_ult1,imp_trasp_var17_in_hace3,imp_trasp_var17_in_ult1,imp_trasp_var17_out_hace3,imp_trasp_var17_out_ult1,imp_trasp_var33_in_hace3,imp_trasp_var33_in_ult1,imp_trasp_var33_out_hace3,imp_trasp_var33_out_ult1,imp_venta_var44_hace3,imp_venta_var44_ult1,ind_var7_emit_ult1,ind_var7_recib_ult1,ind_var10_ult1
,ind_var10cte_ult1,ind_var9_cte_ult1,ind_var9_ult1,ind_var43_emit_ult1,ind_var43_recib_ult1,var21,num_var2_0_ult1,num_var2_ult1,num_aport_var13_hace3,num_aport_var13_ult1,num_aport_var17_hace3,num_aport_var17_ult1,num_aport_var33_hace3,num_aport_var33_ult1,num_var7_emit_ult1,num_var7_recib_ult1,num_compra_var44_hace3,num_compra_var44_ult1,num_ent_var16_ult1,num_var22_hace2,num_var22_hace3,num_var22_ult1,num_var22_ult3,num_med_var22_ult3,num_med_var45_ult3,num_meses_var5_ult3,num_meses_var8_ult3,num_meses_var12_ult3,num_meses_var13_corto_ult3,num_meses_var13_largo_ult3,num_meses_var13_medio_ult3,num_meses_var17_ult3,num_meses_var29_ult3,num_meses_var33_ult3,num_meses_var39_vig_ult3,num_meses_var44_ult3,num_op_var39_comer_ult1,num_op_var39_comer_ult3,num_op_var40_comer_ult1,num_op_var40_comer_ult3,num_op_var40_efect_ult1,num_op_var40_efect_ult3,num_op_var41_comer_ult1,num_op_var41_comer_ult3,num_op_var41_efect_ult1,num_op_var41_efect_ult3,num_op_var39_efect_ult1,num_op_var39_efect_ult3,num_reemb_var13_hace3,num_reemb_var13_ult1,num_reemb_var17_hace3,num_reemb_var17_ult1,num_reemb_var33_hace3,num_reemb_var33_ult1,num_sal_var16_ult1,num_var43_emit_ult1,num_var43_recib_ult1,num_trasp_var11_ult1,num_trasp_var17_in_hace3,num_trasp_var17_in_ult1,num_trasp_var17_out_hace3,num_trasp_var17_out_ult1,num_trasp_var33_in_hace3,num_trasp_var33_in_ult1,num_trasp_var33_out_hace3,num_trasp_var33_out_ult1,num_venta_var44_hace3,num_venta_var44_ult1,num_var45_hace2,num_var45_hace3,num_var45_ult1,num_var45_ult3,saldo_var2_ult1,saldo_medio_var5_hace2,saldo_medio_var5_hace3,saldo_medio_var5_ult1,saldo_medio_var5_ult3,saldo_medio_var8_hace2,saldo_medio_var8_hace3,saldo_medio_var8_ult1,saldo_medio_var8_ult3,saldo_medio_var12_hace2,saldo_medio_var12_hace3,saldo_medio_var12_ult1,saldo_medio_var12_ult3,saldo_medio_var13_corto_hace2,saldo_medio_var13_corto_hace3,saldo_medio_var13_corto_ult1,saldo_medio_var13_corto_ult3,saldo_medio_var13_largo_hace2,saldo_medio_var13_largo_hace3,saldo_medio_var13_
largo_ult1,saldo_medio_var13_largo_ult3,saldo_medio_var13_medio_hace2,saldo_medio_var13_medio_hace3,saldo_medio_var13_medio_ult1,saldo_medio_var13_medio_ult3,saldo_medio_var17_hace2,saldo_medio_var17_hace3,saldo_medio_var17_ult1,saldo_medio_var17_ult3,saldo_medio_var29_hace2,saldo_medio_var29_hace3,saldo_medio_var29_ult1,saldo_medio_var29_ult3,saldo_medio_var33_hace2,saldo_medio_var33_hace3,saldo_medio_var33_ult1,saldo_medio_var33_ult3,saldo_medio_var44_hace2,saldo_medio_var44_hace3,saldo_medio_var44_ult1,saldo_medio_var44_ult3,var38
0,2,2,32,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,1,3,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,3,0,0,0,0,0,0,0,0,3,0,0,0,3,0,0,3,0,0,3,3,0,0,0,0,0.0,6.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0,0,0.0,6.0,0.0,0.0,0.0,0,0.0,0.0,0,6.0,0.0,0,3,0,0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0,0.0,0,0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0,0.0,0.0,0.0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,0,3,0,0,3,0,0,0,0,0,0,0,0,2,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,6.0,2.43,6.0,4.8,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,40532.1
1,5,2,35,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,3,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,3,0,0,0,0,0,0,0,0,3,0,0,0,0,0,0,0,0,0,3,3,0,0,0,0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0,0,0.0,3.0,0.0,0.0,0.0,0,0.0,0.0,0,3.0,0.0,0,3,0,0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0,0.0,0,0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0,0.0,0.0,0.0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,0,3,0,0,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,6,0,6,0,3.0,2.55,3.0,2.85,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,45486.72
2,6,2,23,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,60.0,60.0,60.0,60.0,60.0,60.0,0.0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,2,3,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,3,0,0,3,3,0,0,0,0,0,0,3,3,0,0,0,0,0,0,0,0,6,0,0,0,3,0,0,3,0,0,3,3,0,0,0,0,0.0,30.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0,0,0.0,30.0,0.0,0.0,0.0,0,0.0,0.0,0,30.0,0.0,0,3,0,0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0,0.0,0,0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0,0.0,0.0,0.0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,3,0,0,0,0,0,0,0,0,2,0,0,0,0,0,0,0,0,0,3,3,3,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,6,0,9,0,90.0,57.0,51.45,66.15,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,46993.95
3,7,2,24,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,0,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,0,0,0,0,0,0,0,0,0,0,0,0,0,3,0,0,3,0,0,3,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0,0.0,0.0,0,99,0,0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0,0.0,0,0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0,0.0,0.0,0.0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,2,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,187898.61
4,9,2,23,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,1,3,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,3,0,0,0,0,0,0,0,0,3,0,0,0,3,0,0,3,0,0,3,3,0,0,0,0,0.0,30.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0,0,0.0,30.0,0.0,0.0,0.0,0,0.0,0.0,0,30.0,0.0,0,3,0,0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0,0.0,0,0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0,0.0,0.0,0.0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,0,0,0,0,0,0,0,0,2,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,30.0,3.87,30.0,21.3,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,73649.73


Test shape: (75818, 370)
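
The train frame has one more column than the test frame (371 versus 370), and the headers above suggest the extra column is `TARGET`. A quick way to confirm which columns differ is sketched below on toy frames; the helper name `column_differences` is an assumption, and in the notebook it would be called with `train_data` and `test_data`.

```python
import pandas as pd


def column_differences(train: pd.DataFrame, test: pd.DataFrame):
    """Return (columns only in train, columns only in test), preserving original order."""
    train_cols, test_cols = set(train.columns), set(test.columns)
    only_train = [c for c in train.columns if c not in test_cols]
    only_test = [c for c in test.columns if c not in train_cols]
    return only_train, only_test


# Hypothetical toy frames mirroring the train/test layout.
toy_train = pd.DataFrame({"ID": [1, 3], "var3": [2, 2], "TARGET": [0, 0]})
toy_test = pd.DataFrame({"ID": [2, 5], "var3": [2, 2]})

only_train, only_test = column_differences(toy_train, toy_test)
print(only_train, only_test)
```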


<div class="alert" style="background-color: #FEF9E7; border-left: 8px solid #D4AC0D; padding: 14px; border-radius: 8px; font-size: 14px; color: #000000;">

### **Initial Cleaning**

</div>

In [3]:
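# Initial cleaning and feature intersection.
# NOTE: the result of initial_cleaning is not assigned. The shapes printed below still
# show 370 feature columns, so the logged constant columns are not dropped from
# df_train. If the function returns the cleaned frame (an assumption; its body is not
# shown here), assign it back: train_data = data_utils.initial_cleaning(train_data)
data_utils.initial_cleaning(train_data)

# Some feature columns are present in train but absent from test, and vice versa.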
y_train = train_data['TARGET']
df_train, df_test = data_utils.intersect_features(train_dataframe=train_data, test_dataframe=test_data)

y_train.head()

Removed constant columns: ['ind_var2_0', 'ind_var2', 'ind_var27_0', 'ind_var28_0', 'ind_var28', 'ind_var27', 'ind_var41', 'ind_var46_0', 'ind_var46', 'num_var27_0', 'num_var28_0', 'num_var28', 'num_var27', 'num_var41', 'num_var46_0', 'num_var46', 'saldo_var28', 'saldo_var27', 'saldo_var41', 'saldo_var46', 'imp_amort_var18_hace3', 'imp_amort_var34_hace3', 'imp_reemb_var13_hace3', 'imp_reemb_var33_hace3', 'imp_trasp_var17_out_hace3', 'imp_trasp_var33_out_hace3', 'num_var2_0_ult1', 'num_var2_ult1', 'num_reemb_var13_hace3', 'num_reemb_var33_hace3', 'num_trasp_var17_out_hace3', 'num_trasp_var33_out_hace3', 'saldo_var2_ult1', 'saldo_medio_var13_medio_hace3']
Finding common features between train and test datasets.


0    0
1    0
2    0
3    0
4    0
Name: TARGET, dtype: int64

In [4]:
df_train.shape, df_test.shape

((76020, 370), (75818, 370))
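
The bodies of `data_utils.initial_cleaning` and `data_utils.intersect_features` are not shown in this notebook. The sketch below is a hypothetical equivalent, written only to illustrate the two steps: the function names, the toy data, and the choice to return new frames rather than modify in place are all assumptions.

```python
import pandas as pd


def remove_constant_and_duplicates(df: pd.DataFrame):
    """Drop columns with a single distinct value, then drop duplicate rows."""
    constant = [c for c in df.columns if df[c].nunique(dropna=False) <= 1]
    cleaned = df.drop(columns=constant).drop_duplicates().reset_index(drop=True)
    return cleaned, constant


def intersect_columns(train: pd.DataFrame, test: pd.DataFrame):
    """Keep only columns present in both frames, in the train frame's order."""
    test_cols = set(test.columns)
    common = [c for c in train.columns if c in test_cols]
    return train[common], test[common]


# Hypothetical toy frames: "flat" is constant, "extra" exists only in test.
toy_train = pd.DataFrame({
    "ID": [1, 2, 3],
    "flat": [0, 0, 0],
    "var15": [23, 34, 23],
    "TARGET": [0, 0, 1],
})
toy_test = pd.DataFrame({
    "ID": [4, 5],
    "flat": [0, 0],
    "var15": [30, 40],
    "extra": [1, 2],
})

cleaned_train, dropped = remove_constant_and_duplicates(toy_train)
train_x, test_x = intersect_columns(cleaned_train, toy_test)
print(dropped, list(train_x.columns), list(test_x.columns))
```

Returning the cleaned frame makes it explicit at the call site whether the result was kept, and iterating over `train.columns` rather than a set keeps the column order stable between runs.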

<div class="alert" style="background-color: #FEF9E7; border-left: 8px solid #D4AC0D; padding: 14px; border-radius: 8px; font-size: 14px; color: #000000;">

### **Feature Intersection**

</div>

In [5]:
print(df_train.shape)
df_train.info()
df_train.describe(include='all')  # Summary statistics per column; append .T to transpose for easier reading

(76020, 370)
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 76020 entries, 0 to 76019
Columns: 370 entries, num_trasp_var17_out_hace3 to num_meses_var13_medio_ult3
dtypes: float64(111), int64(259)
memory usage: 214.6 MB


Unnamed: 0,num_trasp_var17_out_hace3,ind_var20,saldo_var20,imp_op_var41_comer_ult3,saldo_var13_largo,num_aport_var13_hace3,num_op_var40_ult1,num_op_var40_efect_ult3,delta_imp_reemb_var17_1y3,num_op_var41_efect_ult3,imp_trasp_var33_out_ult1,var21,num_venta_var44_hace3,imp_compra_var44_ult1,ind_var25,num_var40,num_var45_hace3,ind_var12,num_op_var41_comer_ult1,saldo_medio_var13_medio_hace2,imp_var7_emit_ult1,num_var24,ind_var5,num_aport_var17_ult1,num_var37_0,num_var45_ult1,saldo_var29,ind_var25_0,var15,var3,ind_var5_0,ind_var46,ind_var31,num_var13_medio_0,saldo_medio_var12_hace2,num_compra_var44_hace3,saldo_medio_var13_medio_ult1,num_var14_0,num_var8_0,imp_trasp_var33_in_ult1,num_var29_0,ind_var25_cte,num_var13,ind_var9_ult1,imp_venta_var44_hace3,delta_num_aport_var13_1y3,delta_num_trasp_var33_in_1y3,imp_trasp_var17_out_ult1,num_meses_var17_ult3,num_var43_recib_ult1,saldo_var40,num_var44_0,ind_var24,saldo_medio_var5_hace3,delta_num_trasp_var17_out_1y3,num_op_var39_efect_ult3,delta_imp_trasp_var33_out_1y3,num_reemb_var33_hace3,saldo_medio_var17_hace3,saldo_var6,ind_var32_cte,num_var26_0,num_reemb_var13_hace3,saldo_medio_var33_ult1,ind_var13,num_var6,delta_num_aport_var33_1y3,num_var6_0,num_op_var39_efect_ult1,num_var5_0,imp_reemb_var33_ult1,saldo_var1,ind_var20_0,num_var17,num_med_var22_ult3,ind_var18,num_var27_0,num_var13_medio,saldo_var41,ind_var31_0,imp_trasp_var17_in_hace3,num_meses_var5_ult3,saldo_medio_var29_ult1,imp_op_var40_efect_ult3,num_var7_recib_ult1,saldo_medio_var8_ult3,delta_imp_amort_var34_1y3,num_var46,ind_var34_0,ind_var44,imp_op_var40_efect_ult1,num_var28_0,num_meses_var39_vig_ult3,ind_var1_0,num_var13_corto,num_var1,ind_var6_0,num_aport_var33_ult1,num_op_var40_comer_ult1,imp_reemb_var17_ult1,num_var35,ind_var24_0,ind_var13_0,ind_var39,ind_var14_0,delta_num_trasp_var17_in_1y3,var38,num_var7_emit_ult1,ind_var7_recib_ult1,saldo_medio_var13_corto_hace2,ind_var39_0,num_aport_var33_hace3,imp_venta_var44_ult1,ind_var26_0,num_reemb_var33_ult1,num_var17_0,nu
m_var34_0,num_var20_0,ind_var18_0,delta_num_reemb_var13_1y3,num_reemb_var17_hace3,delta_num_aport_var17_1y3,imp_op_var40_ult1,ind_var26,num_var39,imp_op_var41_comer_ult1,num_var18_0,ind_var29_0,num_op_var41_comer_ult3,ind_var43_emit_ult1,num_var37_med_ult2,imp_trasp_var17_in_ult1,num_op_var41_efect_ult1,saldo_medio_var13_largo_hace2,delta_imp_aport_var17_1y3,num_var22_hace3,saldo_medio_var33_ult3,imp_trasp_var33_out_hace3,num_var5,imp_aport_var17_ult1,num_var41,saldo_medio_var8_ult1,saldo_var33,num_var2_0_ult1,num_var45_ult3,delta_num_reemb_var33_1y3,num_var12_0,ind_var6,saldo_var44,ind_var12_0,saldo_var13_corto,ind_var40_0,num_var45_hace2,ind_var29,ind_var28,imp_compra_var44_hace3,ind_var10_ult1,imp_op_var41_efect_ult3,saldo_medio_var13_largo_ult3,saldo_var37,num_sal_var16_ult1,num_var25_0,saldo_medio_var29_hace2,saldo_medio_var13_corto_ult1,num_meses_var29_ult3,ind_var34,num_op_var40_hace2,num_meses_var13_largo_ult3,delta_imp_venta_var44_1y3,ind_var10cte_ult1,ind_var33,imp_trasp_var17_out_hace3,saldo_var8,imp_aport_var17_hace3,num_op_var39_comer_ult3,ind_var27_0,num_var28,delta_imp_aport_var33_1y3,ind_var2_0,num_var18,delta_imp_reemb_var33_1y3,imp_op_var40_comer_ult3,imp_op_var39_efect_ult3,num_var31,ind_var43_recib_ult1,ID,saldo_var24,ind_var8_0,imp_op_var41_ult1,saldo_medio_var5_hace2,num_var12,saldo_medio_var5_ult1,num_op_var40_ult3,ind_var17,ind_var2,ind_var37_cte,num_aport_var17_hace3,num_var26,saldo_var28,num_reemb_var13_ult1,saldo_var13_medio,num_var4,num_op_var39_hace2,saldo_medio_var13_largo_hace3,imp_sal_var16_ult1,num_var31_0,saldo_medio_var8_hace2,num_var1_0,saldo_medio_var29_hace3,num_trasp_var17_out_ult1,imp_aport_var13_hace3,delta_imp_reemb_var13_1y3,num_var13_largo_0,num_var40_0,saldo_medio_var8_hace3,num_var27,num_var39_0,num_op_var40_efect_ult1,saldo_medio_var13_corto_hace3,imp_trans_var37_ult1,delta_imp_aport_var13_1y3,num_var29,num_op_var39_comer_ult1,saldo_medio_var12_ult3,saldo_var18,num_var30_0,imp_op_var39_comer_ult3,ind_var7_emit_ult1,num_
venta_var44_ult1,num_compra_var44_ult1,imp_aport_var13_ult1,num_trasp_var33_out_ult1,ind_var14,delta_imp_compra_var44_1y3,imp_amort_var18_hace3,num_var14,num_meses_var33_ult3,imp_op_var41_efect_ult1,saldo_medio_var44_ult1,ind_var27,saldo_var12,imp_aport_var33_ult1,saldo_var34,num_meses_var12_ult3,ind_var37,num_op_var41_ult1,ind_var13_largo_0,ind_var8,imp_op_var39_efect_ult1,ind_var13_corto,imp_var7_recib_ult1,saldo_medio_var5_ult3,num_trasp_var33_in_hace3,imp_op_var39_ult1,ind_var40,saldo_medio_var44_hace3,num_var2_ult1,num_ent_var16_ult1,num_reemb_var17_ult1,ind_var17_0,imp_var43_emit_ult1,ind_var13_corto_0,num_var22_ult1,ind_var1,ind_var32,num_var32,saldo_medio_var44_ult3,saldo_var2_ult1,imp_amort_var34_hace3,saldo_var27,ind_var37_0,num_var30,ind_var41_0,num_var13_0,saldo_var30,num_op_var40_comer_ult3,delta_num_compra_var44_1y3,num_med_var45_ult3,num_trasp_var33_out_hace3,var36,num_trasp_var33_in_ult1,saldo_var13,ind_var26_cte,num_var44,num_var32_0,num_var34,delta_num_venta_var44_1y3,num_var33_0,num_op_var39_hace3,saldo_var26,imp_aport_var33_hace3,num_op_var39_ult1,num_var13_largo,imp_reemb_var33_hace3,num_op_var41_hace2,num_var24_0,saldo_medio_var17_ult1,ind_var13_medio_0,saldo_medio_var29_ult3,imp_reemb_var17_hace3,num_trasp_var11_ult1,saldo_medio_var12_hace3,num_var41_0,num_var42_0,saldo_var42,delta_imp_trasp_var17_out_1y3,num_var22_hace2,num_aport_var13_ult1,saldo_var5,ind_var13_largo,saldo_medio_var33_hace3,imp_amort_var34_ult1,saldo_medio_var13_largo_ult1,ind_var30_0,num_var22_ult3,num_var37,ind_var46_0,num_var8,imp_reemb_var13_hace3,saldo_medio_var44_hace2,ind_var32_0,delta_num_trasp_var33_out_1y3,saldo_var46,saldo_var17,delta_imp_trasp_var17_in_1y3,imp_amort_var18_ult1,saldo_medio_var33_hace2,ind_var13_medio,ind_var9_cte_ult1,imp_ent_var16_ult1,num_meses_var13_corto_ult3,saldo_medio_var13_medio_hace3,num_meses_var44_ult3,num_op_var40_hace3,ind_var28_0,saldo_var14,ind_var30,ind_var19,saldo_var32,num_var43_emit_ult1,ind_var33_0,num_trasp_var17_in_ult1,saldo_
medio_var13_corto_ult3,saldo_medio_var17_hace2,num_meses_var8_ult3,num_var46_0,imp_trasp_var33_in_hace3,num_op_var39_ult3,saldo_medio_var13_medio_ult3,num_trasp_var17_in_hace3,num_var42,saldo_medio_var12_ult1,num_var33,num_op_var41_ult3,ind_var41,imp_op_var39_comer_ult1,num_var20,saldo_var31,delta_imp_trasp_var33_in_1y3,num_var13_corto_0,ind_var44_0,delta_num_reemb_var17_1y3,num_var25,saldo_medio_var17_ult3,delta_imp_amort_var18_1y3,imp_reemb_var13_ult1,saldo_var25,imp_op_var40_comer_ult1,num_op_var41_hace3,num_meses_var13_medio_ult3
count,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76
020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0
mean,0.0,0.0,27.4,113.06,1493.68,0.08,0.06,0.0,2630886.61,1.21,0.04,32.55,0.0,116.78,0.02,0.01,3.89,0.05,2.12,0.18,2.72,0.11,0.66,0.0,0.42,4.36,0.41,0.02,33.21,-1523.2,0.96,0.0,0.0,0.0,3997.02,0.0,0.51,0.07,0.1,0.31,0.0,0.03,0.16,0.09,3.79,48671402.24,657721.65,1.91,0.0,0.81,4.37,0.01,0.04,891.37,526177.32,1.22,131544.33,0.0,36.46,0.41,0.0,0.09,0.0,12.22,0.05,0.0,131544.33,0.0,0.72,2.89,0.02,48.45,0.0,0.01,0.64,0.0,0.0,0.0,0.0,0.0,1.87,1.98,0.25,0.57,0.01,110.03,263088.66,0.0,0.0,0.0,0.41,0.0,1.59,0.01,0.12,0.01,0.0,0.0,0.07,12.57,3.3,0.04,0.05,0.0,0.02,526177.32,117235.81,0.0,0.0,3639.42,0.88,0.0,81.43,0.02,0.0,0.01,0.0,0.01,0.0,4998684.56,0.0,5130228.89,3.16,0.02,0.01,68.8,0.0,0.0,3.46,0.07,0.26,2.51,0.72,771.23,5130228.89,1.18,8.78,0.0,2.0,31.11,0.0,124.62,12.53,0.0,13.65,131544.33,0.21,0.0,96.35,0.07,4993.75,0.01,5.39,0.0,0.0,13.96,0.08,113.23,750.96,36.91,0.0,0.09,0.21,4852.26,0.0,0.0,0.02,0.02,5524861.88,0.09,0.0,0.0,141.23,98.79,3.61,0.0,0.0,131544.33,0.0,0.0,131544.33,6.47,113.79,0.02,0.13,75964.05,5925.12,0.03,137.24,1579.14,0.14,1077.26,0.08,0.0,0.0,0.07,0.0,0.09,0.0,0.0,0.51,1.08,1.62,162.17,5.48,0.02,68.28,0.03,0.0,0.0,2823.95,4998684.56,0.04,0.03,9.51,0.0,2.72,0.0,556.18,1932.95,48671402.24,0.0,2.19,4401.0,43.41,3.37,119.53,0.0,0.0,0.01,619.59,0.0,0.01,9208103.13,0.0,0.02,0.0,68.21,76.03,0.0,6021.62,0.05,0.67,0.1,0.07,2.86,0.01,0.03,68.62,0.04,127.7,1048.86,0.0,140.4,0.0,1.86,0.0,0.19,0.0,0.0,854.12,0.04,0.56,0.0,0.0,0.0,56.61,0.0,0.0,0.0,0.07,2.38,0.88,0.17,13679.67,0.14,9208103.13,4.02,0.0,40.45,0.0,6487.95,0.03,0.01,0.0,0.0,5524861.88,0.0,0.09,76.08,2.99,2.92,0.04,0.0,1.6,0.13,131.03,0.0,0.19,0.16,0.12,613.53,2.7,3.2,7191.73,526177.32,1.3,0.02,1028.47,0.01,1.37,0.02,956.95,1.0,3.04,0.42,0.0,0.09,0.0,31.51,0.0,131544.33,0.0,183.41,526177.32,0.23,7.94,0.0,0.1,86.21,0.1,0.0,0.0,0.0,0.0,69.1,0.73,0.0,3.35,0.39,0.0,0.0,3857.85,91.17,0.05,0.0,2.79,4.63,0.34,0.0,2.22,5703.01,0.0,4.55,0.0,72.36,0.01,292.29,657721.65,0.13,0.0,2630886.61,0.09,109.22,263088.66,
46.18,72.74,3.56,0.09,0.0
std,0.0,0.05,2477.81,512.15,20016.49,0.55,1.83,0.2,162179809.19,5.16,10.88,393.83,0.03,13620.44,0.15,0.18,10.42,0.21,8.8,34.63,554.25,0.57,0.47,0.17,2.24,14.41,83.1,0.15,12.96,39033.46,0.2,0.0,0.06,0.02,37773.14,0.1,113.6,0.61,0.53,53.41,0.03,0.16,0.71,0.28,811.98,695953653.99,81097906.5,357.41,0.08,3.56,113.97,0.13,0.19,9888.6,72536649.82,5.18,36269040.57,0.0,8612.4,83.1,0.03,0.63,0.0,783.21,0.22,0.02,36269040.57,0.03,3.23,0.66,4.35,10937.47,0.06,0.29,1.84,0.01,0.0,0.02,0.0,0.07,388.25,1.3,52.08,36.51,0.23,1935.31,51291831.71,0.0,0.01,0.04,30.6,0.0,0.72,0.11,0.6,0.18,0.01,0.03,2.09,1093.13,2.87,0.2,0.22,0.06,0.15,72536649.82,182664.6,0.02,0.05,26359.17,0.32,0.07,11282.41,0.16,0.01,0.36,0.02,0.18,0.01,223522965.12,0.01,226443467.84,95.27,0.16,0.18,319.61,0.02,0.01,14.14,0.25,1.65,508.97,3.21,13082.16,226443467.84,3.26,538.44,0.0,1.43,2457.09,0.0,2205.25,797.84,0.0,33.3,36269040.57,0.89,0.01,5218.43,0.25,32619.13,0.11,14.5,0.01,0.0,1151.51,0.27,950.09,12422.52,472.1,0.15,0.61,41.82,31886.62,0.01,0.01,0.9,0.21,234986852.92,0.29,0.03,0.0,2515.66,22120.72,14.92,0.0,0.0,36269040.57,0.0,0.02,36269040.57,153.74,953.58,0.33,0.34,43781.95,48008.53,0.18,697.71,12148.45,0.64,9614.91,2.56,0.04,0.0,0.26,0.1,0.63,0.0,0.07,113.6,0.91,7.39,4698.87,465.39,0.41,1733.84,0.32,0.53,0.02,25334.47,223522965.12,0.4,0.32,519.39,0.0,1.14,0.15,7182.64,25355.72,695953653.99,0.02,9.13,35507.18,10934.97,1.34,546.27,0.01,0.26,0.32,11253.0,0.01,0.07,303310817.97,0.0,0.22,0.06,531.9,4040.34,0.0,48144.71,6.02,141.45,0.49,0.25,10.89,0.1,0.17,535.47,0.2,6369.0,8189.95,0.03,712.77,0.06,147.79,0.0,1.0,0.1,0.04,14255.89,0.2,2.1,0.06,0.03,0.14,2852.58,0.0,0.0,0.0,0.25,1.64,0.33,0.75,63014.08,4.29,303310817.97,10.93,0.0,47.36,0.03,38416.75,0.16,0.12,0.14,0.02,234986852.92,0.1,1.23,739.78,226.86,11.11,0.38,0.0,7.32,0.61,14956.53,0.01,31.88,43.62,1.17,9292.75,1.11,0.94,49145.31,72536649.82,3.45,0.29,9852.14,0.1,113.96,4.1,16006.98,0.07,6.21,2.24,0.0,0.5,0.0,2013.13,0.03,36269040.57,0.0,22698.13,72536649.82,5
7.32,455.89,0.01,0.3,1614.76,0.49,0.0,0.09,0.18,0.0,2839.62,0.44,0.06,126.68,2.22,0.03,0.02,25572.25,15392.48,0.33,0.0,323.81,17.18,73.38,0.02,1.5,46202.54,0.09,16.89,0.0,339.32,0.16,23316.4,81097906.5,0.62,0.04,162179809.19,0.61,13082.16,51291831.71,2859.74,726.88,93.16,1.22,0.01
min,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,5.0,-999999.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,-1.0,0.0,0.0,0.0,-0.9,0.0,0.0,-8.04,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,0.0,0.0,0.0,-0.9,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1844.52,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,5163.75,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,0.0,-3401.34,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,0.0,0.0,-4942.26,0.0,0.0,0.0,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,-128.37,0.0,-922.38,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-287.67,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-476.07,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-4942.26,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-4942.26,0.0,0.0,0.0,-2895.72,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-0.03,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
25%,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,23.0,2.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,67870.61,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,38104.75,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,2.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
50%,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,28.0,2.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.99,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,2.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,106409.16,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,76043.0,0.0,0.0,0.0,3.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,2.73,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,1.0,0.0,3.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,3.0,3.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
75%,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,3.0,0.0,0.0,40.0,2.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,12.22,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,2.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,118756.25,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,12.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,113748.75,0.0,0.0,0.0,90.0,0.0,90.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,83.79,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,1.0,0.0,236.0,0.0,0.0,3.0,0.0,99.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,3.0,120.0,0.0,0.0,0.0,90.0,0.0,0.0,0.0,0.0,1.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
max,0.0,1.0,455858.16,16566.81,1500000.0,24.0,234.0,24.0,9999999999.0,156.0,3000.0,30000.0,6.0,3410058.66,1.0,3.0,339.0,1.0,438.0,7741.95,145384.92,6.0,1.0,21.0,114.0,510.0,19531.8,1.0,105.0,238.0,1.0,0.0,1.0,3.0,3000538.14,9.0,30000.0,111.0,6.0,13207.32,3.0,1.0,18.0,1.0,209834.4,9999999999.0,9999999999.0,69622.29,3.0,264.0,8192.61,6.0,1.0,1542339.36,9999999999.0,156.0,9999999999.0,0.0,2368558.95,19531.8,1.0,33.0,0.0,138831.63,1.0,3.0,9999999999.0,3.0,90.0,15.0,1200.0,3000000.0,1.0,27.0,78.0,1.0,0.0,3.0,0.0,1.0,96781.44,3.0,13793.67,6600.0,24.0,177582.0,9999999999.0,0.0,1.0,1.0,6600.0,0.0,3.0,1.0,6.0,6.0,1.0,6.0,210.0,182132.97,36.0,1.0,1.0,1.0,1.0,9999999999.0,22034738.76,3.0,1.0,450000.0,1.0,12.0,2754476.46,1.0,3.0,36.0,3.0,3.0,1.0,9999999999.0,3.0,9999999999.0,8237.82,1.0,3.0,12888.03,3.0,1.0,438.0,1.0,105.0,133730.58,90.0,840000.0,9999999999.0,108.0,91778.73,0.0,15.0,432457.32,0.0,228031.8,142078.8,0.0,801.0,9999999999.0,111.0,1.0,740006.61,1.0,450000.0,1.0,342.0,1.0,0.0,210001.35,1.0,131100.0,1034482.74,60000.0,15.0,33.0,10430.01,450000.0,2.0,1.0,117.0,3.0,9999999999.0,1.0,1.0,0.0,240045.0,6083691.87,600.0,0.0,0.0,9999999999.0,0.0,3.0,9999999999.0,11073.57,131100.0,27.0,1.0,151838.0,3008077.32,1.0,47598.09,812137.26,15.0,601428.6,351.0,1.0,0.0,1.0,12.0,33.0,0.0,3.0,30000.0,7.0,249.0,534000.0,105000.0,36.0,231351.99,6.0,145.2,3.0,840000.0,9999999999.0,18.0,6.0,77586.21,0.0,33.0,24.0,304838.7,2310003.0,9999999999.0,3.0,438.0,2272859.43,3000000.0,114.0,21024.81,1.0,39.0,39.0,450000.0,3.0,1.0,9999999999.0,0.0,12.0,3.0,45990.0,681462.9,0.0,3008077.32,1260.0,36000.0,3.0,1.0,468.0,1.0,1.0,45990.0,1.0,1039260.0,544365.57,3.0,47598.09,1.0,24650.01,0.0,60.0,21.0,1.0,1155003.0,1.0,96.0,1.0,1.0,12.0,397884.3,0.0,0.0,0.0,1.0,33.0,1.0,18.0,3458077.32,582.0,9999999999.0,267.0,0.0,99.0,6.0,1500000.0,1.0,3.0,12.0,3.0,9999999999.0,12.0,81.0,69756.72,36000.0,468.0,18.0,0.0,249.0,9.0,3998687.46,1.0,7331.34,12027.15,93.0,668335.32,33.0,114.0,3008077.32,9999999999.0,123.0,30.0,61932
9.15,1.0,20385.72,1096.02,1500000.0,1.0,234.0,114.0,0.0,3.0,0.0,438329.22,1.0,9999999999.0,0.0,6119500.14,9999999999.0,15691.8,50003.88,1.0,1.0,210000.0,3.0,0.0,3.0,48.0,0.0,450000.0,1.0,1.0,12210.78,180.0,1.0,3.0,450000.0,4210084.23,3.0,0.0,49581.27,468.0,18870.99,6.0,18.0,3004185.6,6.0,468.0,0.0,12888.03,3.0,6119500.14,9999999999.0,6.0,1.0,9999999999.0,33.0,3525776.88,9999999999.0,450000.0,69756.72,8237.82,81.0,2.0


<div class="alert" style="background-color: #FEF9E7; border-left: 8px solid #D4AC0D; padding: 14px; border-radius: 8px; font-size: 14px; color: #000000;">

### **Low-Variance Feature Removal**

</div>

In [6]:
# Identify constant or near-constant numeric features (low variance).
# Variance depends on each feature's scale, so this is a coarse filter:
# columns at or below the 0.01 threshold carry little information and are dropped below.

numeric_cols = df_train.select_dtypes(include=['float64', 'int64']).columns
selector = VarianceThreshold(threshold=0.01)
selector.fit(df_train[numeric_cols])

# get_support() is True for the columns the selector keeps, so invert it.
low_variance_cols = numeric_cols[~selector.get_support()]
print("There are {} low-variance columns".format(len(low_variance_cols)))
print("Low-variance columns:", low_variance_cols.tolist())


There are 97 low-variance columns
Low-variance columns: ['num_trasp_var17_out_hace3', 'ind_var20', 'num_venta_var44_hace3', 'ind_var46', 'ind_var31', 'num_var13_medio_0', 'num_compra_var44_hace3', 'num_var29_0', 'num_meses_var17_ult3', 'num_reemb_var33_hace3', 'ind_var32_cte', 'num_reemb_var13_hace3', 'num_var6', 'num_var6_0', 'ind_var20_0', 'ind_var18', 'num_var27_0', 'num_var13_medio', 'saldo_var41', 'ind_var31_0', 'num_var46', 'ind_var34_0', 'ind_var44', 'num_var28_0', 'ind_var6_0', 'num_aport_var33_ult1', 'ind_var39', 'num_var7_emit_ult1', 'ind_var7_recib_ult1', 'num_aport_var33_hace3', 'num_reemb_var33_ult1', 'num_var34_0', 'ind_var18_0', 'num_reemb_var17_hace3', 'num_var18_0', 'ind_var29_0', 'imp_trasp_var33_out_hace3', 'num_var41', 'num_var2_0_ult1', 'ind_var6', 'ind_var29', 'ind_var28', 'num_meses_var29_ult3', 'ind_var34', 'ind_var33', 'imp_trasp_var17_out_hace3', 'ind_var27_0', 'num_var28', 'ind_var2_0', 'num_var18', 'ind_var17', 'ind_var2', 'saldo_var28', 'num_reemb_var13_ult
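
To make the filter above more concrete, here is a minimal sketch on hypothetical toy data (none of these columns come from the competition files). It assumes that, for a positive threshold, `VarianceThreshold` keeps a column only when its population variance (`ddof=0`) exceeds the threshold, and reproduces that comparison by hand with pandas.

```python
import pandas as pd

# Hypothetical toy data with 200 rows.
toy = pd.DataFrame({
    "constant": [0] * 200,                # variance 0
    "rare_flag": [1] + [0] * 199,         # one positive in 200 rows
    "balanced_flag": [0, 1] * 100,        # variance 0.25
    "small_scale": [0.0, 0.1] * 100,      # changes every row, but on a tiny scale
})

threshold = 0.01

# Population variance (ddof=0) per column; assumed to match what the selector compares.
variances = toy.var(ddof=0)

# Columns whose variance does not exceed the threshold are treated as low-variance.
dropped = variances[variances <= threshold].index.tolist()
kept = variances[variances > threshold].index.tolist()

print("dropped:", dropped)
print("kept:", kept)
```

Note that `small_scale` is dropped even though it changes on every row. A fixed variance threshold is sensitive to units, so it may be worth reviewing the dropped list before relying on it.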

In [7]:
df_train.drop(columns=low_variance_cols.to_list(), inplace=True)
df_train, df_test = data_utils.intersect_features(train_dataframe=df_train, test_dataframe=df_test)

Finding common features between train and test datasets.
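
The implementation of `data_utils.intersect_features` is not shown in this notebook. As a rough illustration only, a helper with the same intent could look like the sketch below. The function name `intersect_features_sketch`, the `target_col` parameter, and the assumption that the label column is called `TARGET` and must be kept in the training set are all hypothetical.

```python
import pandas as pd

def intersect_features_sketch(train, test, target_col="TARGET"):
    """Hypothetical stand-in: keep only the columns present in both frames.

    The label column (assumed to be `target_col`) exists only in the
    training data, so it is re-attached to the training frame.
    """
    # Preserve the training column order for the shared features.
    common = [col for col in train.columns if col in test.columns]
    train_cols = list(common)
    if target_col in train.columns and target_col not in common:
        train_cols.append(target_col)
    return train[train_cols], test[common]

# Toy frames: "b" exists only in train, "c" only in test.
toy_train = pd.DataFrame({"ID": [1, 2], "a": [0.1, 0.2], "b": [5, 6], "TARGET": [0, 1]})
toy_test = pd.DataFrame({"ID": [3, 4], "a": [0.3, 0.4], "c": [7, 8]})

new_train, new_test = intersect_features_sketch(toy_train, toy_test)
print(list(new_train.columns))
print(list(new_test.columns))
```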


<div class="alert" style="background-color: #FEF9E7; border-left: 8px solid #D4AC0D; padding: 14px; border-radius: 8px; font-size: 14px; color: #000000;">

### **Missing Value Analysis**

</div>

In [8]:
missing = df_train.isnull().mean().sort_values(ascending=False)
print(missing[missing > 0])

Series([], dtype: float64)
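
The empty result means there are no `NaN` entries, since `isnull()` only detects those. The summary statistics earlier in the notebook show a minimum of `-999999.0` and maxima of `9999999999.0` in some columns. Treating these as placeholder codes is an assumption, not something the data confirms. The sketch below, on hypothetical toy data, shows how the share of such candidate values per column could be measured alongside the `NaN` share.

```python
import pandas as pd

# Hypothetical toy data; the column names are illustrative only.
toy = pd.DataFrame({
    "a": [1.0, None, 3.0, 4.0],
    "b": [-999999, 2, 2, 2],
    "c": [0.0, 9999999999.0, 9999999999.0, 5.0],
})

# Share of NaN values per column (what the cell above computes).
nan_share = toy.isnull().mean()

# Assumption: these extreme values act as placeholders rather than real measurements.
candidate_sentinels = [-999999, 9999999999]
sentinel_share = toy.isin(candidate_sentinels).mean()

print(nan_share[nan_share > 0])
print(sentinel_share[sentinel_share > 0])
```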


<div class="alert" style="background-color: #FEF9E7; border-left: 8px solid #D4AC0D; padding: 14px; border-radius: 8px; font-size: 14px; color: #000000;">

### **Correlation Analysis**

</div>

In [9]:
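# Absolute pairwise correlations between all numeric columns.
corr = df_train.corr(numeric_only=True).abs()

# Keep only the strict upper triangle so each pair is considered once
# and the diagonal (self-correlation of 1.0) is ignored.
upper = corr.where(np.triu(np.ones(corr.shape), k=1).astype(bool))

# Flag a column if its correlation with any earlier column exceeds 0.95.
to_drop = [column for column in upper.columns if any(upper[column] > 0.95)]
print("Highly correlated features to drop:", to_drop)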

Highly correlated features to drop: ['num_op_var40_ult1', 'num_var4', 'num_var24_0', 'num_op_var39_hace2', 'imp_compra_var44_ult1', 'ind_var25', 'num_var13_largo_0', 'num_var40_0', 'delta_num_reemb_var13_1y3', 'ind_var25_0', 'num_var39_0', 'ind_var26', 'num_var39', 'num_op_var39_comer_ult1', 'saldo_medio_var12_ult3', 'imp_op_var39_comer_ult3', 'saldo_medio_var13_medio_ult1', 'num_var37', 'delta_imp_aport_var17_1y3', 'num_var13', 'delta_num_aport_var13_1y3', 'num_var5', 'delta_num_trasp_var33_out_1y3', 'saldo_var12', 'saldo_var34', 'saldo_var33', 'num_meses_var12_ult3', 'num_op_var41_ult1', 'saldo_var17', 'ind_var24', 'ind_var8', 'imp_op_var39_efect_ult1', 'num_op_var39_efect_ult3', 'delta_imp_trasp_var17_in_1y3', 'delta_imp_trasp_var33_out_1y3', 'imp_amort_var18_ult1', 'saldo_medio_var17_hace3', 'saldo_var6', 'ind_var40_0', 'num_var26_0', 'num_meses_var13_corto_ult3', 'saldo_medio_var33_ult1', 'ind_var10_ult1', 'ind_var13', 'saldo_medio_var13_largo_ult3', 'num_var25_0', 'num_op_var39_e
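
The upper-triangle filter used above can be traced on a small example. The toy data below is hypothetical. It shows that, for each highly correlated pair, the column that appears later in the DataFrame is flagged, while the earlier one is kept.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data: "x_scaled" is an exact multiple of "x".
toy = pd.DataFrame({
    "x": [1.0, 2.0, 3.0, 4.0, 5.0],
    "x_scaled": [2.0, 4.0, 6.0, 8.0, 10.0],
    "y": [5.0, 1.0, 4.0, 2.0, 3.0],
})

corr = toy.corr().abs()

# Mask everything except the strict upper triangle (k=1 skips the diagonal).
mask = np.triu(np.ones(corr.shape, dtype=bool), k=1)
upper = corr.where(mask)

# A column is flagged when it correlates above 0.95 with any earlier column.
to_drop = [col for col in upper.columns if (upper[col] > 0.95).any()]
print(to_drop)
```

Because the choice between the two members of a pair depends only on column order, reordering the columns changes which one survives.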

In [10]:
df_train.drop(columns=to_drop, inplace=True)
df_train, df_test = data_utils.intersect_features(train_dataframe=df_train, test_dataframe=df_test)
print("After dropping highly correlated features, train shape:", df_train.shape)

Finding common features between train and test datasets.
After dropping highly correlated features, train shape: (76020, 184)


<div class="alert" style="background-color: #FEF9E7; border-left: 8px solid #D4AC0D; padding: 14px; border-radius: 8px; font-size: 14px; color: #000000;">

### **Save Cleaned Data**

</div>

In [11]:
# Save the cleaned data
df_train.to_csv(paths.get_processed_data_path("train_cleaned.csv"), index=False)
df_test.to_csv(paths.get_processed_data_path("test_cleaned.csv"), index=False)


<div class="alert" style="background-color:#FFFFFF; border-left: 8px solid #5ad197; padding: 14px; border-radius: 8px; font-size: 14px; color: #000000;">

<div class="alert alert-success">

### **Conclusion**
</div>

The initial data preprocessing phase is complete. The following steps were performed:

- Removed constant columns and duplicate rows.
- Aligned the training and test sets on a common set of features.
- Dropped the 97 numeric features that fell below the 0.01 variance threshold.
- Confirmed that the cleaned training data contains no `NaN` values.
- Removed features whose absolute correlation with another feature exceeded 0.95, leaving 184 columns in the training set.

The cleaned datasets have been saved as `train_cleaned.csv` and `test_cleaned.csv` and are ready for feature engineering, exploratory data analysis (EDA), and model development.

</div>
