<div style=" border-bottom: 8px solid #e3f56c; overflow: hidden; border-radius: 10px; height: 95%; width: 100%; display: flex;">
  <div style="height: 100%; width: 100%; background-color: #3800BB; float: left; text-align: center; display: flex; justify-content: left; align-items: center; font-size: 40px; ">
    <b><span style="color: #FFFFFF; padding: 20px 20px;">Santander Customer Satisfaction: Basic Preprocessing</span></b>
  </div>
</div>

<div class="alert" style="background-color: #FFFFFF; border-left: 8px solid #B12111; padding: 14px; border-radius: 8px; font-size: 14px; color: #000000;">

### **This notebook performs the initial data preprocessing for the [Santander Customer Satisfaction competition](https://www.kaggle.com/c/santander-customer-satisfaction/overview).**

**The main steps include:**

<div class="alert alert-danger">

**Workflow Overview**
</div>


1. **Import Libraries and Setup**: Load required Python libraries and configure the environment.
2. **Load Data**: Read the raw training and test datasets from CSV files.
3. **Initial Cleaning**: Remove constant columns and duplicate rows from the datasets.
4. **Feature Intersection**: Ensure that both train and test datasets have the same set of features.
5. **Low-Variance Feature Removal**: Identify and drop features with very low variance, as they add little value to modeling.
6. **Missing Value Analysis**: Check for missing values in the cleaned training data.
7. **Correlation Analysis**: Identify highly correlated features that may be redundant.
8. **Save Cleaned Data**: Store the cleaned datasets for further analysis and modeling.


<div class="alert alert-danger">

**Key Functions and Steps**
</div>

- `data_utils.load_data`: Loads CSV data into a pandas DataFrame.
- `data_utils.initial_cleaning`: Removes constant columns and duplicates.
- `data_utils.intersect_features`: Ensures feature consistency between train and test sets.
- `VarianceThreshold`: Used to remove low-variance features.


<div class="alert alert-danger">

**Outputs**
</div>

- Cleaned train and test datasets saved to the `data/processed/` directory.
- Summary statistics and information about features removed or flagged for further analysis.

</div>
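
Steps 5 and 7 of the workflow can be sketched on a toy frame as follows. This is a minimal illustration, not the notebook's own code: the helper names `drop_low_variance` and `find_correlated_pairs`, the threshold values, and the toy data are all assumptions made for the example.

```python
import numpy as np
import pandas as pd
from sklearn.feature_selection import VarianceThreshold


def drop_low_variance(df: pd.DataFrame, threshold: float = 0.01) -> pd.DataFrame:
    """Keep only columns whose variance exceeds `threshold` (hypothetical helper)."""
    selector = VarianceThreshold(threshold=threshold)
    selector.fit(df)
    # get_support() returns a boolean mask aligned with df.columns
    return df.loc[:, selector.get_support()]


def find_correlated_pairs(df: pd.DataFrame, cutoff: float = 0.95) -> list:
    """Return (col_a, col_b) pairs whose absolute Pearson correlation exceeds `cutoff`."""
    corr = df.corr().abs()
    # Keep only the strict upper triangle so each pair is reported once
    upper = corr.where(np.triu(np.ones(corr.shape, dtype=bool), k=1))
    return [(a, b) for a in upper.index for b in upper.columns if upper.loc[a, b] > cutoff]


# Toy data (illustrative only, not the competition data)
toy = pd.DataFrame({
    "almost_constant": [0, 0, 0, 0, 0, 0, 0, 0, 0, 1],
    "x": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0, 10.0],
    "x_scaled": [2.0, 4.0, 6.0, 8.0, 10.0, 12.0, 14.0, 16.0, 18.0, 20.0],
    "noise": [3.0, -1.0, 4.0, 1.0, -5.0, 9.0, 2.0, -6.0, 5.0, 3.0],
})

reduced = drop_low_variance(toy, threshold=0.1)
pairs = find_correlated_pairs(reduced, cutoff=0.95)
print(list(reduced.columns))
print(pairs)
```

Selecting columns through the boolean mask keeps the original column names, which `VarianceThreshold.transform` would otherwise discard by returning a plain array.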


---

<div class="alert alert-warning">

**Note:** This notebook is intended as a starting point for feature engineering and model development. Further steps may include advanced feature selection, handling missing values, and building predictive models.
</div>

<div class="alert" style="background-color: #FEF9E7; border-left: 8px solid #D4AC0D; padding: 14px; border-radius: 8px; font-size: 14px; color: #000000;">

### **Import Libraries and Setup**

</div>

In [1]:
import os
import sys
import warnings

import numpy as np
import pandas as pd
from sklearn.feature_selection import VarianceThreshold

# Add the parent directory to the Python path so that `classes` and `src` can be imported
sys.path.append(os.path.abspath('..'))

from classes import Paths
import src.data_utils as data_utils
from src.data_utils import setup_logging

# Show floats with thousands separators and two decimals, and display every column
pd.set_option("display.float_format", "{:,.2f}".format)
pd.set_option("display.max_columns", None)

# Silence all warnings to keep the notebook output readable
warnings.filterwarnings("ignore")

setup_logging()

<div class="alert" style="background-color: #FEF9E7; border-left: 8px solid #D4AC0D; padding: 14px; border-radius: 8px; font-size: 14px; color: #000000;">

### **Load Data**

</div>

In [2]:
paths = Paths()
train_data = data_utils.load_data(paths.get_raw_data_path("train.csv"))
test_data = data_utils.load_data(paths.get_raw_data_path("test.csv"))


display(train_data.head())
print("Train shape: {}".format(train_data.shape))
display(test_data.head())
print("Test shape: {}".format(test_data.shape))

Unnamed: 0,ID,var3,var15,imp_ent_var16_ult1,imp_op_var39_comer_ult1,imp_op_var39_comer_ult3,imp_op_var40_comer_ult1,imp_op_var40_comer_ult3,imp_op_var40_efect_ult1,imp_op_var40_efect_ult3,imp_op_var40_ult1,imp_op_var41_comer_ult1,imp_op_var41_comer_ult3,imp_op_var41_efect_ult1,imp_op_var41_efect_ult3,imp_op_var41_ult1,imp_op_var39_efect_ult1,imp_op_var39_efect_ult3,imp_op_var39_ult1,imp_sal_var16_ult1,ind_var1_0,ind_var1,ind_var2_0,ind_var2,ind_var5_0,ind_var5,ind_var6_0,ind_var6,ind_var8_0,ind_var8,ind_var12_0,ind_var12,ind_var13_0,ind_var13_corto_0,ind_var13_corto,ind_var13_largo_0,ind_var13_largo,ind_var13_medio_0,ind_var13_medio,ind_var13,ind_var14_0,ind_var14,ind_var17_0,ind_var17,ind_var18_0,ind_var18,ind_var19,ind_var20_0,ind_var20,ind_var24_0,ind_var24,ind_var25_cte,ind_var26_0,ind_var26_cte,ind_var26,ind_var25_0,ind_var25,ind_var27_0,ind_var28_0,ind_var28,ind_var27,ind_var29_0,ind_var29,ind_var30_0,ind_var30,ind_var31_0,ind_var31,ind_var32_cte,ind_var32_0,ind_var32,ind_var33_0,ind_var33,ind_var34_0,ind_var34,ind_var37_cte,ind_var37_0,ind_var37,ind_var39_0,ind_var40_0,ind_var40,ind_var41_0,ind_var41,ind_var39,ind_var44_0,ind_var44,ind_var46_0,ind_var46,num_var1_0,num_var1,num_var4,num_var5_0,num_var5,num_var6_0,num_var6,num_var8_0,num_var8,num_var12_0,num_var12,num_var13_0,num_var13_corto_0,num_var13_corto,num_var13_largo_0,num_var13_largo,num_var13_medio_0,num_var13_medio,num_var13,num_var14_0,num_var14,num_var17_0,num_var17,num_var18_0,num_var18,num_var20_0,num_var20,num_var24_0,num_var24,num_var26_0,num_var26,num_var25_0,num_var25,num_op_var40_hace2,num_op_var40_hace3,num_op_var40_ult1,num_op_var40_ult3,num_op_var41_hace2,num_op_var41_hace3,num_op_var41_ult1,num_op_var41_ult3,num_op_var39_hace2,num_op_var39_hace3,num_op_var39_ult1,num_op_var39_ult3,num_var27_0,num_var28_0,num_var28,num_var27,num_var29_0,num_var29,num_var30_0,num_var30,num_var31_0,num_var31,num_var32_0,num_var32,num_var33_0,num_var33,num_var34_0,num_var34,num_var35,num_var37_med_ult2,num_v
ar37_0,num_var37,num_var39_0,num_var40_0,num_var40,num_var41_0,num_var41,num_var39,num_var42_0,num_var42,num_var44_0,num_var44,num_var46_0,num_var46,saldo_var1,saldo_var5,saldo_var6,saldo_var8,saldo_var12,saldo_var13_corto,saldo_var13_largo,saldo_var13_medio,saldo_var13,saldo_var14,saldo_var17,saldo_var18,saldo_var20,saldo_var24,saldo_var26,saldo_var25,saldo_var28,saldo_var27,saldo_var29,saldo_var30,saldo_var31,saldo_var32,saldo_var33,saldo_var34,saldo_var37,saldo_var40,saldo_var41,saldo_var42,saldo_var44,saldo_var46,var36,delta_imp_amort_var18_1y3,delta_imp_amort_var34_1y3,delta_imp_aport_var13_1y3,delta_imp_aport_var17_1y3,delta_imp_aport_var33_1y3,delta_imp_compra_var44_1y3,delta_imp_reemb_var13_1y3,delta_imp_reemb_var17_1y3,delta_imp_reemb_var33_1y3,delta_imp_trasp_var17_in_1y3,delta_imp_trasp_var17_out_1y3,delta_imp_trasp_var33_in_1y3,delta_imp_trasp_var33_out_1y3,delta_imp_venta_var44_1y3,delta_num_aport_var13_1y3,delta_num_aport_var17_1y3,delta_num_aport_var33_1y3,delta_num_compra_var44_1y3,delta_num_reemb_var13_1y3,delta_num_reemb_var17_1y3,delta_num_reemb_var33_1y3,delta_num_trasp_var17_in_1y3,delta_num_trasp_var17_out_1y3,delta_num_trasp_var33_in_1y3,delta_num_trasp_var33_out_1y3,delta_num_venta_var44_1y3,imp_amort_var18_hace3,imp_amort_var18_ult1,imp_amort_var34_hace3,imp_amort_var34_ult1,imp_aport_var13_hace3,imp_aport_var13_ult1,imp_aport_var17_hace3,imp_aport_var17_ult1,imp_aport_var33_hace3,imp_aport_var33_ult1,imp_var7_emit_ult1,imp_var7_recib_ult1,imp_compra_var44_hace3,imp_compra_var44_ult1,imp_reemb_var13_hace3,imp_reemb_var13_ult1,imp_reemb_var17_hace3,imp_reemb_var17_ult1,imp_reemb_var33_hace3,imp_reemb_var33_ult1,imp_var43_emit_ult1,imp_trans_var37_ult1,imp_trasp_var17_in_hace3,imp_trasp_var17_in_ult1,imp_trasp_var17_out_hace3,imp_trasp_var17_out_ult1,imp_trasp_var33_in_hace3,imp_trasp_var33_in_ult1,imp_trasp_var33_out_hace3,imp_trasp_var33_out_ult1,imp_venta_var44_hace3,imp_venta_var44_ult1,ind_var7_emit_ult1,ind_var7_recib_ult1,ind_var10_ult1
,ind_var10cte_ult1,ind_var9_cte_ult1,ind_var9_ult1,ind_var43_emit_ult1,ind_var43_recib_ult1,var21,num_var2_0_ult1,num_var2_ult1,num_aport_var13_hace3,num_aport_var13_ult1,num_aport_var17_hace3,num_aport_var17_ult1,num_aport_var33_hace3,num_aport_var33_ult1,num_var7_emit_ult1,num_var7_recib_ult1,num_compra_var44_hace3,num_compra_var44_ult1,num_ent_var16_ult1,num_var22_hace2,num_var22_hace3,num_var22_ult1,num_var22_ult3,num_med_var22_ult3,num_med_var45_ult3,num_meses_var5_ult3,num_meses_var8_ult3,num_meses_var12_ult3,num_meses_var13_corto_ult3,num_meses_var13_largo_ult3,num_meses_var13_medio_ult3,num_meses_var17_ult3,num_meses_var29_ult3,num_meses_var33_ult3,num_meses_var39_vig_ult3,num_meses_var44_ult3,num_op_var39_comer_ult1,num_op_var39_comer_ult3,num_op_var40_comer_ult1,num_op_var40_comer_ult3,num_op_var40_efect_ult1,num_op_var40_efect_ult3,num_op_var41_comer_ult1,num_op_var41_comer_ult3,num_op_var41_efect_ult1,num_op_var41_efect_ult3,num_op_var39_efect_ult1,num_op_var39_efect_ult3,num_reemb_var13_hace3,num_reemb_var13_ult1,num_reemb_var17_hace3,num_reemb_var17_ult1,num_reemb_var33_hace3,num_reemb_var33_ult1,num_sal_var16_ult1,num_var43_emit_ult1,num_var43_recib_ult1,num_trasp_var11_ult1,num_trasp_var17_in_hace3,num_trasp_var17_in_ult1,num_trasp_var17_out_hace3,num_trasp_var17_out_ult1,num_trasp_var33_in_hace3,num_trasp_var33_in_ult1,num_trasp_var33_out_hace3,num_trasp_var33_out_ult1,num_venta_var44_hace3,num_venta_var44_ult1,num_var45_hace2,num_var45_hace3,num_var45_ult1,num_var45_ult3,saldo_var2_ult1,saldo_medio_var5_hace2,saldo_medio_var5_hace3,saldo_medio_var5_ult1,saldo_medio_var5_ult3,saldo_medio_var8_hace2,saldo_medio_var8_hace3,saldo_medio_var8_ult1,saldo_medio_var8_ult3,saldo_medio_var12_hace2,saldo_medio_var12_hace3,saldo_medio_var12_ult1,saldo_medio_var12_ult3,saldo_medio_var13_corto_hace2,saldo_medio_var13_corto_hace3,saldo_medio_var13_corto_ult1,saldo_medio_var13_corto_ult3,saldo_medio_var13_largo_hace2,saldo_medio_var13_largo_hace3,saldo_medio_var13_
largo_ult1,saldo_medio_var13_largo_ult3,saldo_medio_var13_medio_hace2,saldo_medio_var13_medio_hace3,saldo_medio_var13_medio_ult1,saldo_medio_var13_medio_ult3,saldo_medio_var17_hace2,saldo_medio_var17_hace3,saldo_medio_var17_ult1,saldo_medio_var17_ult3,saldo_medio_var29_hace2,saldo_medio_var29_hace3,saldo_medio_var29_ult1,saldo_medio_var29_ult3,saldo_medio_var33_hace2,saldo_medio_var33_hace3,saldo_medio_var33_ult1,saldo_medio_var33_ult3,saldo_medio_var44_hace2,saldo_medio_var44_hace3,saldo_medio_var44_ult1,saldo_medio_var44_ult3,var38,TARGET
0,1,2,23,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,0,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,0,0,0,0,0,0,0,0,0,0,0,0,0,3,0,0,3,0,0,3,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0,0.0,0.0,0,99,0,0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0,0.0,0,0.0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0.0,0.0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,2,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,39205.17,0
1,3,2,34,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,1,0,0,0,0,0,0,0,1,1,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,1,3,0,0,0,0,0,0,0,3,3,3,0,0,0,0,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,6,3,0,0,0,0,0,0,0,0,3,0,0,0,3,0,0,3,0,0,3,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,300.0,0.0,0,300.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0,0,0.0,300.0,0.0,0.0,0.0,0,0.0,0.0,0,0.0,0.0,0,3,0,0,-1.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,-1.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0,0.0,0,0.0,300.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0.0,0.0,0,0,0,0,0,0,0,0,0,0,0,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,3,0,0,0,0,0,2,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.0,88.89,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,300.0,122.22,300.0,240.75,0.0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,49278.03,0
2,4,2,23,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,1,3,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,3,0,0,0,0,0,0,0,0,3,0,0,0,3,0,0,3,0,0,3,3,0,0,0,0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0,0,0.0,3.0,0.0,0.0,0.0,0,0.0,0.0,0,3.0,0.0,0,99,0,0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0,0.0,0,0.0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0.0,0.0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3.0,0.18,3.0,2.07,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,67333.77,0
3,8,2,37,0.0,195.0,195.0,0.0,0.0,0.0,0.0,0.0,195.0,195.0,0.0,0.0,195.0,0.0,0.0,195.0,0.0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,1,1,1,1,0,0,1,0,0,0,0,0,0,0,0,3,3,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,9,9,0,0,9,9,0,0,0,0,0,0,3,3,0,0,0,0,0,0,0,0,9,6,6,6,3,0,0,3,0,0,3,3,0,0,0,0,0.0,70.62,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0,0,0.0,70.62,0.0,0.0,0.0,0,34.95,0.0,0,70.62,0.0,0,2,0,0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0,0.0,0,0.0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0.0,0.0,0,0,1,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,0,0,3,0,15,2,0,0,0,0,0,0,0,0,1,0,9,9,0,0,0,0,9,9,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,27,3,18,48,0,186.09,0.0,91.56,138.84,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,64007.97,0
4,10,2,39,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,1,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,1,3,0,0,0,0,0,3,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,6,3,0,0,0,0,0,0,0,0,3,0,0,0,3,0,0,3,0,0,6,3,0,0,0,0,0.0,0.0,0.0,0.0,135003.0,0.0,0.0,0,0.0,0.0,0.0,0,0.0,135003.0,0.0,0.0,0,0,0.0,135003.0,0.0,0.0,0.0,0,0.0,0.0,0,135003.0,0.0,0,1,0,0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0,0.0,0,0.0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,135003.0,270003.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0.0,0.0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,0,6,9,3,0,3,0,1,0,0,0,0,0,0,2,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,6,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3.0,0.3,40501.08,13501.47,0.0,0.0,0.0,0.0,0.0,0.0,85501.89,85501.89,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,117310.98,0


Train shape: (76020, 371)


Unnamed: 0,ID,var3,var15,imp_ent_var16_ult1,imp_op_var39_comer_ult1,imp_op_var39_comer_ult3,imp_op_var40_comer_ult1,imp_op_var40_comer_ult3,imp_op_var40_efect_ult1,imp_op_var40_efect_ult3,imp_op_var40_ult1,imp_op_var41_comer_ult1,imp_op_var41_comer_ult3,imp_op_var41_efect_ult1,imp_op_var41_efect_ult3,imp_op_var41_ult1,imp_op_var39_efect_ult1,imp_op_var39_efect_ult3,imp_op_var39_ult1,imp_sal_var16_ult1,ind_var1_0,ind_var1,ind_var2_0,ind_var2,ind_var5_0,ind_var5,ind_var6_0,ind_var6,ind_var8_0,ind_var8,ind_var12_0,ind_var12,ind_var13_0,ind_var13_corto_0,ind_var13_corto,ind_var13_largo_0,ind_var13_largo,ind_var13_medio_0,ind_var13_medio,ind_var13,ind_var14_0,ind_var14,ind_var17_0,ind_var17,ind_var18_0,ind_var18,ind_var19,ind_var20_0,ind_var20,ind_var24_0,ind_var24,ind_var25_cte,ind_var26_0,ind_var26_cte,ind_var26,ind_var25_0,ind_var25,ind_var27_0,ind_var28_0,ind_var28,ind_var27,ind_var29_0,ind_var29,ind_var30_0,ind_var30,ind_var31_0,ind_var31,ind_var32_cte,ind_var32_0,ind_var32,ind_var33_0,ind_var33,ind_var34_0,ind_var34,ind_var37_cte,ind_var37_0,ind_var37,ind_var39_0,ind_var40_0,ind_var40,ind_var41_0,ind_var41,ind_var39,ind_var44_0,ind_var44,ind_var46_0,ind_var46,num_var1_0,num_var1,num_var4,num_var5_0,num_var5,num_var6_0,num_var6,num_var8_0,num_var8,num_var12_0,num_var12,num_var13_0,num_var13_corto_0,num_var13_corto,num_var13_largo_0,num_var13_largo,num_var13_medio_0,num_var13_medio,num_var13,num_var14_0,num_var14,num_var17_0,num_var17,num_var18_0,num_var18,num_var20_0,num_var20,num_var24_0,num_var24,num_var26_0,num_var26,num_var25_0,num_var25,num_op_var40_hace2,num_op_var40_hace3,num_op_var40_ult1,num_op_var40_ult3,num_op_var41_hace2,num_op_var41_hace3,num_op_var41_ult1,num_op_var41_ult3,num_op_var39_hace2,num_op_var39_hace3,num_op_var39_ult1,num_op_var39_ult3,num_var27_0,num_var28_0,num_var28,num_var27,num_var29_0,num_var29,num_var30_0,num_var30,num_var31_0,num_var31,num_var32_0,num_var32,num_var33_0,num_var33,num_var34_0,num_var34,num_var35,num_var37_med_ult2,num_v
ar37_0,num_var37,num_var39_0,num_var40_0,num_var40,num_var41_0,num_var41,num_var39,num_var42_0,num_var42,num_var44_0,num_var44,num_var46_0,num_var46,saldo_var1,saldo_var5,saldo_var6,saldo_var8,saldo_var12,saldo_var13_corto,saldo_var13_largo,saldo_var13_medio,saldo_var13,saldo_var14,saldo_var17,saldo_var18,saldo_var20,saldo_var24,saldo_var26,saldo_var25,saldo_var28,saldo_var27,saldo_var29,saldo_var30,saldo_var31,saldo_var32,saldo_var33,saldo_var34,saldo_var37,saldo_var40,saldo_var41,saldo_var42,saldo_var44,saldo_var46,var36,delta_imp_amort_var18_1y3,delta_imp_amort_var34_1y3,delta_imp_aport_var13_1y3,delta_imp_aport_var17_1y3,delta_imp_aport_var33_1y3,delta_imp_compra_var44_1y3,delta_imp_reemb_var13_1y3,delta_imp_reemb_var17_1y3,delta_imp_reemb_var33_1y3,delta_imp_trasp_var17_in_1y3,delta_imp_trasp_var17_out_1y3,delta_imp_trasp_var33_in_1y3,delta_imp_trasp_var33_out_1y3,delta_imp_venta_var44_1y3,delta_num_aport_var13_1y3,delta_num_aport_var17_1y3,delta_num_aport_var33_1y3,delta_num_compra_var44_1y3,delta_num_reemb_var13_1y3,delta_num_reemb_var17_1y3,delta_num_reemb_var33_1y3,delta_num_trasp_var17_in_1y3,delta_num_trasp_var17_out_1y3,delta_num_trasp_var33_in_1y3,delta_num_trasp_var33_out_1y3,delta_num_venta_var44_1y3,imp_amort_var18_hace3,imp_amort_var18_ult1,imp_amort_var34_hace3,imp_amort_var34_ult1,imp_aport_var13_hace3,imp_aport_var13_ult1,imp_aport_var17_hace3,imp_aport_var17_ult1,imp_aport_var33_hace3,imp_aport_var33_ult1,imp_var7_emit_ult1,imp_var7_recib_ult1,imp_compra_var44_hace3,imp_compra_var44_ult1,imp_reemb_var13_hace3,imp_reemb_var13_ult1,imp_reemb_var17_hace3,imp_reemb_var17_ult1,imp_reemb_var33_hace3,imp_reemb_var33_ult1,imp_var43_emit_ult1,imp_trans_var37_ult1,imp_trasp_var17_in_hace3,imp_trasp_var17_in_ult1,imp_trasp_var17_out_hace3,imp_trasp_var17_out_ult1,imp_trasp_var33_in_hace3,imp_trasp_var33_in_ult1,imp_trasp_var33_out_hace3,imp_trasp_var33_out_ult1,imp_venta_var44_hace3,imp_venta_var44_ult1,ind_var7_emit_ult1,ind_var7_recib_ult1,ind_var10_ult1
,ind_var10cte_ult1,ind_var9_cte_ult1,ind_var9_ult1,ind_var43_emit_ult1,ind_var43_recib_ult1,var21,num_var2_0_ult1,num_var2_ult1,num_aport_var13_hace3,num_aport_var13_ult1,num_aport_var17_hace3,num_aport_var17_ult1,num_aport_var33_hace3,num_aport_var33_ult1,num_var7_emit_ult1,num_var7_recib_ult1,num_compra_var44_hace3,num_compra_var44_ult1,num_ent_var16_ult1,num_var22_hace2,num_var22_hace3,num_var22_ult1,num_var22_ult3,num_med_var22_ult3,num_med_var45_ult3,num_meses_var5_ult3,num_meses_var8_ult3,num_meses_var12_ult3,num_meses_var13_corto_ult3,num_meses_var13_largo_ult3,num_meses_var13_medio_ult3,num_meses_var17_ult3,num_meses_var29_ult3,num_meses_var33_ult3,num_meses_var39_vig_ult3,num_meses_var44_ult3,num_op_var39_comer_ult1,num_op_var39_comer_ult3,num_op_var40_comer_ult1,num_op_var40_comer_ult3,num_op_var40_efect_ult1,num_op_var40_efect_ult3,num_op_var41_comer_ult1,num_op_var41_comer_ult3,num_op_var41_efect_ult1,num_op_var41_efect_ult3,num_op_var39_efect_ult1,num_op_var39_efect_ult3,num_reemb_var13_hace3,num_reemb_var13_ult1,num_reemb_var17_hace3,num_reemb_var17_ult1,num_reemb_var33_hace3,num_reemb_var33_ult1,num_sal_var16_ult1,num_var43_emit_ult1,num_var43_recib_ult1,num_trasp_var11_ult1,num_trasp_var17_in_hace3,num_trasp_var17_in_ult1,num_trasp_var17_out_hace3,num_trasp_var17_out_ult1,num_trasp_var33_in_hace3,num_trasp_var33_in_ult1,num_trasp_var33_out_hace3,num_trasp_var33_out_ult1,num_venta_var44_hace3,num_venta_var44_ult1,num_var45_hace2,num_var45_hace3,num_var45_ult1,num_var45_ult3,saldo_var2_ult1,saldo_medio_var5_hace2,saldo_medio_var5_hace3,saldo_medio_var5_ult1,saldo_medio_var5_ult3,saldo_medio_var8_hace2,saldo_medio_var8_hace3,saldo_medio_var8_ult1,saldo_medio_var8_ult3,saldo_medio_var12_hace2,saldo_medio_var12_hace3,saldo_medio_var12_ult1,saldo_medio_var12_ult3,saldo_medio_var13_corto_hace2,saldo_medio_var13_corto_hace3,saldo_medio_var13_corto_ult1,saldo_medio_var13_corto_ult3,saldo_medio_var13_largo_hace2,saldo_medio_var13_largo_hace3,saldo_medio_var13_
largo_ult1,saldo_medio_var13_largo_ult3,saldo_medio_var13_medio_hace2,saldo_medio_var13_medio_hace3,saldo_medio_var13_medio_ult1,saldo_medio_var13_medio_ult3,saldo_medio_var17_hace2,saldo_medio_var17_hace3,saldo_medio_var17_ult1,saldo_medio_var17_ult3,saldo_medio_var29_hace2,saldo_medio_var29_hace3,saldo_medio_var29_ult1,saldo_medio_var29_ult3,saldo_medio_var33_hace2,saldo_medio_var33_hace3,saldo_medio_var33_ult1,saldo_medio_var33_ult3,saldo_medio_var44_hace2,saldo_medio_var44_hace3,saldo_medio_var44_ult1,saldo_medio_var44_ult3,var38
0,2,2,32,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,1,3,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,3,0,0,0,0,0,0,0,0,3,0,0,0,3,0,0,3,0,0,3,3,0,0,0,0,0.0,6.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0,0,0.0,6.0,0.0,0.0,0.0,0,0.0,0.0,0,6.0,0.0,0,3,0,0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0,0.0,0,0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0,0.0,0.0,0.0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,0,3,0,0,3,0,0,0,0,0,0,0,0,2,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,6.0,2.43,6.0,4.8,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,40532.1
1,5,2,35,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,3,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,3,0,0,0,0,0,0,0,0,3,0,0,0,0,0,0,0,0,0,3,3,0,0,0,0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0,0,0.0,3.0,0.0,0.0,0.0,0,0.0,0.0,0,3.0,0.0,0,3,0,0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0,0.0,0,0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0,0.0,0.0,0.0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,0,3,0,0,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,6,0,6,0,3.0,2.55,3.0,2.85,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,45486.72
2,6,2,23,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,60.0,60.0,60.0,60.0,60.0,60.0,0.0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,2,3,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,3,0,0,3,3,0,0,0,0,0,0,3,3,0,0,0,0,0,0,0,0,6,0,0,0,3,0,0,3,0,0,3,3,0,0,0,0,0.0,30.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0,0,0.0,30.0,0.0,0.0,0.0,0,0.0,0.0,0,30.0,0.0,0,3,0,0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0,0.0,0,0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0,0.0,0.0,0.0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,3,0,0,0,0,0,0,0,0,2,0,0,0,0,0,0,0,0,0,3,3,3,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,6,0,9,0,90.0,57.0,51.45,66.15,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,46993.95
3,7,2,24,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,0,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,0,0,0,0,0,0,0,0,0,0,0,0,0,3,0,0,3,0,0,3,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0,0.0,0.0,0,99,0,0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0,0.0,0,0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0,0.0,0.0,0.0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,2,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,187898.61
4,9,2,23,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,1,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,1,0,0,0,0,0,0,0,0,1,3,3,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,3,0,0,0,0,0,0,0,0,3,0,0,0,3,0,0,3,0,0,3,3,0,0,0,0,0.0,30.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0,0,0.0,30.0,0.0,0.0,0.0,0,0.0,0.0,0,30.0,0.0,0,3,0,0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0.0,0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0,0,0,0.0,0,0,0.0,0.0,0.0,0.0,0,0,0.0,0.0,0,0.0,0.0,0.0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,3,0,0,0,0,0,0,0,0,2,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,30.0,3.87,30.0,21.3,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,73649.73


Test shape: (75818, 370)
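
The two shapes differ by exactly one column. One quick way to confirm which column that is would be to compare the two column indexes. The sketch below uses small stand-in frames (`toy_train` and `toy_test` are illustrative names, holding only a few values copied from the previews above), not the full competition data.

```python
import pandas as pd

# Stand-in frames with a handful of columns from the previews above
toy_train = pd.DataFrame({
    "ID": [1, 3],
    "var15": [23, 34],
    "var38": [39205.17, 49278.03],
    "TARGET": [0, 0],
})
toy_test = pd.DataFrame({
    "ID": [2, 5],
    "var15": [32, 35],
    "var38": [40532.10, 45486.72],
})

# Index.difference returns the labels present in one index but not in the other
only_in_train = toy_train.columns.difference(toy_test.columns).tolist()
only_in_test = toy_test.columns.difference(toy_train.columns).tolist()

print("Only in train:", only_in_train)
print("Only in test:", only_in_test)
```

Applied to `train_data` and `test_data`, the same two calls would be expected to report `TARGET` as the only train-only column, given the 371 versus 370 column counts.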


<div class="alert" style="background-color: #FEF9E7; border-left: 8px solid #D4AC0D; padding: 14px; border-radius: 8px; font-size: 14px; color: #000000;">

### **Initial Cleaning**

</div>

In [3]:
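# Initial cleaning: detect constant columns and duplicate rows in the training data.
# NOTE: the return value is not captured. The 370-column shape reported in the next cell
# indicates that the listed constant columns are still present in `train_data`, so if
# `initial_cleaning` returns a new DataFrame, its result must be assigned to take effect.
data_utils.initial_cleaning(train_data)

# Some columns are present in train and absent in test (e.g. TARGET), or vice versa.
# Set the target aside, then restrict both frames to the columns they share.
y_train = train_data['TARGET']
df_train, df_test = data_utils.intersect_features(train_dataframe=train_data, test_dataframe=test_data)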

y_train.head()

Removed constant columns: ['ind_var2_0', 'ind_var2', 'ind_var27_0', 'ind_var28_0', 'ind_var28', 'ind_var27', 'ind_var41', 'ind_var46_0', 'ind_var46', 'num_var27_0', 'num_var28_0', 'num_var28', 'num_var27', 'num_var41', 'num_var46_0', 'num_var46', 'saldo_var28', 'saldo_var27', 'saldo_var41', 'saldo_var46', 'imp_amort_var18_hace3', 'imp_amort_var34_hace3', 'imp_reemb_var13_hace3', 'imp_reemb_var33_hace3', 'imp_trasp_var17_out_hace3', 'imp_trasp_var33_out_hace3', 'num_var2_0_ult1', 'num_var2_ult1', 'num_reemb_var13_hace3', 'num_reemb_var33_hace3', 'num_trasp_var17_out_hace3', 'num_trasp_var33_out_hace3', 'saldo_var2_ult1', 'saldo_medio_var13_medio_hace3']
Finding common features between train and test datasets.


0    0
1    0
2    0
3    0
4    0
Name: TARGET, dtype: int64
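
For readers without access to `src/data_utils.py`, the kind of cleaning described in this step can be sketched as below. The function `drop_constant_and_duplicates` is a hypothetical stand-in, not the project's `initial_cleaning`; whether the real helper modifies its argument in place or returns a new frame is not shown in this notebook.

```python
import pandas as pd


def drop_constant_and_duplicates(df: pd.DataFrame) -> pd.DataFrame:
    """Return a copy of `df` without constant columns and duplicate rows (hypothetical helper)."""
    # A column is constant when it holds a single distinct value (NaN counted as a value)
    constant_cols = [col for col in df.columns if df[col].nunique(dropna=False) <= 1]
    print(f"Removed constant columns: {constant_cols}")
    cleaned = df.drop(columns=constant_cols).drop_duplicates()
    return cleaned.reset_index(drop=True)


# Toy data (illustrative only)
toy = pd.DataFrame({
    "ind_var2": [0, 0, 0, 0],   # constant column, expected to be dropped
    "var15": [23, 34, 23, 37],
    "TARGET": [0, 0, 0, 1],
})

cleaned = drop_constant_and_duplicates(toy)
print(cleaned.shape)
print(toy.shape)  # the input frame is left untouched
```

Because this version returns a new frame, the result has to be assigned; calling it without assignment leaves the input unchanged.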

In [4]:
df_train.shape, df_test.shape

((76020, 370), (75818, 370))
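
The column order reported by `df_train.info()` further down differs from the order in the raw CSV, which is what one would see if the intersection were computed with an unordered `set`. If a stable column order matters, one option is to filter the training columns in their original order. The helper `intersect_columns` below is an illustrative assumption, not the implementation in `data_utils`.

```python
import pandas as pd


def intersect_columns(train: pd.DataFrame, test: pd.DataFrame):
    """Restrict both frames to their shared columns, keeping the order found in `train`."""
    test_cols = set(test.columns)  # set used only for fast membership checks
    common = [col for col in train.columns if col in test_cols]
    return train[common], test[common]


# Toy frames (illustrative only); the test columns are deliberately shuffled
toy_train = pd.DataFrame({"ID": [1], "var3": [2], "var15": [23], "TARGET": [0]})
toy_test = pd.DataFrame({"var15": [32], "ID": [2], "var3": [2]})

sub_train, sub_test = intersect_columns(toy_train, toy_test)
print(list(sub_train.columns))
print(list(sub_test.columns))
```

Iterating over `train.columns` rather than over a set makes the result deterministic across runs and keeps both frames aligned column by column.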

<div class="alert" style="background-color: #FEF9E7; border-left: 8px solid #D4AC0D; padding: 14px; border-radius: 8px; font-size: 14px; color: #000000;">

### **Feature Intersection**

</div>

In [5]:
print(df_train.shape)
df_train.info()
df_train.describe(include='all')  # Summary statistics per column; append .T to transpose for easier reading

(76020, 370)
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 76020 entries, 0 to 76019
Columns: 370 entries, ind_var34_0 to num_var41_0
dtypes: float64(111), int64(259)
memory usage: 214.6 MB


Unnamed: 0,ind_var34_0,num_var24,num_var13_corto,num_op_var41_efect_ult3,saldo_medio_var13_corto_hace2,num_var5_0,delta_imp_aport_var17_1y3,imp_trans_var37_ult1,saldo_medio_var13_medio_hace2,num_var24_0,saldo_medio_var17_ult3,ind_var12_0,saldo_var13_medio,delta_imp_reemb_var17_1y3,saldo_medio_var44_hace3,saldo_var40,imp_ent_var16_ult1,num_var17_0,num_var46,num_sal_var16_ult1,ind_var26_0,num_trasp_var11_ult1,ind_var29,imp_trasp_var17_in_ult1,delta_imp_venta_var44_1y3,ind_var32_0,delta_imp_trasp_var17_in_1y3,num_var33_0,ind_var28,ind_var6_0,saldo_medio_var12_ult3,ind_var29_0,num_op_var39_comer_ult1,ind_var39,num_var32,num_meses_var12_ult3,delta_imp_trasp_var17_out_1y3,saldo_medio_var44_ult3,delta_imp_trasp_var33_in_1y3,saldo_medio_var12_hace3,imp_amort_var34_hace3,num_trasp_var17_out_ult1,num_var31_0,num_var8_0,ind_var7_emit_ult1,saldo_medio_var12_hace2,num_trasp_var33_out_hace3,ind_var13_medio_0,num_op_var41_ult3,num_var45_hace3,num_var13,imp_trasp_var33_in_ult1,saldo_var13,num_var8,num_compra_var44_hace3,ind_var17_0,delta_imp_aport_var13_1y3,num_op_var41_comer_ult3,imp_aport_var33_hace3,ind_var25,imp_reemb_var13_ult1,ind_var13_largo,num_var13_medio_0,num_trasp_var33_in_ult1,num_op_var41_hace2,num_var14,num_op_var40_comer_ult3,imp_op_var41_efect_ult3,num_op_var39_efect_ult3,num_var46_0,num_var5,ind_var1,var38,ind_var13_medio,ind_var41_0,saldo_medio_var13_corto_ult3,ind_var37_0,num_op_var40_efect_ult1,num_var44,saldo_var25,imp_op_var40_comer_ult3,saldo_medio_var5_ult1,ind_var13_corto,saldo_medio_var13_medio_ult3,num_var13_largo,imp_var7_emit_ult1,ind_var31_0,num_meses_var17_ult3,ind_var7_recib_ult1,num_aport_var13_hace3,var36,num_var6_0,num_var44_0,saldo_medio_var29_hace3,ind_var25_cte,ind_var46,num_var28,imp_amort_var34_ult1,saldo_medio_var8_ult3,num_var31,ind_var14_0,ind_var27_0,imp_trasp_var33_out_ult1,ind_var33_0,delta_imp_reemb_var33_1y3,saldo_medio_var44_hace2,ind_var32_cte,saldo_var13_corto,saldo_var29,num_op_var39_comer_ult3,num_var39,ind_var32,saldo_medio_var
29_ult1,num_var12_0,num_var14_0,num_var2_0_ult1,num_venta_var44_ult1,imp_op_var40_comer_ult1,num_var43_recib_ult1,saldo_medio_var8_hace3,saldo_medio_var17_hace2,saldo_medio_var5_hace2,ind_var34,saldo_var13_largo,saldo_var33,saldo_var28,imp_op_var40_efect_ult3,saldo_medio_var13_medio_ult1,num_var30,num_var40_0,ind_var2,saldo_medio_var17_ult1,num_trasp_var33_out_ult1,ind_var8,imp_reemb_var13_hace3,delta_num_compra_var44_1y3,num_trasp_var17_in_ult1,num_op_var41_efect_ult1,imp_var43_emit_ult1,num_op_var39_hace2,ind_var14,num_var22_ult1,imp_op_var39_efect_ult3,saldo_var2_ult1,ID,saldo_var12,imp_trasp_var33_in_hace3,saldo_medio_var13_largo_ult3,delta_imp_trasp_var33_out_1y3,num_meses_var13_corto_ult3,ind_var39_0,saldo_medio_var33_ult3,saldo_medio_var29_ult3,ind_var10_ult1,num_var6,ind_var18,num_trasp_var17_out_hace3,imp_venta_var44_ult1,num_var4,saldo_var37,delta_num_trasp_var33_in_1y3,num_meses_var8_ult3,num_var29_0,saldo_var24,num_var13_medio,ind_var19,num_reemb_var13_hace3,ind_var43_emit_ult1,saldo_medio_var12_ult1,num_var22_ult3,ind_var20_0,num_var41,ind_var37,var21,num_var42,delta_num_trasp_var33_out_1y3,ind_var40_0,saldo_medio_var17_hace3,imp_op_var39_efect_ult1,saldo_var41,ind_var46_0,num_var37_0,num_var2_ult1,saldo_var30,num_reemb_var13_ult1,saldo_medio_var13_corto_hace3,imp_venta_var44_hace3,ind_var17,num_var27,saldo_var8,num_var20,num_trasp_var17_in_hace3,num_aport_var17_ult1,num_var25,num_var7_emit_ult1,num_op_var39_ult3,num_reemb_var33_hace3,imp_op_var39_comer_ult3,delta_num_aport_var13_1y3,num_var45_hace2,num_var32_0,imp_trasp_var17_out_hace3,num_var18,saldo_var14,saldo_var46,num_var30_0,saldo_var26,num_var25_0,imp_var7_recib_ult1,num_op_var40_hace2,imp_sal_var16_ult1,num_var28_0,var3,imp_reemb_var17_hace3,num_venta_var44_hace3,saldo_var18,ind_var5_0,num_meses_var44_ult3,num_var26_0,ind_var2_0,imp_op_var40_ult1,num_meses_var13_largo_ult3,delta_imp_amort_var18_1y3,num_var33,ind_var13_largo_0,ind_var10cte_ult1,saldo_medio_var13_largo_hace3,saldo_medio_var13_cor
to_ult1,num_var12,num_var22_hace3,num_ent_var16_ult1,saldo_medio_var29_hace2,num_var39_0,ind_var12,ind_var26,imp_op_var40_efect_ult1,delta_num_aport_var33_1y3,saldo_medio_var13_medio_hace3,ind_var13,ind_var9_ult1,num_var34_0,ind_var8_0,saldo_medio_var5_hace3,num_reemb_var33_ult1,ind_var24_0,saldo_var42,num_trasp_var33_in_hace3,saldo_medio_var33_hace3,imp_trasp_var17_in_hace3,num_var34,num_var13_largo_0,num_var13_0,ind_var20,num_var7_recib_ult1,ind_var31,delta_num_trasp_var17_in_1y3,num_var27_0,saldo_var32,delta_num_aport_var17_1y3,num_var43_emit_ult1,delta_imp_compra_var44_1y3,ind_var9_cte_ult1,saldo_var17,num_op_var40_ult3,ind_var13_0,delta_num_trasp_var17_out_1y3,saldo_medio_var44_ult1,saldo_medio_var8_ult1,ind_var41,ind_var40,imp_op_var41_efect_ult1,num_meses_var13_medio_ult3,num_op_var41_ult1,saldo_var1,delta_num_reemb_var33_1y3,num_var18_0,num_meses_var39_vig_ult3,delta_imp_reemb_var13_1y3,ind_var25_0,imp_aport_var33_ult1,num_var20_0,ind_var44_0,num_var40,imp_op_var41_ult1,num_var37,num_med_var45_ult3,num_var45_ult1,num_var29,delta_imp_aport_var33_1y3,num_meses_var5_ult3,imp_compra_var44_ult1,saldo_var44,num_var37_med_ult2,ind_var30_0,imp_trasp_var33_out_hace3,ind_var33,delta_num_reemb_var17_1y3,num_aport_var33_ult1,num_var22_hace2,ind_var6,imp_amort_var18_ult1,num_aport_var13_ult1,imp_aport_var13_hace3,saldo_var31,num_var42_0,num_reemb_var17_hace3,imp_op_var41_comer_ult3,num_op_var40_hace3,imp_compra_var44_hace3,saldo_medio_var8_hace2,num_aport_var17_hace3,ind_var28_0,ind_var5,num_var35,imp_trasp_var17_out_ult1,saldo_medio_var33_hace2,num_op_var40_efect_ult3,saldo_medio_var5_ult3,imp_op_var41_comer_ult1,num_var26,num_op_var40_ult1,num_op_var39_efect_ult1,imp_reemb_var33_hace3,saldo_var34,num_aport_var33_hace3,num_op_var40_comer_ult1,num_meses_var33_ult3,ind_var43_recib_ult1,ind_var44,ind_var27,num_op_var41_comer_ult1,imp_op_var39_comer_ult1,saldo_var5,ind_var37_cte,num_op_var39_ult1,ind_var26_cte,num_var1_0,imp_op_var39_ult1,saldo_var6,ind_var18_0,saldo_var27,
ind_var1_0,imp_reemb_var17_ult1,num_reemb_var17_ult1,num_var1,ind_var30,saldo_var20,num_var13_corto_0,num_op_var41_hace3,imp_reemb_var33_ult1,ind_var24,imp_aport_var17_hace3,var15,ind_var13_corto_0,delta_num_reemb_var13_1y3,delta_num_venta_var44_1y3,num_compra_var44_ult1,num_meses_var29_ult3,saldo_medio_var33_ult1,saldo_medio_var13_largo_ult1,num_var17,imp_aport_var13_ult1,delta_imp_amort_var34_1y3,imp_amort_var18_hace3,num_var45_ult3,num_med_var22_ult3,imp_aport_var17_ult1,saldo_medio_var13_largo_hace2,num_op_var39_hace3,num_var41_0
count,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76
020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0,76020.0
mean,0.0,0.11,0.12,1.21,3639.42,2.89,5130228.89,1932.95,0.18,0.13,109.22,0.07,0.51,2630886.61,1.86,4.37,86.21,0.01,0.0,0.0,0.02,0.12,0.0,2.51,5524861.88,0.0,526177.32,0.0,0.0,0.0,4401.0,0.0,2.19,0.0,0.0,0.1,526177.32,56.61,657721.65,613.53,0.0,0.0,0.02,0.1,0.0,3997.02,0.0,0.0,4.55,3.89,0.16,0.31,6487.95,0.09,0.0,0.0,48671402.24,3.46,2.99,0.02,46.18,0.01,0.0,0.0,1.6,0.02,0.14,113.23,1.22,0.0,2.0,0.0,117235.81,0.0,0.88,3857.85,0.07,0.0,0.01,72.74,6.47,1077.26,0.04,0.34,0.04,2.72,0.0,0.0,0.0,0.08,40.45,0.0,0.01,0.0,0.03,0.0,0.0,0.02,110.03,0.02,0.02,0.0,0.04,0.0,131544.33,31.51,0.0,4993.75,0.41,3.61,0.01,0.0,0.25,0.21,0.07,0.0,0.0,3.56,0.81,9.51,91.17,1579.14,0.0,1493.68,12.53,0.0,0.57,0.51,2.38,0.03,0.0,131.03,0.0,0.03,0.0,9208103.13,0.0,0.72,854.12,1.62,0.01,0.56,113.79,0.0,75964.05,6021.62,2.79,750.96,131544.33,0.1,0.88,8.78,0.19,0.08,0.0,0.0,0.0,81.43,1.08,36.91,657721.65,0.05,0.0,5925.12,0.0,0.0,0.0,0.07,5703.01,3.04,0.0,0.0,0.07,32.55,2.22,131544.33,0.01,36.46,68.62,0.0,0.0,0.42,0.0,13679.67,0.0,556.18,3.79,0.0,0.0,141.23,0.01,0.0,0.0,0.09,0.0,4.63,0.0,119.53,48671402.24,5.39,0.0,0.0,0.0,69.1,0.0,3.37,76.08,0.09,127.7,0.02,5.48,0.0,-1523.2,0.16,0.0,43.41,0.96,0.0,0.09,0.0,3.16,0.02,263088.66,0.0,0.01,0.09,162.17,4852.26,0.14,1.18,0.19,0.21,2.72,0.05,0.02,0.41,131544.33,0.0,0.05,0.09,0.0,0.03,891.37,0.0,0.04,7191.73,0.0,1.37,1.87,0.0,0.04,0.17,0.0,0.01,0.0,526177.32,0.0,3.35,5130228.89,0.39,9208103.13,0.1,183.41,0.08,0.05,526177.32,76.03,124.62,0.0,0.0,68.21,0.0,2.86,48.45,131544.33,0.0,1.59,4998684.56,0.02,0.05,0.01,0.0,0.01,137.24,0.42,4.02,4.36,0.0,131544.33,1.98,116.78,96.35,0.26,1.0,0.0,0.0,2630886.61,0.0,1.3,0.0,0.23,0.02,2823.95,292.29,3.2,0.0,113.06,0.0,13.96,68.28,0.0,0.0,0.66,3.3,1.91,7.94,0.0,1048.86,68.8,0.09,0.06,0.72,0.0,0.67,0.0,0.07,0.0,0.13,0.0,0.0,2.12,72.36,1028.47,0.07,2.92,0.03,0.03,140.4,0.41,0.0,0.0,0.01,12.57,0.0,0.01,0.73,27.4,0.13,0.09,0.02,0.04,98.79,33.21,0.04,4998684.56,5524861.88,0.01,0.0,12.22,956.95,0.01,619.59,263088.66,0.0,13.65,0
.64,31.11,771.23,0.09,2.7
std,0.01,0.57,0.6,5.16,26359.17,0.66,226443467.84,25355.72,34.63,0.61,13082.16,0.25,113.6,162179809.19,147.79,113.97,1614.76,0.36,0.0,0.15,0.16,1.17,0.01,508.97,234986852.92,0.03,72536649.82,0.1,0.0,0.01,35507.18,0.01,9.13,0.06,0.14,0.49,72536649.82,2852.58,81097906.5,9292.75,0.0,0.02,0.41,0.53,0.01,37773.14,0.0,0.01,16.89,10.42,0.71,53.41,38416.75,0.5,0.1,0.04,695953653.99,14.14,226.86,0.15,2859.74,0.1,0.02,0.03,7.32,0.22,4.29,950.09,5.18,0.0,1.43,0.06,182664.6,0.01,0.33,25572.25,0.25,0.15,0.12,726.88,153.74,9614.91,0.2,73.38,0.38,554.25,0.07,0.08,0.05,0.55,47.36,0.03,0.13,0.53,0.16,0.0,0.0,4.1,1935.31,0.33,0.15,0.0,10.88,0.03,36269040.57,2013.13,0.03,32619.13,83.1,14.92,0.18,0.03,52.08,0.89,0.61,0.0,0.26,93.16,3.56,519.39,15392.48,12148.45,0.01,20016.49,797.84,0.0,36.51,113.6,1.64,0.32,0.0,14956.53,0.01,0.17,0.0,303310817.97,0.02,3.21,14255.89,7.39,0.07,2.1,953.58,0.0,43781.95,48144.71,323.81,12422.52,36269040.57,0.49,0.32,538.44,31.88,0.27,0.02,0.01,0.0,11282.41,0.91,472.1,81097906.5,0.33,0.03,48008.53,0.02,0.06,0.0,0.25,46202.54,6.21,0.06,0.0,0.25,393.83,1.5,36269040.57,0.11,8612.4,535.47,0.0,0.0,2.24,0.0,63014.08,0.07,7182.64,811.98,0.04,0.0,2515.66,0.16,0.02,0.17,0.61,0.02,17.18,0.0,546.27,695953653.99,14.5,0.14,0.0,0.02,2839.62,0.0,1.34,739.78,0.61,6369.0,0.9,465.39,0.0,39033.46,43.62,0.03,10934.97,0.2,0.09,0.63,0.0,95.27,0.21,51291831.71,0.09,0.1,0.29,4698.87,31886.62,0.64,3.26,1.0,41.82,1.14,0.21,0.16,30.6,36269040.57,0.0,0.22,0.28,0.02,0.18,9888.6,0.01,0.2,49145.31,0.03,113.96,388.25,0.02,0.4,0.75,0.05,0.23,0.06,72536649.82,0.0,126.68,226443467.84,2.22,303310817.97,0.3,22698.13,2.56,0.22,72536649.82,4040.34,2205.25,0.0,0.06,531.9,0.01,10.89,10937.47,36269040.57,0.02,0.72,223522965.12,0.15,6.02,0.18,0.04,0.18,697.71,2.24,10.93,14.41,0.02,36269040.57,1.3,13620.44,5218.43,1.65,0.07,0.0,0.03,162179809.19,0.03,3.45,0.01,57.32,0.29,25334.47,23316.4,0.94,0.01,512.15,0.18,1151.51,1733.84,0.1,0.0,0.47,2.87,357.41,455.89,0.2,8189.95,319.61,0.63,1.83,3.23,0.0,141.45,
0.07,2.09,0.06,0.34,0.04,0.0,8.8,339.32,9852.14,0.26,11.11,0.16,0.32,712.77,83.1,0.01,0.0,0.11,1093.13,0.1,0.18,0.44,2477.81,0.62,1.22,4.35,0.19,22120.72,12.96,0.2,223522965.12,234986852.92,0.32,0.01,783.21,16006.98,0.29,11253.0,51291831.71,0.0,33.3,1.84,2457.09,13082.16,1.23,1.11
min,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,-0.9,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,5163.75,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-922.38,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1844.52,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-0.03,-128.37,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-4942.26,0.0,0.0,0.0,0.0,0.0,-4942.26,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-999999.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,-8.04,0.0,0.0,-4942.26,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,0.0,-1.0,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,0.0,-3401.34,0.0,0.0,0.0,0.0,0.0,-0.9,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-287.67,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-476.07,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,-2895.72,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,5.0,0.0,0.0,-1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
25%,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,67870.61,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,2.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,38104.75,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,2.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,23.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0
50%,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,106409.16,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,76043.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,2.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.99,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,2.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,3.0,0.0,0.0,0.0,2.73,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,28.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0
75%,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,118756.25,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,90.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,99.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,90.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,113748.75,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,236.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,2.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,12.22,0.0,0.0,120.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,2.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,3.0,0.0,0.0,3.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,3.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,3.0,0.0,0.0,0.0,83.79,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,90.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,40.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,12.0,0.0,0.0,0.0,0.0,3.0
max,1.0,6.0,6.0,156.0,450000.0,15.0,9999999999.0,2310003.0,7741.95,9.0,3525776.88,1.0,30000.0,9999999999.0,24650.01,8192.61,210000.0,36.0,0.0,15.0,1.0,93.0,1.0,133730.58,9999999999.0,1.0,9999999999.0,12.0,0.0,1.0,2272859.43,1.0,438.0,1.0,12.0,3.0,9999999999.0,397884.3,9999999999.0,668335.32,0.0,3.0,36.0,6.0,1.0,3000538.14,0.0,1.0,468.0,339.0,18.0,13207.32,1500000.0,3.0,9.0,1.0,9999999999.0,438.0,36000.0,1.0,450000.0,1.0,3.0,6.0,249.0,12.0,582.0,131100.0,156.0,0.0,15.0,1.0,22034738.76,1.0,1.0,450000.0,1.0,24.0,3.0,69756.72,11073.57,601428.6,1.0,18870.99,18.0,145384.92,1.0,3.0,1.0,24.0,99.0,3.0,6.0,145.2,1.0,0.0,0.0,1096.02,177582.0,27.0,1.0,0.0,3000.0,1.0,9999999999.0,438329.22,1.0,450000.0,19531.8,600.0,3.0,1.0,13793.67,111.0,111.0,0.0,39.0,8237.82,264.0,77586.21,4210084.23,812137.26,1.0,1500000.0,142078.8,0.0,6600.0,30000.0,33.0,6.0,0.0,3998687.46,3.0,1.0,0.0,9999999999.0,3.0,90.0,1155003.0,249.0,1.0,96.0,131100.0,0.0,151838.0,3008077.32,49581.27,1034482.74,9999999999.0,3.0,1.0,91778.73,7331.34,1.0,3.0,1.0,0.0,2754476.46,7.0,60000.0,9999999999.0,3.0,3.0,3008077.32,3.0,1.0,0.0,1.0,3004185.6,234.0,1.0,0.0,1.0,30000.0,18.0,9999999999.0,1.0,2368558.95,45990.0,0.0,0.0,114.0,0.0,3458077.32,3.0,304838.7,209834.4,1.0,0.0,240045.0,3.0,6.0,21.0,33.0,3.0,468.0,0.0,21024.81,9999999999.0,342.0,12.0,0.0,3.0,450000.0,0.0,114.0,69756.72,33.0,1039260.0,117.0,105000.0,0.0,238.0,12027.15,6.0,3000000.0,1.0,3.0,33.0,0.0,8237.82,3.0,9999999999.0,6.0,1.0,1.0,534000.0,450000.0,15.0,108.0,60.0,10430.01,33.0,1.0,1.0,6600.0,9999999999.0,0.0,1.0,1.0,3.0,1.0,1542339.36,3.0,1.0,3008077.32,3.0,20385.72,96781.44,3.0,18.0,18.0,1.0,24.0,1.0,9999999999.0,0.0,12210.78,9999999999.0,180.0,9999999999.0,1.0,6119500.14,351.0,1.0,9999999999.0,681462.9,228031.8,0.0,1.0,45990.0,2.0,468.0,3000000.0,9999999999.0,3.0,3.0,9999999999.0,1.0,1260.0,3.0,1.0,3.0,47598.09,114.0,267.0,510.0,3.0,9999999999.0,3.0,3410058.66,740006.61,105.0,1.0,0.0,1.0,9999999999.0,6.0,123.0,1.0,15691.8,30.0,840000.0,6119500.14,114.0,3.0,
16566.81,48.0,210001.35,231351.99,12.0,0.0,1.0,36.0,69622.29,50003.88,24.0,544365.57,12888.03,33.0,234.0,90.0,0.0,36000.0,12.0,210.0,3.0,1.0,1.0,0.0,438.0,12888.03,619329.15,1.0,468.0,1.0,6.0,47598.09,19531.8,1.0,0.0,1.0,182132.97,21.0,6.0,1.0,455858.16,6.0,81.0,1200.0,1.0,6083691.87,105.0,1.0,9999999999.0,9999999999.0,39.0,2.0,138831.63,1500000.0,27.0,450000.0,9999999999.0,0.0,801.0,78.0,432457.32,840000.0,81.0,33.0


<div class="alert" style="background-color: #FEF9E7; border-left: 8px solid #D4AC0D; padding: 14px; border-radius: 8px; font-size: 14px; color: #000000;">

### **Low-Variance Feature Removal**

</div>

In [6]:
# Flag numeric features whose variance falls below the 0.01 threshold (constant or near-constant).
# Variance depends on each feature's scale, so this mostly catches sparse indicator and count columns.

numeric_df = df_train.select_dtypes(include=['float64', 'int64'])

selector = VarianceThreshold(threshold=0.01)
selector.fit(numeric_df)

# get_support() marks the columns that are kept; invert it to list the ones to drop
low_variance_cols = numeric_df.columns[~selector.get_support()]
print("There are {} low-variance columns".format(len(low_variance_cols)))
print("Low-variance columns:", low_variance_cols.tolist())


There are 97 low-variance columns
Low-variance columns: ['ind_var34_0', 'num_var46', 'ind_var29', 'ind_var32_0', 'ind_var28', 'ind_var6_0', 'ind_var29_0', 'ind_var39', 'imp_amort_var34_hace3', 'num_trasp_var17_out_ult1', 'ind_var7_emit_ult1', 'num_trasp_var33_out_hace3', 'ind_var13_medio_0', 'num_compra_var44_hace3', 'ind_var17_0', 'ind_var13_largo', 'num_var13_medio_0', 'num_trasp_var33_in_ult1', 'num_var46_0', 'ind_var1', 'ind_var13_medio', 'ind_var31_0', 'num_meses_var17_ult3', 'ind_var7_recib_ult1', 'num_var6_0', 'ind_var46', 'num_var28', 'ind_var27_0', 'ind_var33_0', 'ind_var32_cte', 'ind_var32', 'num_var2_0_ult1', 'ind_var34', 'saldo_var28', 'ind_var2', 'num_trasp_var33_out_ult1', 'imp_reemb_var13_hace3', 'num_trasp_var17_in_ult1', 'ind_var14', 'saldo_var2_ult1', 'num_var6', 'ind_var18', 'num_trasp_var17_out_hace3', 'num_var29_0', 'num_var13_medio', 'ind_var19', 'num_reemb_var13_hace3', 'ind_var20_0', 'num_var41', 'saldo_var41', 'ind_var46_0', 'num_var2_ult1', 'num_reemb_var13_ul
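
Because `VarianceThreshold` compares raw variances, the same threshold treats columns on different scales very differently. The sketch below illustrates this on a toy frame; the column names and values are invented for illustration and are not taken from the Santander data.

```python
import pandas as pd
from sklearn.feature_selection import VarianceThreshold

# Toy data (illustrative only, not the Santander columns).
toy = pd.DataFrame({
    "rare_flag": [0] * 199 + [1],        # binary flag set in 0.5% of rows
    "balance": [0.0] * 199 + [1000.0],   # same sparsity, much larger scale
    "constant": [1] * 200,               # zero variance
})

selector = VarianceThreshold(threshold=0.01)
selector.fit(toy)

mask = selector.get_support()            # True for columns that are kept
kept = toy.columns[mask].tolist()
dropped = toy.columns[~mask].tolist()

print("kept:", kept)
print("dropped:", dropped)
```

The sparse flag is dropped while the equally sparse balance column survives, purely because of its scale. If rare indicators might matter for the target, it may be worth checking them against `TARGET` before discarding them.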

In [7]:
df_train.drop(columns=low_variance_cols.to_list(), inplace=True)
df_train, df_test = data_utils.intersect_features(train_dataframe=df_train, test_dataframe=df_test)

Finding common features between train and test datasets.
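
The implementation of `data_utils.intersect_features` is not shown in this notebook. A minimal sketch of what such a helper might do, assuming it simply keeps the columns present in both frames, could look like this (the function name `intersect_features_sketch` is hypothetical):

```python
import pandas as pd


def intersect_features_sketch(train_dataframe, test_dataframe):
    """Hypothetical stand-in for data_utils.intersect_features."""
    # Follow the training column order so both outputs line up column by column.
    common = [c for c in train_dataframe.columns if c in test_dataframe.columns]
    return train_dataframe[common], test_dataframe[common]


train = pd.DataFrame({"a": [1, 2], "b": [3, 4], "c": [5, 6]})
test = pd.DataFrame({"c": [7], "a": [8], "d": [9]})

train_aligned, test_aligned = intersect_features_sketch(train, test)
print(list(train_aligned.columns), list(test_aligned.columns))
```

Calling the helper after every drop on `df_train` keeps `df_test` in step without repeating the drop on both frames.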


<div class="alert" style="background-color: #FEF9E7; border-left: 8px solid #D4AC0D; padding: 14px; border-radius: 8px; font-size: 14px; color: #000000;">

### **Missing Value Analysis**

</div>

In [8]:
# Fraction of null values per column; only columns with at least one null are printed
missing = df_train.isnull().mean().sort_values(ascending=False)
print(missing[missing > 0])

Series([], dtype: float64)
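
An empty result here only rules out `NaN` values. The summary statistics earlier in the notebook show minima of -999999 and maxima of 9999999999 in some columns, and values like these may be placeholder codes rather than real measurements. The sketch below shows one way to measure how often such codes occur; treating them as missing is an assumption, and the helper name `sentinel_share` and the toy columns are hypothetical.

```python
import pandas as pd

# Candidate placeholder codes, taken from the extremes in the describe() output.
# Treating them as "missing" is an assumption, not something the notebook confirms.
SENTINELS = [-999999, 9999999999]


def sentinel_share(df, sentinels=SENTINELS):
    """Return, per column, the fraction of rows equal to any sentinel value."""
    share = df.isin(sentinels).mean()
    return share[share > 0].sort_values(ascending=False)


toy = pd.DataFrame({
    "var_a": [2, -999999, 2, 2],
    "delta_b": [0.0, 9999999999.0, 9999999999.0, 0.0],
    "clean": [1, 2, 3, 4],
})

report = sentinel_share(toy)
print(report)
```

Applied to `df_train`, a non-empty report would suggest reviewing those columns before modeling.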


<div class="alert" style="background-color: #FEF9E7; border-left: 8px solid #D4AC0D; padding: 14px; border-radius: 8px; font-size: 14px; color: #000000;">

### **Correlation Analysis**

</div>

In [9]:
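# Absolute Pearson correlation between every pair of numeric features
corr = df_train.corr(numeric_only=True).abs()
# Keep only the entries above the diagonal so each pair is considered once
upper = corr.where(np.triu(np.ones(corr.shape), k=1).astype(bool))

# Flag a feature if it correlates above 0.95 with any feature that precedes it in column order
to_drop = [column for column in upper.columns if any(upper[column] > 0.95)]
print("Highly correlated features to drop:", to_drop)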

Highly correlated features to drop: ['num_var13_corto', 'saldo_medio_var13_medio_hace2', 'num_var37', 'saldo_var13_medio', 'delta_imp_reemb_var33_1y3', 'num_var39', 'ind_var26_0', 'delta_num_reemb_var17_1y3', 'num_var25_0', 'saldo_medio_var17_hace2', 'saldo_medio_var13_medio_ult1', 'imp_amort_var18_ult1', 'saldo_medio_var17_ult1', 'saldo_var18', 'num_var32', 'saldo_var31', 'imp_op_var41_comer_ult3', 'num_var26_0', 'num_var26', 'saldo_medio_var13_corto_ult1', 'num_op_var39_efect_ult1', 'saldo_var34', 'num_op_var41_ult3', 'saldo_var12', 'num_var8', 'num_op_var41_comer_ult1', 'ind_var12', 'ind_var26', 'delta_num_aport_var33_1y3', 'delta_imp_trasp_var33_out_1y3', 'delta_imp_aport_var13_1y3', 'num_op_var41_comer_ult3', 'imp_op_var39_comer_ult1', 'num_meses_var13_corto_ult3', 'saldo_medio_var33_ult3', 'saldo_medio_var29_ult3', 'ind_var25', 'ind_var13', 'ind_var9_ult1', 'ind_var26_cte', 'num_var1_0', 'imp_op_var39_ult1', 'ind_var8_0', 'saldo_var6', 'imp_venta_var44_ult1', 'num_var4', 'ind_var
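
The upper-triangle selection used in the cell above is easier to follow on a small example. This sketch uses synthetic columns (names and data are invented for illustration) and the same 0.95 cutoff:

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
x = rng.normal(size=500)

toy = pd.DataFrame({
    "x": x,
    "x_scaled": 3.0 * x + 1.0,        # linear function of x, so |r| is 1
    "noise": rng.normal(size=500),    # drawn independently of x
})

corr = toy.corr().abs()
# Entries strictly above the diagonal: each pair appears once, self-pairs are excluded.
upper = corr.where(np.triu(np.ones(corr.shape, dtype=bool), k=1))

# A column is flagged when it correlates strongly with any earlier column.
to_drop = [col for col in upper.columns if (upper[col] > 0.95).any()]
print(to_drop)
```

Note that the later column of each correlated pair is the one removed, so the result depends on column order. Reordering the columns of `df_train` could change which member of a pair survives.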

In [10]:
df_train.drop(columns=to_drop, inplace=True)
df_train, df_test = data_utils.intersect_features(train_dataframe=df_train, test_dataframe=df_test)
print("After dropping highly correlated features, train shape:", df_train.shape)

Finding common features between train and test datasets.
After dropping highly correlated features, train shape: (76020, 184)


<div class="alert" style="background-color: #FEF9E7; border-left: 8px solid #D4AC0D; padding: 14px; border-radius: 8px; font-size: 14px; color: #000000;">

### **Save Cleaned Data**

</div>

In [11]:
# Re-attach the target to the cleaned features; pandas aligns y_train to df_train by index label
train_data_with_target = df_train.copy()
train_data_with_target['TARGET'] = y_train
train_data_with_target

Unnamed: 0,ind_var13_corto,num_var24,saldo_medio_var13_medio_ult3,num_var13_largo,num_op_var41_efect_ult3,num_var37_0,saldo_var1,delta_num_reemb_var33_1y3,imp_var7_emit_ult1,saldo_medio_var13_corto_hace2,num_meses_var39_vig_ult3,saldo_var30,num_var5_0,saldo_medio_var13_corto_hace3,delta_imp_reemb_var13_1y3,delta_imp_aport_var17_1y3,imp_venta_var44_hace3,imp_trans_var37_ult1,num_aport_var13_hace3,ind_var25_0,imp_aport_var33_ult1,num_var20_0,var36,saldo_var8,num_var40,imp_op_var41_ult1,num_var44_0,num_med_var45_ult3,saldo_medio_var29_hace3,num_var20,num_var24_0,saldo_medio_var17_ult3,num_var45_ult1,ind_var25_cte,num_aport_var17_ult1,num_var25,imp_amort_var34_ult1,ind_var12_0,saldo_medio_var8_ult3,num_var31,ind_var14_0,num_op_var39_ult3,imp_trasp_var33_out_ult1,delta_imp_aport_var33_1y3,num_meses_var5_ult3,imp_op_var39_comer_ult3,saldo_medio_var44_hace2,delta_num_aport_var13_1y3,imp_compra_var44_ult1,num_var45_hace2,delta_imp_reemb_var17_1y3,saldo_var44,num_var32_0,saldo_var13_corto,saldo_var29,num_var37_med_ult2,saldo_medio_var44_hace3,saldo_var40,saldo_var14,num_op_var39_comer_ult3,imp_ent_var16_ult1,num_var17_0,num_var30_0,saldo_medio_var29_ult1,num_sal_var16_ult1,saldo_var26,num_trasp_var11_ult1,num_var12_0,imp_trasp_var17_in_ult1,num_var14_0,delta_imp_venta_var44_1y3,num_venta_var44_ult1,delta_imp_trasp_var17_in_1y3,imp_var7_recib_ult1,imp_op_var40_comer_ult1,num_var43_recib_ult1,saldo_medio_var8_hace3,num_var22_hace2,num_var33_0,num_op_var40_hace2,saldo_medio_var5_hace2,imp_sal_var16_ult1,var3,imp_reemb_var17_hace3,saldo_var13_largo,saldo_var33,imp_op_var40_efect_ult3,num_aport_var13_ult1,saldo_medio_var12_ult3,num_var30,num_var40_0,num_op_var39_comer_ult1,num_meses_var12_ult3,ind_var5_0,imp_aport_var13_hace3,num_var42_0,delta_imp_trasp_var17_out_1y3,num_op_var40_hace3,ind_var8,imp_compra_var44_hace3,saldo_medio_var8_hace2,saldo_medio_var44_ult3,delta_num_compra_var44_1y3,num_aport_var17_hace3,ind_var5,num_var35,imp_trasp_var17_out_ult1,saldo_medio_var33_hace2,num_op_var41_efect_ult1,delta_imp_trasp_var33_in_1y3,num_op_var40_efect_ult3,imp_op_var40_ult1,saldo_medio_var5_ult3,saldo_medio_var12_hace3,imp_var43_emit_ult1,num_op_var39_hace2,num_var31_0,num_var8_0,imp_op_var41_comer_ult1,num_meses_var13_largo_ult3,num_var22_ult1,delta_imp_amort_var18_1y3,imp_op_var39_efect_ult3,ind_var13_largo_0,ind_var10cte_ult1,saldo_medio_var13_largo_hace3,num_op_var40_ult1,ID,num_op_var40_comer_ult1,saldo_medio_var12_hace2,num_var12,ind_var43_recib_ult1,num_var22_hace3,num_var45_hace3,num_var13,imp_trasp_var33_in_ult1,num_ent_var16_ult1,saldo_var13,imp_trasp_var33_in_hace3,saldo_medio_var13_largo_ult3,saldo_medio_var29_hace2,num_var39_0,imp_op_var40_efect_ult1,imp_aport_var33_hace3,saldo_var5,ind_var39_0,ind_var10_ult1,ind_var37_cte,num_op_var39_ult1,imp_reemb_var13_ult1,saldo_medio_var5_hace3,num_var14,saldo_var37,imp_reemb_var17_ult1,num_op_var40_comer_ult3,num_meses_var8_ult3,saldo_medio_var33_hace3,imp_trasp_var17_in_hace3,ind_var30,saldo_var20,num_op_var41_hace3,num_var7_recib_ult1,var15,saldo_var32,var38,ind_var43_emit_ult1,num_var43_emit_ult1,num_compra_var44_ult1,num_var22_ult3,ind_var37,var21,num_var42,num_var17,imp_aport_var13_ult1,delta_imp_amort_var34_1y3,imp_op_var39_efect_ult1,num_med_var22_ult3,saldo_medio_var8_ult1,imp_aport_var17_ult1,num_op_var40_efect_ult1,saldo_medio_var13_largo_hace2,num_var44,imp_op_var40_comer_ult3,saldo_medio_var5_ult1,TARGET
0,0,0,0.00,0,0,0,0.00,0,0.00,0.00,2,0.00,3,0.00,0,0.00,0.00,0.00,0,0,0,0,99,0.00,0,0.00,0,0,0.00,0,0,0.00,0,0,0,0,0.00,0,0.00,0,0,0,0,0.00,0,0.00,0.00,0.00,0.00,0,0,0.00,0,0.00,0.00,0,0.00,0.00,0.00,0,0.00,0,3,0.00,0,0.00,0,0,0.00,0,0.00,0,0,0.00,0.00,0,0.00,0,0,0,0.00,0.00,2,0.00,0.00,0.00,0.00,0,0.00,0,0,0,0,1,0.00,3,0,0,0,0.00,0.00,0.00,0.00,0,0,0,0.00,0.00,0,0,0,0.00,0.00,0.00,0.00,0,0,0,0.00,0,0,0,0.00,0,0,0.00,0,1,0,0.00,0,0,0,0,0,0.00,0,0.00,0.00,0.00,0.00,3,0.00,0,0.00,1,0,0,0,0.00,0.00,0,0.00,0.00,0,0,0.00,0.00,0,0.00,0,0,23,0.00,39205.17,0,0,0,0,0,0,0,0,0.00,0,0.00,0,0.00,0.00,0,0.00,0,0.00,0.00,0
1,1,0,0.00,0,0,0,0.00,0,0.00,300.00,2,300.00,3,122.22,0,0.00,0.00,0.00,3,0,0,0,3,0.00,0,0.00,0,0,0.00,0,0,0.00,0,0,0,0,0.00,0,0.00,0,0,0,0,0.00,1,0.00,0.00,-1.00,0.00,0,0,0.00,0,300.00,0.00,0,0.00,0.00,0.00,0,0.00,0,6,0.00,0,0.00,0,0,0.00,0,0.00,0,0,0.00,0.00,0,0.00,0,0,0,0.00,0.00,2,0.00,0.00,0.00,0.00,0,0.00,3,0,0,0,1,300.00,3,0,0,0,0.00,0.00,0.00,0.00,0,0,3,0.00,0.00,0,0,0,0.00,0.00,0.00,0.00,0,0,0,0.00,0,0,0,0.00,0,0,0.00,0,3,0,0.00,0,0,0,0,3,0.00,0,300.00,0.00,0.00,0.00,3,0.00,0,0.00,1,0,0,0,0.00,88.89,0,0.00,0.00,0,0,0.00,0.00,1,0.00,0,0,34,0.00,49278.03,0,0,0,0,0,0,0,0,0.00,0,0.00,0,0.00,0.00,0,0.00,0,0.00,0.00,0
2,0,0,0.00,0,0,0,0.00,0,0.00,0.00,1,3.00,3,0.00,0,0.00,0.00,0.00,0,0,0,0,99,0.00,0,0.00,0,0,0.00,0,0,0.00,0,0,0,0,0.00,0,0.00,0,0,0,0,0.00,3,0.00,0.00,0.00,0.00,0,0,0.00,0,0.00,0.00,0,0.00,0.00,0.00,0,0.00,0,3,0.00,0,0.00,0,0,0.00,0,0.00,0,0,0.00,0.00,0,0.00,0,0,0,3.00,0.00,2,0.00,0.00,0.00,0.00,0,0.00,3,0,0,0,1,0.00,3,0,0,0,0.00,0.00,0.00,0.00,0,1,3,0.00,0.00,0,0,0,0.00,2.07,0.00,0.00,0,0,0,0.00,0,0,0,0.00,0,0,0.00,0,4,0,0.00,0,0,0,0,0,0.00,0,0.00,0.00,0.00,0.00,3,0.00,0,3.00,1,0,0,0,0.00,0.18,0,0.00,0.00,0,0,0.00,0.00,1,0.00,0,0,23,0.00,67333.77,0,0,0,0,0,0,3,0,0.00,0,0.00,0,0.00,0.00,0,0.00,0,0.00,3.00,0
3,0,0,0.00,0,0,6,0.00,0,0.00,0.00,1,70.62,3,0.00,0,0.00,0.00,0.00,0,0,0,0,2,0.00,0,195.00,0,15,0.00,0,0,0.00,18,0,0,0,0.00,0,0.00,0,0,9,0,0.00,2,195.00,0.00,0.00,0.00,27,0,0.00,0,0.00,0.00,6,0.00,0.00,0.00,9,0.00,0,3,0.00,0,0.00,0,0,0.00,0,0.00,0,0,0.00,0.00,0,0.00,3,0,0,186.09,0.00,2,0.00,0.00,0.00,0.00,0,0.00,3,0,9,0,1,0.00,3,0,0,0,0.00,0.00,0.00,0.00,0,1,9,0.00,0.00,0,0,0,0.00,138.84,0.00,0.00,0,0,0,195.00,0,0,0,0.00,0,1,0.00,0,8,0,0.00,0,0,0,3,0,0.00,0,0.00,0.00,0.00,0.00,3,0.00,0,70.62,1,1,1,9,0.00,0.00,0,34.95,0.00,0,0,0.00,0.00,1,0.00,0,0,37,0.00,64007.97,0,0,0,3,1,0,3,0,0.00,0,0.00,0,0.00,0.00,0,0.00,0,0.00,91.56,0
4,0,3,0.00,0,0,0,0.00,0,0.00,0.00,2,135003.00,3,0.00,0,0.00,0.00,270003.00,0,0,0,0,1,0.00,0,0.00,0,0,0.00,0,3,0.00,0,0,0,0,0.00,1,0.00,0,0,0,0,0.00,3,0.00,0.00,0.00,0.00,0,0,0.00,0,0.00,0.00,0,0.00,0.00,0.00,0,0.00,0,6,0.00,0,0.00,3,3,0.00,0,0.00,0,0,0.00,0.00,6,0.00,3,0,0,3.00,0.00,2,0.00,0.00,0.00,0.00,0,85501.89,3,0,0,1,1,0.00,6,0,0,0,0.00,0.00,0.00,0.00,0,0,3,0.00,0.00,0,0,0,0.00,13501.47,0.00,135003.00,0,0,0,0.00,0,6,0,0.00,0,0,0.00,0,10,0,0.00,3,1,0,0,0,0.00,0,0.00,0.00,0.00,0.00,3,0.00,0,0.00,1,0,0,0,0.00,0.30,0,0.00,0.00,0,0,0.00,0.00,1,0.00,0,0,39,0.00,117310.98,1,3,0,9,0,0,3,0,0.00,0,0.00,3,0.00,0.00,0,0.00,0,0.00,40501.08,0
...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...,...
76015,0,0,0.00,0,0,0,0.00,0,0.00,0.00,2,0.00,3,0.00,0,0.00,0.00,0.00,0,0,0,0,99,0.00,0,0.00,0,0,0.00,0,0,0.00,0,0,0,0,0.00,0,0.00,0,0,0,0,0.00,0,0.00,0.00,0.00,0.00,0,0,0.00,0,0.00,0.00,0,0.00,0.00,0.00,0,0.00,0,3,0.00,0,0.00,0,0,0.00,0,0.00,0,0,0.00,0.00,0,0.00,0,0,0,0.00,0.00,2,0.00,0.00,0.00,0.00,0,0.00,0,0,0,0,1,0.00,3,0,0,0,0.00,0.00,0.00,0.00,0,0,0,0.00,0.00,0,0,0,0.00,0.00,0.00,0.00,0,0,0,0.00,0,0,0,0.00,0,0,0.00,0,151829,0,0.00,0,0,0,0,0,0.00,0,0.00,0.00,0.00,0.00,3,0.00,0,0.00,1,0,0,0,0.00,0.00,0,0.00,0.00,0,0,0.00,0.00,0,0.00,0,0,48,0.00,60926.49,0,0,0,0,0,0,0,0,0.00,0,0.00,0,0.00,0.00,0,0.00,0,0.00,0.00,0
76016,0,3,0.00,0,0,0,0.00,0,0.00,0.00,2,48191.22,3,0.00,0,0.00,0.00,0.00,0,0,0,0,2,0.00,0,0.00,0,15,0.00,0,3,0.00,9,0,0,0,0.00,1,0.00,0,0,0,0,0.00,1,0.00,0.00,0.00,0.00,36,0,0.00,0,0.00,0.00,0,0.00,0.00,0.00,0,0.00,0,6,0.00,0,0.00,0,3,0.00,0,0.00,0,0,0.00,0.00,0,0.00,24,0,0,130.65,0.00,2,0.00,0.00,0.00,0.00,0,31837.02,3,0,0,2,1,0.00,6,0,0,0,0.00,0.00,0.00,0.00,0,0,3,0.00,0.00,0,0,0,0.00,0.00,0.00,0.00,0,0,0,0.00,0,0,0,0.00,0,0,0.00,0,151830,0,15498.42,3,0,0,3,0,0.00,0,0.00,0.00,0.00,0.00,3,0.00,0,0.00,1,0,0,0,0.00,0.00,0,0.00,0.00,0,0,0.00,0.00,1,0.00,0,0,39,0.00,118634.52,0,0,0,24,0,0,3,0,0.00,0,0.00,6,0.00,0.00,0,0.00,0,0.00,0.00,0
76017,0,0,0.00,0,0,0,0.00,0,0.00,0.00,1,3.00,3,0.00,0,0.00,0.00,0.00,0,0,0,0,99,0.00,0,0.00,0,0,0.00,0,0,0.00,0,0,0,0,0.00,0,0.00,0,0,0,0,0.00,2,0.00,0.00,0.00,0.00,0,0,0.00,0,0.00,0.00,0,0.00,0.00,0.00,0,0.00,0,3,0.00,0,0.00,0,0,0.00,0,0.00,0,0,0.00,0.00,0,0.00,0,0,0,3.00,0.00,2,0.00,0.00,0.00,0.00,0,0.00,3,0,0,0,1,0.00,3,0,0,0,0.00,0.00,0.00,0.00,0,1,3,0.00,0.00,0,0,0,0.00,3.00,0.00,0.00,0,0,0,0.00,0,0,0,0.00,0,0,0.00,0,151835,0,0.00,0,0,0,0,0,0.00,0,0.00,0.00,0.00,0.00,3,0.00,0,3.00,1,0,0,0,0.00,0.00,0,0.00,0.00,0,0,0.00,0.00,1,0.00,0,0,23,0.00,74028.15,0,0,0,0,0,0,3,0,0.00,0,0.00,0,0.00,0.00,0,0.00,0,0.00,3.00,0
76018,0,0,0.00,0,0,0,0.00,0,0.00,0.00,2,3.00,3,0.00,0,0.00,0.00,0.00,0,0,0,0,99,0.00,0,0.00,0,0,0.00,0,0,0.00,0,0,0,0,0.00,0,0.00,0,0,0,0,0.00,3,0.00,0.00,0.00,0.00,0,0,0.00,0,0.00,0.00,0,0.00,0.00,0.00,0,0.00,0,3,0.00,0,0.00,0,0,0.00,0,0.00,0,0,0.00,0.00,0,0.00,0,0,0,3.00,0.00,2,0.00,0.00,0.00,0.00,0,0.00,3,0,0,0,1,0.00,3,0,0,0,0.00,0.00,0.00,0.00,0,1,3,0.00,0.00,0,0,0,0.00,2.58,0.00,0.00,0,0,0,0.00,0,0,0,0.00,0,0,0.00,0,151836,0,0.00,0,0,0,0,0,0.00,0,0.00,0.00,0.00,0.00,3,0.00,0,3.00,1,0,0,0,0.00,1.74,0,0.00,0.00,0,0,0.00,0.00,1,0.00,0,0,25,0.00,84278.16,0,0,0,0,0,0,3,0,0.00,0,0.00,0,0.00,0.00,0,0.00,0,0.00,3.00,0
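
Assigning a Series to a DataFrame column aligns on index labels rather than row position. This matters when rows have been removed during cleaning. How `y_train` was produced is not shown in this section, so the sketch below uses toy data to illustrate the alignment and a simple sanity check that could be run after attaching the target.

```python
import pandas as pd

features = pd.DataFrame({"f": [10, 20, 20, 30]})
target = pd.Series([0, 1, 1, 0], name="TARGET")

# Dropping duplicate rows keeps the original index labels (0, 1 and 3 here).
deduped = features.drop_duplicates()

with_target = deduped.copy()
with_target["TARGET"] = target   # matched by index label, not by position

# Fail fast if any row did not find a matching label in the target.
assert with_target["TARGET"].notna().all()
assert len(with_target) == len(deduped)

print(with_target)
```

If the two indexes did not match (for example after a `reset_index` on only one of them), the unmatched rows would receive `NaN` and the first assertion would fail.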


In [12]:
# Save the cleaned data; the training file includes the TARGET column, the test file does not
train_data_with_target.to_csv(paths.get_processed_data_path("train_cleaned.csv"), index=False)
df_test.to_csv(paths.get_processed_data_path("test_cleaned.csv"), index=False)
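
A quick round-trip check can confirm that the saved files have the expected layout. This is a minimal sketch using a temporary directory and toy frames; the paths and column values are hypothetical and stand in for whatever `paths.get_processed_data_path` returns.

```python
import tempfile
from pathlib import Path

import pandas as pd

# Toy stand-ins for the cleaned train and test frames.
train_out = pd.DataFrame({"ID": [1, 2], "var15": [23, 34], "TARGET": [0, 1]})
test_out = pd.DataFrame({"ID": [3], "var15": [40]})

with tempfile.TemporaryDirectory() as tmp:
    train_path = Path(tmp) / "train_cleaned.csv"   # hypothetical location
    test_path = Path(tmp) / "test_cleaned.csv"     # hypothetical location
    train_out.to_csv(train_path, index=False)
    test_out.to_csv(test_path, index=False)

    train_back = pd.read_csv(train_path)
    test_back = pd.read_csv(test_path)

# The only column the training file should have beyond the test file is TARGET.
extra_in_train = [c for c in train_back.columns if c not in test_back.columns]
print(extra_in_train, train_back.shape, test_back.shape)
```

Writing with `index=False` avoids an extra unnamed index column appearing when the files are read back.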


<div class="alert" style="background-color:#FFFFFF; border-left: 8px solid #5ad197; padding: 14px; border-radius: 8px; font-size: 14px; color: #000000;">

<div class="alert alert-success">

### **Conclusion**
</div>

Initial preprocessing is complete. The key steps were:

- Removed constant columns and duplicate rows to eliminate redundancy.
- Kept only the features shared by the training and test sets.
- Dropped 97 low-variance features (variance threshold of 0.01), which are unlikely to contribute much to model learning.
- Confirmed that the cleaned training set contains no null values.
- Removed features with an absolute pairwise correlation above 0.95 to reduce multicollinearity, leaving 184 columns in the training set.

The cleaned datasets have been saved and are ready for feature engineering, exploratory data analysis (EDA), and model development.

</div>
