## Lending Club Credit Risk Modeling - Modified Version
- In this project, I will build three **machine learning** models, one for each component of expected loss in **credit risk modeling** at **Lending Club** (a peer-to-peer lending company): **Probability of Default (PD), Exposure at Default (EAD) and Loss Given Default (LGD)**. The expected loss is the product of these three components: **Expected Loss (EL) = PD × EAD × LGD**.
- PD modelling is an imbalanced binary classification problem in which the target is **0 for non-default and 1 for default** (the minority class). A Logistic Regression model will be built for it.
- LGD and EAD modelling are both regression problems, with recovery rate and credit conversion factor as their respective dependent variables.
- The **solution pipeline** follows the **CRISP-DM** framework.
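
To make the expected loss formula concrete, here is a minimal sketch of how the three model outputs could be combined per loan. It rests on conventions not stated above: LGD is taken as 1 minus the recovery rate, and EAD as the credit conversion factor times the funded amount. The function name `expected_loss` and all input values are hypothetical.

```python
import numpy as np

def expected_loss(pd_hat, recovery_rate, ccf, funded_amount):
    """Per-loan expected loss: EL = PD * EAD * LGD."""
    pd_hat = np.asarray(pd_hat, dtype=float)
    # Assumption: LGD = 1 - recovery rate
    lgd = 1.0 - np.asarray(recovery_rate, dtype=float)
    # Assumption: EAD = credit conversion factor * funded amount
    ead = np.asarray(ccf, dtype=float) * np.asarray(funded_amount, dtype=float)
    return pd_hat * ead * lgd

# Hypothetical predictions for two loans (illustrative values only)
el = expected_loss(
    pd_hat=[0.10, 0.02],
    recovery_rate=[0.20, 0.50],
    ccf=[0.80, 0.60],
    funded_amount=[10_000, 5_000],
)
print(el, el.sum())
```

Summing the per-loan values gives a portfolio-level expected loss, which can be compared against the total funded amount.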

### Import Libraries

In [None]:
# Data manipulation and visualization
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns

# Modeling
from sklearn.pipeline import Pipeline
from sklearn.linear_model import LogisticRegression
from sklearn.preprocessing import StandardScaler
from sklearn.metrics import roc_auc_score, roc_curve
import statsmodels.api as sm

# Utils
from src.eda_utils import *
from src.modelling_utils import *

# Print the interpreter version (this notebook targets Python 3.10)
import sys
print(f"Python version: {sys.version}")
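
As a rough illustration of how the modeling imports above might fit together for the PD model, the sketch below trains a scaled logistic regression on synthetic imbalanced data and scores it with ROC AUC. The synthetic dataset, the 10% default rate, the train/test split, and the `class_weight="balanced"` setting are all assumptions made for illustration, not the project's actual data or configuration.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for loan data: roughly 10% defaults (class 1)
X, y = make_classification(
    n_samples=2000, n_features=8, n_informative=4,
    weights=[0.9, 0.1], random_state=42,
)

# Stratify so both splits keep the same default rate
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=42
)

# Assumption: reweight classes to compensate for the imbalance
pd_model = Pipeline([
    ("scaler", StandardScaler()),
    ("clf", LogisticRegression(class_weight="balanced", max_iter=1000)),
])
pd_model.fit(X_train, y_train)

# Predicted probability of default (class 1) for each test loan
pd_hat = pd_model.predict_proba(X_test)[:, 1]
auc = roc_auc_score(y_test, pd_hat)
print(f"ROC AUC: {auc:.3f}")
```

ROC AUC is used here because accuracy is misleading when defaults are rare. Note that class reweighting shifts the predicted probabilities, so they would need recalibration before being used as PD estimates in the expected loss formula.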