AML Group 22 Project

Background and Context

The ability to predict loan approval outcomes and assess financial risk is crucial for banks and financial institutions to make informed lending decisions. Creditworthiness and financial stability are key factors that influence loan approvals, and accurate predictions in these areas can reduce default rates, streamline lending processes, and enhance financial inclusion.

This project aims to explore several machine learning models that can predict the binary outcome of loan approval based on a range of demographic, financial, and historical data. If time permits, we will also work on predicting the risk score using regression.

Dataset Description

The dataset used for this project is a synthetic dataset designed for risk assessment and loan approval modeling, sourced from Kaggle:
Financial Risk for Loan Approval Dataset

It contains 20,000 records with 36 attributes related to demographic information, credit history, income levels, existing debt, and financial stability. Key features include:

Age
Credit score
Employment status
Loan amount
Debt-to-income ratio
Previous loan defaults, and more.

This dataset provides a comprehensive foundation for building models that predict Loan Approval Status (a binary classification problem).

Contributions

data exploration: Ruoqi Yan, Kaushal Damania
Data preprocessing: Naveen Reddy Dyava, Kaushal Damania
Model Training: Naveen Reddy Dyava
Model Evaluation: Naveen Reddy Dyava, Kaushal Damania
Model Interpretation: Kaushal Damania
Report: Naveen Reddy Dyava, Kaushal Damania

Proposed Machine Learning Models

The following machine learning models are proposed for this project:

Logistic
CatBoost
AdaBoost
LightGBM
MLP

Running the project

To run the project please follow the following instructions:

# create a python environment
python3.10 -m venv .venv

# Install the required packages
pip install -r requirements.txt

# activate the environment
source .venv/bin/activate

You can now run any notebook in the project

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.ipynb_checkpoints		.ipynb_checkpoints
catboost_info		catboost_info
AdaBoostClassifier.ipynb		AdaBoostClassifier.ipynb
CatBoostClassifier.ipynb		CatBoostClassifier.ipynb
Data Processing.ipynb		Data Processing.ipynb
Data_Exploration.ipynb		Data_Exploration.ipynb
Data_Exploration_2.ipynb		Data_Exploration_2.ipynb
DecisionTreeClassifier.ipynb		DecisionTreeClassifier.ipynb
ExtraTreesClassifier.ipynb		ExtraTreesClassifier.ipynb
GradientBostingClassifier.ipynb		GradientBostingClassifier.ipynb
HistGradientBoostingClassifier.ipynb		HistGradientBoostingClassifier.ipynb
LightGBMClassifier.ipynb		LightGBMClassifier.ipynb
Loan.csv		Loan.csv
LogisticRegression.ipynb		LogisticRegression.ipynb
MLP.ipynb		MLP.ipynb
README.md		README.md
RandomForestClassifier.ipynb		RandomForestClassifier.ipynb
SVM.ipynb		SVM.ipynb
XGBostClassifier.ipynb		XGBostClassifier.ipynb
adaboost_first_tree.pdf		adaboost_first_tree.pdf
lime_explanation_Gradient_Boost.html		lime_explanation_Gradient_Boost.html
lime_explanation_Random_Forest.html		lime_explanation_Random_Forest.html
lime_explanation_XGBoost.html		lime_explanation_XGBoost.html
lime_explanation_adaboost.html		lime_explanation_adaboost.html
lime_explanation_decision.html		lime_explanation_decision.html
lime_explanation_extra.html		lime_explanation_extra.html
lime_explanation_hist.html		lime_explanation_hist.html
lime_explanation_lgbm.html		lime_explanation_lgbm.html
lime_explanation_logistic.html		lime_explanation_logistic.html
lime_explanation_mlp.html		lime_explanation_mlp.html
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AML Group 22 Project

Background and Context

Dataset Description

Contributions

Proposed Machine Learning Models

Running the project

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AML Group 22 Project

Background and Context

Dataset Description

Contributions

Proposed Machine Learning Models

Running the project

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages