This Project is based on Prediction of Interest Rates of Loans from the Lending Club Loan Data.
CLAAT Document Link: https://codelabs-preview.appspot.com/?file_id=1rYZCIi7hOFTiRhG1IEAnrO7JBcLSmFqHaxtiy4rvMDk#0
Steps to Regenerate the Project:
- Clone the Project into any directory of your choice
- Create a sub-folder called "Mice Data" inside the Data folder
- Download the Data from https://www.kaggle.com/wendykan/lending-club-loan-data and place it inside the "Original Data" in the Data folder
- Check Folders Under the Code Folder
- Understanding the Data: Notebook to Understand the Data more prominently
- Cleansing, Preprocessing and EDA: Notebooks to Cleanse and Preprocess Data. , Notebook to impute Missing Values using MICE , Notebook to Normalize the Data
- Feature Selection: Notebook to Implement Feature Tools , Notebook to Select Features using LassoCV
- Models: This Folder Contains Sub-Folders with Notebooks to implement Linear Regression, Random Forests, Neural Networks and also implement AutoML using AutoSKLearm, H20.ai and Tpot