Skip to content
Predict whether a loan will be repaid using automated feature engineering.
Jupyter Notebook
Branch: master
Clone or download
Type Name Latest commit message Commit time
Failed to load latest commit information.
.gitignore rework file structure Dec 17, 2018
Automated Loan Repayment.ipynb Update Automated Loan Repayment.ipynb Aug 26, 2019
LICENSE added license Dec 17, 2018 Update Feb 2, 2019
requirements.txt Update requirements.txt Aug 26, 2019

Predicting whether an applicant is capable of repaying a loan


As a bank decides which applicants to provide loans, they may wish to predict if the applicant will default on the loan. Through automated feature engineering, we can identify the predictive patterns in the financial data that can be used to ensure that clients capable of repayment are not rejected.

In this tutorial, we show how Featuretools can be used to perform feature engineering on a multi-table dataset of 300 thousand applicant financial information provided by Home Credit to train an accurate machine learning model to predict what if an applicant will repay a loan.


  • We automatically generate 1820 features using Deep Feature Synthesis.
  • We are able to generate features, check that we are content with those features, and create the feature matrix.
  • We develop are able to generate features in 1 hour vs 10 hours with manual feature engineering.

Running the tutorial

  1. Clone the repo

    git clone
  2. Install the requirements

    pip install -r requirements.txt

    You will also need to install graphviz for this demo. Please install graphviz according to the instructions in the Featuretools Documentation

  3. Download the data

    You can download the data from Kaggle. After downloading, save the CSV to a directory called input in the root of this repository.

  4. Run the Tutorial notebook:
    Automated Loan Repayment

    jupyter notebook

Feature Labs


Featuretools is an open source project created by Feature Labs. To see the other open source projects we're working on visit Feature Labs Open Source. If building impactful data science pipelines is important to you or your business, please get in touch.


Any questions can be directed to

You can’t perform that action at this time.