Skip to content
Predict whether or not a patient will show up to their next appointment using automated feature engineering
Branch: master
Clone or download
kmax12 Merge pull request #2 from bukosabino/master
update Featuretools v0.6.0 prerelease
Latest commit 51462a8 Feb 4, 2019
Type Name Latest commit message Commit time
Failed to load latest commit information.
data add data Oct 19, 2018
.gitignore add data Oct 19, 2018
LICENSE Add files via upload Dec 11, 2018 Update Feb 2, 2019
Tutorial.ipynb showing entityset Feb 1, 2019
requirements.txt showing entityset Feb 1, 2019 update pandas dtypes Oct 19, 2018

Using Featuretools to Predict Missed Appointments


In this tutorial, we show how Featuretools can be used to predict whether or not a patient will show up to a scheduled appointment using a dataset from Kaggle. We make all of the features from the most popular kernel on kaggle, and make some other interesting features automatically.

The Tutorial notebook from this repository exists on Kaggle. If you would prefer to work in that environment, you can fork the existing kernel to use as a starting point.


  • We generate interesting aggregations by age and location automatically.
  • We use a secondary time index to generate features from the no-show column without leaking invalid information.

Running the tutorial

If you would like to work on Kaggle, the Tutorial notebook has been uploaded as a kernel. You can fork that notebook to use as a starting point. If you prefer to work locally:

  1. Clone the repo

    git clone
  2. Install the requirements

    pip install -r requirements.txt

    You will also need to install graphviz for this demo. Please install graphviz according to the instructions in the Featuretools Documentation

  3. Download the data

    You can download the data from Kaggle or create a kernel and use Featuretools there. After downloading, save the CSV to a directory called data in the root of this repository.

  4. Run the Tutorial using Jupyter

    jupyter notebook

Feature Labs


Featuretools is an open source project created by Feature Labs. To see the other open source projects we're working on visit Feature Labs Open Source. If building impactful data science pipelines is important to you or your business, please get in touch.


Any questions can be directed to

You can’t perform that action at this time.