Predicting a customer's next purchase using automated feature engineering

As customers use your product, they leave behind a trail of behaviors that indicate how they will act in the future. Through automated feature engineering we can identify the predictive patterns in granular customer behavioral data that can be used to improve the customer's experience and generate additional revenue for your business.

In this tutorial, we show how Featuretools can be used to perform feature engineering on a multi-table dataset of 3 million online grocery orders provided by Instacart to train an accurate machine learning model to predict what product a customer buys next.

Note: If you are running this notebook yourself, refer to the read me on Github for instructions to download the Instacart dataset

Highlights

We automatically generate 150+ features using Deep Feature Synthesis and select the 20 most important features for predictive modeling
We build a pipeline that it can be reused for numerous prediction problems (you can try this yourself!)
We quickly develop a model on a subset of the data and validate on the entire dataset in a scalable manner using Dask.

Read the tutorial

Link to notebook: Tutorial

Running the tutorial

Clone the repo

git clone https://github.com/Featuretools/predict_next_purchase.git

Install the requirements

pip install -r requirements.txt

Download the data

You can download the data directly from Instacart here.

After downloading the data save the CSVs to a directory called data in the root of this repository. Then run the following command in your terminal from the root of this repo.

>> python process_data.py
 70%|██████████████████████████▌           | 145/207 [07:43<03:18,  3.20s/it]

Expect this command to take up to 20 minutes to run as it prepares the data for the tutorial notebook

Feature Labs

Featuretools was created by the developers at Feature Labs. If building impactful data science pipelines is important to you or your business, please get in touch.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
Tutorial.ipynb		Tutorial.ipynb
process_data.py		process_data.py
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Predicting a customer's next purchase using automated feature engineering

Highlights

Read the tutorial

Running the tutorial

Feature Labs

About

Releases

Packages

Languages

License

lfpelison/predict-next-purchase

Folders and files

Latest commit

History

Repository files navigation

Predicting a customer's next purchase using automated feature engineering

Highlights

Read the tutorial

Running the tutorial

Feature Labs

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages