Predict remaining useful life of a component based on historical sensor observations using automated feature engineering
Switch branches/tags
Nothing to show
Clone or download
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
.gitignore added cyber demo Mar 23, 2018
Advanced Featuretools RUL.ipynb Run notebooks in python 3 Jul 25, 2018
README.md Use NASA data directly Mar 30, 2018
Simple Featuretools RUL Demo.ipynb Run notebooks in python 3 Jul 25, 2018
utils.py Run notebooks in python 3 Jul 25, 2018

README.md

Predicting Remaining Useful Life

Featuretools NASA

The general setup for the problem is a common one: we have a single table of sensor observations over time. Now that collecting information is easier than ever, most industries have already generated time-series type problems by the way that they store data. As such, it is crucial to be able to handle data in this form. Thankfully, built-in functionality from Featuretools handles time varying data well.

We'll demonstrate an end-to-end workflow using a Turbofan Engine Degradation Simulation Data Set from NASA. This notebook demonstrates a rapid way to predict the Remaining Useful Life (RUL) of an engine using an initial dataframe of time-series data. There are three sections of the notebook:

  1. Understand the Data
  2. Generate features
  3. Make predictions with Machine Learning

If you're running this notebook yourself, note that the Challenge Dataset will be downloaded into the data folder in this repository. If you'd prefer to download the data yourself, download and unzip the file from https://ti.arc.nasa.gov/c/13/.

Highlights

  • Quickly make end-to-end workflow using time-series data
  • Find interesting automatically generated features
  • An advanced notebook using custom primitives and hyperparameter tuning

Demonstration

The main notebook can be found here. To run that notebook, you will need to download Featuretools with

pip install featuretools

and the turbofan data from NASA. The function load_data in utils.py takes the path to the text file and returns a pandas dataframe. With the notebook as written, we expect the path to the train data first.

Feature Labs

Featuretools

Featuretools was created by the developers at Feature Labs. If building impactful data science pipelines is important to you or your business, please get in touch.