Part of the Misk-Udacity data engineering sessions activities. The task here is to create an etl pipeline that will prepare data to be used to train a machine learning model.
The dataset we'll be using contains real messages that were sent during disaster events.