Skip to content


Subversion checkout URL

You can clone with
Download ZIP
Contains the code for the model that won Kaggle's Air Quality Prediction Hackathon
Branch: master

Winning Code for the EMC Data Science Global Hackathon (Air Quality Prediction)

Competition page:

Blog post on methodology:

To train and recreate the winning submission (may be slightly different, as the random number generator didn't have a static seed),

  1. Download TrainingData.csv from and put it in this folder
  2. Run make_predictions.m from the Matlab command prompt
  3. Copy the resulting predictions from predictions.csv to the appropriate spreadsheet in SubmissionConversion.xls
  4. Save the submission worksheet as a new CSV file
Something went wrong with that request. Please try again.