Benchmark and sample code for the Cause Effect Pairs Challenge on Kaggle

Cause Effect Pairs Challenge

This repo contains a benchmark and sample code in Python for the Cause Effect Pairs Challenge, a machine learning challenge hosted by Kaggle and organized by ChaLearn.

This version of the repo contains the Basic Python Benchmark. Future benchmarks may be included here as well and will be marked with git tags.

Executing this benchmark requires Python 2.7 along with the following packages:

  • pandas (tested with version 0.10.1)
  • sklearn (tested with version 0.13)
  • numpy (tested with version 1.6.2)
  • scipy (tested with version 0.10.)
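Before running the benchmark, it can help to confirm which of these packages are importable and at what version. The following is a minimal sketch, not part of the repo: the helper name `installed_versions` is hypothetical, and it only reads each module's `__version__` attribute without comparing it to the tested versions listed above.

```python
import importlib

# Package names as listed in this README.
REQUIRED_PACKAGES = ["pandas", "sklearn", "numpy", "scipy"]


def installed_versions(names):
    """Map each package name to its version string, or None if it cannot be imported."""
    versions = {}
    for name in names:
        try:
            module = importlib.import_module(name)
        except ImportError:
            versions[name] = None
            continue
        # Some modules do not expose __version__; report that as "unknown".
        versions[name] = getattr(module, "__version__", "unknown")
    return versions


if __name__ == "__main__":
    for name, version in sorted(installed_versions(REQUIRED_PACKAGES).items()):
        print("%-8s %s" % (name, version if version else "not installed"))
```

A package reported as "not installed" needs to be installed before training; a version that differs from the tested one may still work, but this has not been verified here.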

To run the benchmark,

  1. Download the data
  2. Modify SETTINGS.json to point to the training and validation data on your system, and to the locations where the trained model and the submission should be saved
  3. Train the model by running python
  4. Make predictions on the validation set by running python
  5. Make a submission with the output file
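Step 2 is the one most likely to go wrong, so a quick check of SETTINGS.json before training may save time. The sketch below is an illustration only: the key names (`train_path`, `valid_path`, `model_path`, `submission_path`) are hypothetical placeholders, and the real keys are whatever the SETTINGS.json shipped with the repo uses.

```python
import json
import os

# Hypothetical key names for illustration; replace them with the keys
# that actually appear in the repo's SETTINGS.json.
REQUIRED_KEYS = ["train_path", "valid_path", "model_path", "submission_path"]
INPUT_KEYS = ["train_path", "valid_path"]


def load_settings(path="SETTINGS.json"):
    """Read the settings file and return its contents as a dict."""
    with open(path) as f:
        return json.load(f)


def missing_keys(settings, required=REQUIRED_KEYS):
    """Return the required keys that are absent from the settings."""
    return [key for key in required if key not in settings]


def missing_input_files(settings, input_keys=INPUT_KEYS):
    """Return configured input paths that do not exist on disk."""
    return [
        settings[key]
        for key in input_keys
        if key in settings and not os.path.exists(settings[key])
    ]
```

Calling `missing_keys(load_settings())` and `missing_input_files(load_settings())` should both return empty lists when the file is filled in and the data has been downloaded. Only the input paths are checked for existence, since the model and submission files are created by the later steps.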

This benchmark took less than 10 minutes to execute on a Windows 8 laptop with 8GB of RAM and 4 cores at 2.7GHz.