Kaggle's Causality Challenge Solution for team FirfiD
Switch branches/tags
Nothing to show
Clone or download
Latest commit cb67e0b Sep 5, 2013
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
Models/Matlab First commit Sep 5, 2013
Optimisation First commit Sep 5, 2013
Paper First commit Sep 5, 2013
features First commit Sep 5, 2013
matlab-code/Code First commit Sep 5, 2013
.gitignore First commit Sep 5, 2013
LICENSE Initial commit Sep 5, 2013
README.md Updated README.md Sep 5, 2013
SETTINGS.json First commit Sep 5, 2013
causal_classifier.py First commit Sep 5, 2013
clean.sh First commit Sep 5, 2013
data_io.py First commit Sep 5, 2013
exploit_symmetries.py First commit Sep 5, 2013
extract_matlab_train.sh First commit Sep 5, 2013
extract_matlab_valid.sh First commit Sep 5, 2013
fe.py First commit Sep 5, 2013
feature_selection.py First commit Sep 5, 2013
kaggle_scores.py First commit Sep 5, 2013
merge_features.py First commit Sep 5, 2013
merge_train_data.py First commit Sep 5, 2013
mp_optimisation.py First commit Sep 5, 2013
natsort.py First commit Sep 5, 2013
predict.py First commit Sep 5, 2013
process_matlab.py First commit Sep 5, 2013
score.py First commit Sep 5, 2013
sto_SOO.py First commit Sep 5, 2013
train.py First commit Sep 5, 2013

README.md

Cause Effect Pairs Challenge FirfiD Submission

Pre-requisites: You need the following installed: python 2.7.1 python sklearn version 0.13.1 python numpy version 1.7.1 python joblib python pandas version >=0.11

Matlab Preferably Debian Based Linux Installation

Kaggle Causality Challenge framework. Mostly based on kaggle's python code code for the challenge.

To train: A. Configure

  1. Put your training data in the following files (or modify file names accordingly):

"train_pairs_path": "./Competition/CEdata_final_train_pairs.csv" "train_info_path": "./Competition/CEdata_final_train_publicinfo.csv" "train_target_path": "./Competition/CEdata_final_train_target.csv"

B. Extracting Features

  1. Modify SETTINGS.json "feature_extraction_threads" to the number of threads your machine can handle.
  2. Run "python fe.py"
  3. Add Matlab features by running "./extract_matlab_valid.sh"
  4. Merge the futures by running "python process_matlab.py -t valid"

C. Train:

  1. Run "python train.py"

To predict:

A. Clean-up

  1. Replace ./Competition/CEfinal_valid*.csv with the respective files you are interested in extracting features from. By default this is set to a minimal subset of valid features.
  2. Run "./clean.sh"

B. Extracting Features

  1. Modify SETTINGS.json "feature_extraction_threads" to the number of threads your machine can handle.
  2. Run "python fe.py"
  3. Add Matlab features by running "./extract_matlab_valid.sh"
  4. Merge the futures by running "python process_matlab.py -t valid"

C. Generating results

  1. Run "python predict.py". The results file should be "./Submisions/firfi-tree-trees.csv".