Skip to content
Benchmarks for Kaggle's Predict Closed Questions on Stack Overflow competition
Python
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Failed to load latest commit information.
.gitignore
LICENSE
README.md
basic_benchmark.py
competition_utilities.py
features.py
prior_benchmark.py
sample_train.py
split_train.py
uniform_benchmark.py

README.md

Benchmarks for Kaggle's Predict Closed Questions on Stack Overflow competition

The benchmarks require several Python packages:

  • numpy
  • pandas
  • sklearn

These packages can be installed with easy_install or pip, or Windows users can download compiled versions of these packages

To run the benchmarks, you also need to download the data. The only files necessary for the benchmarks are train-sample.csv and public_leaderboard.csv. Two variables need to be updated in competition_utilities.py as well: data_path should be set to the path to the data, and submissions_path should be set to the location for writing the submission files.

Something went wrong with that request. Please try again.