Kaggle-Competition-Tools

A set of tools I developed while doing Kaggle Competitions. Hopefully others will get some use out of this. Some of these are probably outdated and in need of fixing, and most are pretty messy.

Model management

Accountant

Accountant is a lightweight tool for managing and keeping track of models and their performance. It allows the user to descibe a model in a descriptor file with few lines of python, then a training program can call accountant with that file to start training using the specification. Additionally flags like "evaluate" can be passed such to accountant and potentially affect program in the descriptor file. Usually I heavily modify accountant to fit the characteristics of the competition.

Models

Bayesian Optimization for LightGBM

Bayesian Optimization finds good hyperparameters quite effectively. Here I have it implemented for LightGBM.

Random Forest Embeddings with LightGBM

Uses the leafs of a random forest as features. Trained by having a random forest predict a completely random objective.

Multithreaded Approximate Nearest Neighbors

Uses Spotify's Annoy library to do Approximate Nearest Neighbors, but is uses Multiprocessing to get a speed up.

Gradient Boosting Additive Models With Pairwise Interactions

From my Interpretable Machine Learning repository. This model looks no further than pairwise interactions, and is more of a tool to find useful patterns in data then getting strong performance.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
model_management		model_management
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

model_management

model_management

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

Repository files navigation

Kaggle-Competition-Tools

Model management

Accountant

Models

Bayesian Optimization for LightGBM

Random Forest Embeddings with LightGBM

Multithreaded Approximate Nearest Neighbors

Gradient Boosting Additive Models With Pairwise Interactions

About

Releases

Packages

Languages

License

thohoff/Kaggle-Competition-Tools

Folders and files

Latest commit

History

Repository files navigation

Kaggle-Competition-Tools

Model management

Accountant

Models

Bayesian Optimization for LightGBM

Random Forest Embeddings with LightGBM

Multithreaded Approximate Nearest Neighbors

Gradient Boosting Additive Models With Pairwise Interactions

About

Resources

License

Stars

Watchers

Forks

Languages