GitHub - gaborfodor/MLSP_2013: Winning Model Source Code

MLSP 2013 Bird Classification Challenge - Winning Model

Details about the competition: http://www.kaggle.com/c/mlsp-2013-birds

I have written my model in Python 2.7.3. I used the following libraries:

numpy
pandas
scipy
pickle
scikit-image
wave
matplotlib
scikit-learn

Hardware/OS: Intel Core i5 2500 with 16 GB RAM, Windows 7

Short explanation:

The process is split into four parts:

1_pattern_extraction.py
2a_data_preparation.py
2b_data_preparation_logarithm.py
3_train.py

The first three files contain the data preparation steps and the fourth will train RandomForestRegressors and create the submission files.

1) The first script creates the spectrograms and applies multiple image processing steps to capture interesting patches.
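As a rough illustration of this kind of patch extraction, here is a minimal sketch, not the code from 1_pattern_extraction.py. It assumes a median-based threshold per frequency row and time column, followed by morphological cleanup and connected-component labeling. The function name `extract_patches`, the window sizes, and the parameters `threshold_factor` and `min_area` are all hypothetical choices, and the sketch is written for a current Python 3 / SciPy setup rather than Python 2.7.3.

```python
import numpy as np
from scipy import ndimage, signal


def extract_patches(samples, rate, threshold_factor=3.0, min_area=20):
    """Return the power spectrogram and bounding boxes of its loud regions.

    All parameter values are illustrative assumptions, not the author's settings.
    """
    # Rows are frequency bins, columns are time frames.
    _, _, spec = signal.spectrogram(
        samples, fs=rate, window="hann", nperseg=512, noverlap=384
    )

    # Keep a pixel only if it stands out against both its row and column median.
    row_median = np.median(spec, axis=1, keepdims=True)
    col_median = np.median(spec, axis=0, keepdims=True)
    mask = (spec > threshold_factor * row_median) & (
        spec > threshold_factor * col_median
    )

    # Opening removes isolated speckle; closing then bridges small gaps.
    square = np.ones((3, 3))
    mask = ndimage.binary_opening(mask, structure=square)
    mask = ndimage.binary_closing(mask, structure=square)

    # Each connected region becomes one candidate patch (a pair of slices).
    labels, _ = ndimage.label(mask)
    boxes = []
    for box in ndimage.find_objects(labels):
        height = box[0].stop - box[0].start
        width = box[1].stop - box[1].start
        if height * width >= min_area:
            boxes.append(box)
    return spec, boxes
```

A patch image can then be cut out with `spec[box]` for each returned box.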

2-3) After the patches are extracted, the next step is feature generation using template matching. These are the most time-consuming steps (10-12 hours), but you can run them in parallel.
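To make the template matching step concrete, the sketch below turns each extracted patch into one feature per recording using `match_template` from scikit-image. It assumes that the feature is the maximum of the normalized cross-correlation map; the actual scripts may restrict the search region or compute the feature differently. The function name `template_features` is hypothetical.

```python
import numpy as np
from skimage.feature import match_template


def template_features(spectrogram, patches):
    """One feature per patch: the best normalized cross-correlation score."""
    features = []
    for patch in patches:
        # The response map holds correlation coefficients between -1 and 1;
        # each patch must be no larger than the spectrogram.
        response = match_template(spectrogram, patch)
        features.append(float(response.max()))
    return np.array(features)
```

Because every (recording, patch) pair is scored independently, this kind of computation is easy to split across several processes, which fits the note that the two data preparation scripts can run side by side.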

Finally I merge my features with the provided histogram of segments and location information. During cross-validation, the submission files are exported if the CV AUC is higher than 0.93.
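The cross-validation gate described here could look roughly like the following sketch, which is an assumption about the general shape and not an excerpt from 3_train.py. It uses a multi-output `RandomForestRegressor`, pools the out-of-fold predictions of all species into a single AUC, and writes a file only above the 0.93 threshold. The helper names `cv_auc` and `maybe_export`, the fold count, and the number of trees are hypothetical.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import KFold


def cv_auc(features, labels, n_splits=5, seed=0):
    """Out-of-fold AUC pooled over all recordings and species.

    features: (n_recordings, n_features), e.g. np.hstack of template,
    histogram-of-segments and location columns.
    labels: (n_recordings, n_species) array of 0/1 values.
    """
    oof = np.zeros(labels.shape, dtype=float)
    folds = KFold(n_splits=n_splits, shuffle=True, random_state=seed)
    for train_idx, valid_idx in folds.split(features):
        model = RandomForestRegressor(n_estimators=50, random_state=seed)
        model.fit(features[train_idx], labels[train_idx])
        pred = model.predict(features[valid_idx])
        oof[valid_idx] = pred.reshape(len(valid_idx), -1)
    return roc_auc_score(labels.ravel(), oof.ravel())


def maybe_export(auc, predictions, path, threshold=0.93):
    """Write predictions as CSV only if the CV AUC clears the threshold."""
    if auc > threshold:
        np.savetxt(path, predictions, delimiter=",")
        return True
    return False
```

Gating the export on the CV score keeps weak runs from cluttering the output folder when several settings are tried in a row.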

The current settings should produce submissions with a private leaderboard score of around 0.954. A bit more about my solution can be found here: http://www.kaggle.com/c/mlsp-2013-birds/forums/t/5457/congratulations-to-the-winners/29159#post29159

How to reproduce the results:

Each file contains a 'folder' variable, which must be set manually before running the code. The easiest way is to download and extract the compressed model file, which already has the required folder structure. After extraction, copy the essential dataset (mainly the audio source files and the labeling information) into the 'essential_data' folder. The competition dataset can be downloaded from http://www.kaggle.com/c/mlsp-2013-birds/data . Once the 'folder' variable has been modified, run the Python sources in increasing alphabetical order. To skip a few steps, you can jump straight to the training part, since the 'DP' folder contains the necessary features. The resulting files are written to the 'Submission' folder.
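Before starting a long run, it can help to confirm that the 'folder' path points at the expected layout. The check below is a small sketch and not part of the repository. It assumes that 'essential_data', 'DP' and 'Submission' sit directly under the chosen folder; the helper name `missing_subfolders` is hypothetical.

```python
import os

# Scripts in the order they are meant to run (increasing alphabetical order).
SCRIPTS = [
    "1_pattern_extraction.py",
    "2a_data_preparation.py",
    "2b_data_preparation_logarithm.py",
    "3_train.py",
]


def missing_subfolders(folder, required=("essential_data", "DP", "Submission")):
    """Return the required subfolder names that do not exist under folder."""
    return [
        name for name in required
        if not os.path.isdir(os.path.join(folder, name))
    ]
```

An empty list from `missing_subfolders(folder)` means the layout matches this assumption and the scripts can be started in the listed order.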
