Spoken language identification with deep learning

Read more in the following blog posts:

Theano/Lasagne models are here. The basic steps to run them are:

Download the dataset from here or use your own dataset.
Create spectrograms for recording using create_spectrograms.py or augment_data.py. The latter will also augment the data by randomly perturbing the spectrograms and cropping a random interval of length 9s from the recording.
Create listfiles for training set and validation set, where each row of the a listfile describes one example and has 2 values seperated by a comma. The first one is the name of the example, the second one is the label (counting starts from 0). A typical listfile will look like this.
Change the png_folder and listfile paths in theano/main.py.
Run theano/main.py.

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
ensembling		ensembling
prototxt		prototxt
theano		theano
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
augment_data.py		augment_data.py
choose_equal_split.py		choose_equal_split.py
concatenate_csvs.py		concatenate_csvs.py
create_spectrograms.py		create_spectrograms.py
get_score_from_probabilities.py		get_score_from_probabilities.py
get_score_from_top3_prediction.py		get_score_from_top3_prediction.py
get_sum_of_csvs.py		get_sum_of_csvs.py
majority_vote_ensembling.py		majority_vote_ensembling.py
make_submission.py		make_submission.py
test_augm_network.py		test_augm_network.py
test_main_network.py		test_main_network.py

Provide feedback