CNN to classify samples of voice recordings into the language that was spoken
Jupyter Notebook
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
data Initial commit Apr 10, 2017
.gitignore added gitignore Apr 10, 2017
Language Classifier.ipynb Initial commit Apr 10, 2017
README.md Update README.md Apr 28, 2017
speech_v9.h5 Initial commit Apr 10, 2017

README.md

Spoken Language Recognition

This notebook trains a convolutional neural network to classify audio files of voice recordings into the languages that were spoken. The dataset I used contained 66.000 files across 176 languages. I found it on TopCoder (https://goo.gl/G5XBJl). I liked the idea behind this problem, because it's very hard for humans to do. It's intersting to see that CNNs perform well on problems where intuition doesn't get you anywhere.

I included a saved version of my pretrained model, which evaluates to an accuracy of 98,79%. Further notes on development can be seen in the Jupyter Notebook.