Play around with the model and make predictions using the web app I made using Flask.
Read the blog post.
Classifying audio of human speech into various accents/countries of origin using MFCC coefficients extracted from audio .wav files.
The model in flask_app/static/sklearn_models/final_model.pkl
is an ensemble of K-Nearest Neighbor and Logistic
Regression models. The overall predictive accuracy of the model is 0.89
and it has an ROC AUC score of 0.95
. The
blog post about it is here.
This was developed over a 2-week span in August 2020 as a project for the Metis data science program.
Fokoue, E. (2020). UCI Machine Learning Repository - Speaker Accent Recognition Data Set. Irvine, CA: University of California, School of Information and Computer Science.
- Running
main.py
will launch the Flask app locally. utilities.py
contains files needed for the Flask app to run.notebooks/
contains the Jupyter Notebook used to do all data analysis and modeling, as well as an accompanying file of Python functions.templates/
contains HTML files for the Flask applicationstatic/
contains static files for the Flask application as well as the pickled scikit-learn model.heroku.yml
andDockerfile
are used for deployment of the Flask app to Heroku
The contents of /notebooks
can't be fully run because the Jupyter Notebook connects to a remote SQL database. In order
to install the dependencies for the flask app, run:
pip install -r requirements.txt