DSL Winter Project 2022-2023 - Speech Recognition

This is a project made with Elisa Cascina for the Data Science and Engineering course Data Science Lab: Process and Methods in PoliTo.

In this project we introduce a possible approach for classification problem in the field of natural language processing and speech recognition. In particular, the proposed approach is based on the Mel-frequency Cepstral Coefficients (MFCC), a widely used technique for extracting features from the audio signal. Every mfcc is divided into blocks in the time domain, some summary statistics are computed and used as input for a Random Forest classification model. The presented method overcomes the benchmark set for the issue and delivers overall acceptable outcomes.

You can find an in-depth explaination in this report.

Set up

Install requirements:

pip3 install -r requirements.txt

Download the dataset here and move the folder 'dsl_data' into the project folder.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
Icon		Icon
Project (branch)_x_report.ipynb		Project (branch)_x_report.ipynb
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DSL Winter Project 2022-2023 - Speech Recognition

Set up

About

Releases

Packages

Languages

andry2327/DSL-Winter-Project-2022-2023

Folders and files

Latest commit

History

Repository files navigation

DSL Winter Project 2022-2023 - Speech Recognition

Set up

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages