Skip to content

Code for the prestudy of my degree project

Notifications You must be signed in to change notification settings

HelmerNylen/prestudy

Repository files navigation

Degree Project Prestudy

The code for the experimental part of the prestudy for my degree project, where five types of classifiers are tested on noisy speech. Please refer to the report for details.

Relies on https://github.com/FirstHandScientist/genhmm for the GenHMM and GMMHMM implementations (which in turn requires PyTorch, CUDA, etc.), and Matlab and the Audio Degradation Toolbox for dataset generation. Noise samples are included but not the TIMIT database.

Some pretty graphs

The optimal (that I found) average performance across 10 instances of each classifier type (except SVM, as it had no element of randomness).

Classifier performance on different datasets

Accuracy on the test set over time on one of the datasets. Training is stopped when this accuracy no longer increases, which is why each line ends with a dip. The second-to-last epoch is then used for comparison. The colored region displays the interquartile range.

Training progress on snr_20

Accuracy distribution of each classifier, evaluated in the second-to-last epochs. The box shows the interquartile range and the whiskers the maximum and minimum values. The yellow line is the median.

Accuracy distribution on snr_20

About

Code for the prestudy of my degree project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published