MusicSegmentationML

About

MusicSegmentationML is a supervised sequence-to-sequence machine learning project that labels parts of songs (audio WAV files) with their corresponding song segment type (such as verse, chorus, bridge, etc.). You can read more about this project and how to use the code provided here.

This project is written in Python with Keras and provides functions and classes for extracting features (feat_extract.py), generating data to feed into the models (feat_extract.py), and training various models (ml.py).

The primary features extracted from the audio WAV files (at the moment) are MFCC features. Right now, the best model, Faded2DConvModel (see Model_Thoughts.txt for params), achieves ~70% 24-fold validation accuracy (when averaged on the model's maximum accuracies on each fold) on a dataset of 24 different songs.

To provide comments/critiques/suggestions (would be much appreciated!), to request the data used for this project or to make any other inquiries, send an email to me[at]nathancontreras[dot]com.

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
.gitattributes		.gitattributes
.gitignore		.gitignore
2DConvModel_Tests.py		2DConvModel_Tests.py
Model_Thoughts.txt		Model_Thoughts.txt
Pooling1DModel_Tests.py		Pooling1DModel_Tests.py
README.md		README.md
SimpleRNN_Tests.py		SimpleRNN_Tests.py
banner.png		banner.png
feat_extract.py		feat_extract.py
ml.py		ml.py
model_playground.py		model_playground.py
playground.py		playground.py
playground2.py		playground2.py
playground3.py		playground3.py
test.wav		test.wav
test_playground.py		test_playground.py

tothepowerofn/MusicSegmentationML

Folders and files

Latest commit

History

Repository files navigation

MusicSegmentationML

About

About

Resources

Stars

Watchers

Forks

Languages