deep-karaoke-maker

A program to separate a given track into vocal and instrumental stems, based on PyTorch. Based on [1], who have their own implementation (in MATLAB) here.

Running the Code

Get the MedleyDB dataset, the dataset used to train the model is extracted from it. You need to ask them for access to download, they are very responsive by email. Note that the DB is heavy (41GB compressed).
Update the file medleydb_deepkaraoke.json with the path to where you extracted MedleyDB. If you use the MedleyDB Sample, make sure to erase all entries from the json file and keep only the ones you have in the sample.
Run python spectrum_helper/__init__.py to generate the dataset.
Run python deep_karaoke.py train to train the network. (run with --help to see options)

References

[1] Simpson A.J.R., Roma G., Plumbley M.D. (2015) Deep Karaoke: Extracting Vocals from Musical Mixtures Using a Convolutional Deep Neural Network. In: Vincent E., Yeredor A., Koldovský Z., Tichavský P. (eds) Latent Variable Analysis and Signal Separation. LVA/ICA 2015. Lecture Notes in Computer Science, vol 9237. Springer, Cham.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
spectrum_helper		spectrum_helper
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
deep_karaoke.py		deep_karaoke.py
medleydb_deepkaraoke.json		medleydb_deepkaraoke.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

spectrum_helper

spectrum_helper

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

init.py

init.py

deep_karaoke.py

deep_karaoke.py

medleydb_deepkaraoke.json

medleydb_deepkaraoke.json

Repository files navigation

deep-karaoke-maker

Running the Code

References

About

Releases

Packages

Languages

License

bachsh/deep-karaoke-maker

Folders and files

Latest commit

History

Repository files navigation

deep-karaoke-maker

Running the Code

References

About

Topics

Resources

License

Stars

Watchers

Forks

Languages