GitHub - hackerpeter1/SVQTD: Singing Voice Quality and Technique Database (SVQTD) is a classical male singing dataset for describing classical tenor singing voices from vocal pedagogy point of view.

Data Request instructions are in the project page here.

Dataset preparation

download youtube videos with a python script and convert to audios using ffmpeg
performing music source separation based on spleeter
energy-based segmentation, reference code can be found in ./split.py
extracting feature set using OPENSMILE (optional, only if you are interested in training with traditional feature set)

Training files

Some pooling method for recognition neural network can be found in ./modules.
Some models are in ./models.
Some config files for respectively training Transformer and ResNet are in ./config.
./E2E.py can be used to train neural networks based on config files.
./RPSVM.py can be used to extract embeddings and train a SVM classifier using them.
./FSSVM.py can be used to train a SVM classifier using features from ComParE feature set.

Since our code is not user-friendly, if you have any questions about dataset downloading or the training code, please feel free to contact me through yanze.xu@outlook.com. Also welcome to talk with me if you are interested in timbre phenoemena.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dataset preparation

Training files

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
config		config
models		models
modules		modules
utils		utils
.gitignore		.gitignore
E2E.py		E2E.py
FSSVM.py		FSSVM.py
LICENSE		LICENSE
README.md		README.md
RPSVM.py		RPSVM.py
split.py		split.py

License

hackerpeter1/SVQTD

Folders and files

Latest commit

History

Repository files navigation

Dataset preparation

Training files

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages