Semi-supervised Neural Chord Estimation Based on a Variational Autoencoder with Latent Chord Labels and Features

Links to the original paper: IEEE, arXiv

Quick navigation to important code

If you are interested in the implementation details of the presented model, see the following scripts:

net_chordrec.py -- Defines the chord estimation model.
net_generative.py -- Defines the proposed VAE as the UnifiedModel class.
training.py -- Code for the training loops.
const.py -- Specifies some hyperparameters.
Experiment_supervised_training.py -- Runs the cross-validation experiments for supervised training (976+0 in Fig.3).
Experiment_vae_training.py -- Runs the cross-validation experiment for the proposed semi-supervised training, using part of the annotated songs as unsupervised data (left half of Fig.3).
Experiment_vae_training_with_unsupervised_data.py -- Runs the cross-validation experiment for the proposed semi-supervised training, using the non-annotated data as unsupervised data (right half of Fig.3).

Training dataset

The dataset for training our model is stored in the dataset folder, which includes annotated data pairs for 1217 songs and non-annotated data for 700 songs. The chordlab folder stores the ground-truth chord labels of each annotated song. The chroma folder stores the 36-dimensional feature sequences extracted from each song using the DNN extractor proposed in "Automatic Audio Chord Recognition with MIDI-Trained Deep Feature and BLSTM-CRF Sequence Decoding Model" (see this repository).
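The on-disk layout of the files in chordlab is not described here. As a minimal sketch, assuming they follow the common three-column .lab convention (start time, end time, chord label, separated by whitespace), a label file could be read like this. The function name `parse_lab` and the example text are hypothetical and do not come from this repository.

```python
from typing import List, Tuple

# (start_seconds, end_seconds, chord_label)
Segment = Tuple[float, float, str]


def parse_lab(text: str) -> List[Segment]:
    """Parse '<start> <end> <label>' lines into (start, end, label) tuples.

    Assumption: whitespace-separated columns, one chord segment per line.
    """
    segments = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) < 3:
            continue  # skip blank or malformed lines
        start, end = float(parts[0]), float(parts[1])
        segments.append((start, end, parts[2]))
    return segments


# Hypothetical annotation text, not taken from the dataset.
example = "0.000 1.500 N\n1.500 3.250 C:maj\n3.250 5.000 A:min7\n"
segments = parse_lab(example)
for start, end, label in segments:
    print(f"{start:6.2f} {end:6.2f} {label}")
```

If the actual files use a different separator or column order, only the body of the loop would need to change.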

Dependencies

The experiments were performed on Python 3.6 and the following libraries were used:

Chainer 7.0.0
CuPy 7.0.0
librosa 0.7.0
mir_eval 0.5

Later versions of these libraries should also work: as far as we know, there have been no major API changes since the versions listed above.
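To see how an environment compares with the versions listed above, a small check along the following lines may help. This is only a sketch: the helper names are hypothetical, the dictionary keys are assumed to be the import names of the packages, and reading `__version__` is a convention rather than something this repository relies on.

```python
import importlib
import itertools

# Versions listed above, keyed by the (assumed) import name.
REFERENCE_VERSIONS = {
    "chainer": "7.0.0",
    "cupy": "7.0.0",
    "librosa": "0.7.0",
    "mir_eval": "0.5",
}


def version_tuple(version, width=3):
    """Turn '7.0.0' into (7, 0, 0); pads short versions, drops suffixes like 'rc1'."""
    numbers = []
    for piece in version.split(".")[:width]:
        digits = "".join(itertools.takewhile(str.isdigit, piece))
        numbers.append(int(digits) if digits else 0)
    return tuple(numbers + [0] * (width - len(numbers)))


def compare_to_reference(installed):
    """Map each package to 'missing', 'older', or 'ok' relative to the reference."""
    report = {}
    for name, wanted in REFERENCE_VERSIONS.items():
        found = installed.get(name)
        if found is None:
            report[name] = "missing"
        elif version_tuple(found) < version_tuple(wanted):
            report[name] = "older"
        else:
            report[name] = "ok"
    return report


def installed_versions():
    """Collect __version__ from each package; None if it cannot be imported."""
    found = {}
    for name in REFERENCE_VERSIONS:
        try:
            module = importlib.import_module(name)
        except Exception:  # missing package, or a GPU library failing to load
            found[name] = None
        else:
            found[name] = getattr(module, "__version__", None)
    return found


# Hypothetical environment, used here so the example runs without the libraries.
example = {"chainer": "7.8.1", "cupy": "6.7.0", "librosa": "0.7.0"}
report = compare_to_reference(example)
print(report)
```

In a real environment, `compare_to_reference(installed_versions())` would report on the packages that are actually installed.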

How to estimate chords from raw audio files?

Code for estimating chords from raw audio is not provided in this repository.

How to reproduce the experiments?

Run the scripts in the scripts folder under the root directory. When an experiment finishes, the chord estimation results can be found in the corresponding 'estimated' folders. Example: source scripts/experiment_semisup_fulldata.sh

experiment_supervised.sh -- the three different training methods for 976+0 in Fig.3.
experiment_proposed_vae.sh -- VAE_MR_SSL and VAE_MR_SL experiments for the left part of Fig.3.
experiment_nomarkov_vae.sh -- VAE_UN_SSL and VAE_UN_SL experiments for the left part of Fig.3.
experiment_semisupervised_vae.sh -- VAE_MR_SSL experiments for 976+700 in Fig.3.
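Once estimated chord sequences are available, they are typically compared with the ground truth by the fraction of time on which the two agree. The sketch below illustrates that idea only. It assumes both sequences are lists of (start, end, label) segments and uses exact string matching on labels, which is a simplification: the function name `weighted_accuracy` and the example data are hypothetical, and this is not the scoring code used for the paper. The `mir_eval.chord` module listed in the dependencies provides vocabulary-aware comparisons.

```python
def weighted_accuracy(reference, estimated):
    """Fraction of the reference duration on which the labels match exactly.

    Both arguments are lists of (start_seconds, end_seconds, label) tuples
    whose segments do not overlap within the same list.
    """
    total = sum(end - start for start, end, _ in reference)
    if total <= 0:
        return 0.0
    matched = 0.0
    for r_start, r_end, r_label in reference:
        for e_start, e_end, e_label in estimated:
            # Length of the time interval shared by the two segments.
            overlap = min(r_end, e_end) - max(r_start, e_start)
            if overlap > 0 and r_label == e_label:
                matched += overlap
    return matched / total


# Hypothetical segments, not results from the experiments.
reference = [(0.0, 2.0, "C:maj"), (2.0, 4.0, "G:maj"), (4.0, 6.0, "A:min")]
estimated = [(0.0, 2.5, "C:maj"), (2.5, 4.0, "G:maj"), (4.0, 6.0, "F:maj")]
score = weighted_accuracy(reference, estimated)
print(f"weighted accuracy: {score:.3f}")
```

Here the estimate agrees with the reference on 3.5 of the 6 seconds: 2.0 seconds of C:maj and 1.5 seconds of G:maj.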


Xiao-Ming/VAEChordEstimation
