CFP Neural Network

This is the code companion with the following paper:
Yu-Te Wu, Berlin Chen, and Li Su, "Automatic music transcription leveraging generalized cepstral features and deep learning," IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), April 2018

For our study, we can visualize the work as this: This means we teach our computer to have "absolute pitch" ability, just like expert musicians have.

There are two parts of the code: one is for feature extraction; and the another one is for NN construction, training, and testing. To run the code, you need to install keras with tensorflow backend. You can either use GPU for training or not. Just modify the code in _FullTrainTest.py, line 12, set the os environment variable equals to "" for not to use GPU.

Before running the training code, make sure that you have already done the feature extraction. If not, run the CFP_Extraction.m for generating the necessary files. Or you can also run the python version: GenFeature.py. But you have to write additional code for processing through all the dataset automatically. Also, you have to modify some path variables inside the _FullTrainTest.py file. Set it to your path where your files are.

The structure of CNN model is visualized as below:
And the structure of DNN is just the last four layers of CNN.

Below are transcription results:

chpn-p15_ENSTDkAm
grieg_kobold_ENSTDkAm

The full testing result reported in the paper here.

Enjoy~

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
Feature_Extraction		Feature_Extraction
figures		figures
LICENSE		LICENSE
PreProcess.py		PreProcess.py
README.md		README.md
Statistics.py		Statistics.py
loadFile.py		loadFile.py
runCNN_FullTrainTest.py		runCNN_FullTrainTest.py
runDNN_FullTrainTest.py		runDNN_FullTrainTest.py
test_config2.txt		test_config2.txt
train_config2_fold_1.txt		train_config2_fold_1.txt
train_config2_fold_2.txt		train_config2_fold_2.txt
train_config2_fold_3.txt		train_config2_fold_3.txt
train_config2_fold_4.txt		train_config2_fold_4.txt
val_config2_fold_1.txt		val_config2_fold_1.txt
val_config2_fold_2.txt		val_config2_fold_2.txt
val_config2_fold_3.txt		val_config2_fold_3.txt
val_config2_fold_4.txt		val_config2_fold_4.txt

License

BreezeWhite/CFP_NeuralNetwork

Folders and files

Latest commit

History

Repository files navigation

CFP Neural Network

About

Resources

License

Stars

Watchers

Forks

Languages