MR-MT3

Code accompanying paper: MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage.

Setup steps

conda create --name mrmt3 python==3.10 -y
conda activate mrmt3
python -m pip install --upgrade pip
python -m pip install transformers==4.18.0
python -m pip install torch
python -m pip install librosa==0.9.1
python -m pip install t5==0.9.3
python -m pip install note-seq==0.0.3
python -m pip install pretty-midi==0.2.9
python -m pip install einops==0.4.1
python -m pip install ddsp                                 <- changed : Dont specify version, when installing python 3.10, Because of llvmlite, numba
python -m pip install tensorflow==2.11
python -m pip install tensorflow-text==2.11
python -m pip install protobuf==3.20
python -m pip install numpy==1.23.5                        <- changed : Scipy requires numpy>=1.23.5
python -m pip install hydra-core==1.2.0
python -m pip install typing_extensions
python -m pip install torchaudio                           <- changed : required 
python -m pip install tensorflow_probability==0.19.0       <- changed : tensorflow==2.11 requires downgrading this library 
python -m pip install pytorch_lightning                    <- changed : for import the model

Next, you need to download the required datasets and postprocess them:

For Slakh

Re-sample Slakh .flac to 16kHz - python3 resample.py.
Create the grouped stem version as ground truth instead of the existing all_src.mid. Some bass notes have octave errors - python3 midi_script.py.
python3 tools/generate_inst_names.py

For ComMU

Download ComMU dataset - https://github.com/POZAlabs/ComMU-code/tree/master/dataset
cd scripts/commu/ -> ./process_commu_dataset.sh

For NSynth

Download NSynth dataset validation split.
cd scripts/nsynth/ -> python3 convert_nsynth_json_to_midi.py

Training

Refer to train.sh for a list of train commands corresponding to all of our experiments.

Evaluation

You can download our pretrained models at: https://huggingface.co/gudgud1014/MR-MT3/tree/main

Refer to test.sh, for a list of test commands corresponding to all of our experiments.

Basically, each command runs test.py that:

transcribe MIDI files based on a given eval.audio_dir;
compute multi-instrument F1 score w.r.t. the ground truth MIDI.

License

MIT

Citations

If you find our research useful, kindly cite us at:

@article{tan2024mr,
  title={MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage},
  author={Tan, Hao Hao and Cheuk, Kin Wai and Cho, Taemin and Liao, Wei-Hsiang and Mitsufuji, Yuki},
  journal={arXiv preprint arXiv:2403.10024},
  year={2024}
}

Credits

Huge shoutout to @kunato as we largely based our initial MT3 experiments on his implementation - https://github.com/kunato/mt3-pytorch.

Name		Name	Last commit message	Last commit date
Latest commit History 88 Commits
config		config
contrib		contrib
dataset		dataset
models		models
plots		plots
pretrained		pretrained
scripts		scripts
tasks		tasks
tools		tools
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
NOTES.txt		NOTES.txt
README.md		README.md
evaluate.py		evaluate.py
generate_inst_names.py		generate_inst_names.py
inference.py		inference.py
midi_script.py		midi_script.py
requirements.txt		requirements.txt
resample.py		resample.py
test.py		test.py
test.sh		test.sh
train.py		train.py
train.sh		train.sh
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MR-MT3

Setup steps

For Slakh

For ComMU

For NSynth

Training

Evaluation

License

Citations

Credits

About

Releases

Packages

Contributors 2

Languages

License

gudgud96/MR-MT3

Folders and files

Latest commit

History

Repository files navigation

MR-MT3

Setup steps

For Slakh

For ComMU

For NSynth

Training

Evaluation

License

Citations

Credits

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages