This is an implementation of U-Net for vocal, bass, and drums separation with TensorFlow.
- librosa==0.6.2
- numpy==1.14.3
- tensorflow==1.13.0
- python==3.6.5
Download the DSD100 dataset:
$ python download_data.py --DATADIR ./data
Prepare the CCMixter dataset in "./data". Each track consists of mixed, bass, drums, other, and vocal versions.
$ python CCMixter_process.py --DATADIR ./data
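
Before running the processing script, it can help to confirm that every track has all five versions. The following is a minimal sketch, not part of this repository. It assumes each track is a sub-directory of the data directory holding one WAV file per version. The file names in `EXPECTED_STEMS` and the helper `find_incomplete_tracks` are hypothetical, so adjust them to match the actual layout.

```python
import os

# Hypothetical file names for the five versions of a track; the names
# actually used by the dataset or by CCMixter_process.py may differ.
EXPECTED_STEMS = ("mixed.wav", "bass.wav", "drums.wav", "other.wav", "vocal.wav")


def find_incomplete_tracks(data_dir, expected=EXPECTED_STEMS):
    """Return {track_name: [missing file names]} for tracks lacking a version."""
    incomplete = {}
    for name in sorted(os.listdir(data_dir)):
        track_dir = os.path.join(data_dir, name)
        if not os.path.isdir(track_dir):
            continue  # skip loose files at the top level of data_dir
        present = set(os.listdir(track_dir))
        missing = [stem for stem in expected if stem not in present]
        if missing:
            incomplete[name] = missing
    return incomplete


if __name__ == "__main__":
    data_dir = "./data"
    if os.path.isdir(data_dir):
        for track, missing in find_incomplete_tracks(data_dir).items():
            print("{}: missing {}".format(track, ", ".join(missing)))
```

An empty result means every track directory contains all of the expected files under the assumed names.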
- Train
$ python Training.py
- Test
$ python Test.py
Jaehoon Oh et al. Spectrogram-Channels U-Net: A Source Separation Model Viewing Each Channel as the Spectrogram of Each Source
- Convert WAV files to MP3 files
- Make TFRecord format files