Leaderboard C (Open)

Final LB Rank : 8th

Final LB Score: 8.537 dB

The final submission is a weighted blending of three models. Training details for each model is given below.

NOTE: All models were trained on 4x 80GB A100 GPUs.

Datasets used:

Training: total 347 songs Public datasets:

Validation: total 16 songs

Memory requirements: ~66GB per GPU

CUDA_VISIBLE_DEVICES=0,1,2,3 bash launcher.sh train_dist.py configs/dwt_transformer_unet/mdx/config_mdx_dwt.yaml

Memory requirements: ~47GB per GPU

CUDA_VISIBLE_DEVICES=0,1,2,3 bash launcher.sh train_dist.py configs/bsrnn/mdx_labelnoise/config_md_labelnoise_bass.yaml

CUDA_VISIBLE_DEVICES=0,1,2,3 bash launcher.sh train_dist.py configs/bsrnn/mdx_labelnoise/config_md_labelnoise_drums.yaml

CUDA_VISIBLE_DEVICES=0,1,2,3 bash launcher.sh train_dist.py configs/bsrnn/mdx_labelnoise/config_md_labelnoise_other.yaml

CUDA_VISIBLE_DEVICES=0,1,2,3 bash launcher.sh train_dist.py configs/bsrnn/mdx_labelnoise/config_md_labelnoise_vocals.yaml

Trained on 800 songs

Inference settings:

The final submission is a weighted blending of the three models. The weights were determined by grid search on the 16 validation songs.

The weights are as follows:

Source	BSRNN	DWT-Transformer-UNet	HTDemucs
bass	0	0.3	0.7
drums	0.3	0.2	0.5
other	0	0.4	0.6
vocals	0.4	0.2	0.4