Deep clustering/Chimera recipe #96

Merged (9 commits) on May 11, 2020
Conversation

mpariente (Collaborator)
Adding the Deep clustering / Chimera++ recipe on wsj2mix and wsj3mix, mainly based on @sunits' initial work.

Data prep, dataloader, training and evaluation script.

The evaluation script will use the mask-inference head whenever possible, and the DC head only if loss_alpha is equal to 1.

The first successful iteration on the DC head alone gets a 10.1 dB SDR improvement (not SI-SDR).

Things left to do:

  • Upload some more results on DC alone, Chimera++ (maybe even 3mix)
  • Try to use kmeans in native PyTorch instead of sklearn because evaluation is really slow.
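As a sketch of what k-means in native PyTorch could look like (a hypothetical `kmeans_torch` helper, not part of this PR; the recipe currently calls sklearn in the eval script):

```python
import torch

def kmeans_torch(x, n_clusters, n_iters=20):
    """Naive Lloyd's k-means on a (n_points, emb) tensor; runs on GPU if x does."""
    # Initialise centroids from randomly chosen points
    centroids = x[torch.randperm(x.shape[0])[:n_clusters]].clone()
    for _ in range(n_iters):
        # Assign each embedding to its nearest centroid
        assignments = torch.cdist(x, centroids).argmin(dim=1)
        # Recompute each centroid as the mean of its assigned points
        for k in range(n_clusters):
            mask = assignments == k
            if mask.any():
                centroids[k] = x[mask].mean(dim=0)
    return assignments
```

Whether this actually beats sklearn would need benchmarking; the expected win is keeping the embeddings on GPU instead of round-tripping through NumPy.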

Comment on lines +65 to +67
proj = proj.view(batch, n_frames, -1, self.embedding_dim).transpose(1, 2)
# (batch, freq * frames, emb)
proj = proj.reshape(batch, -1, self.embedding_dim)
mpariente (Collaborator Author)
The bug was here. Without the transpose, the time bins were not aligned with each other and training was impossible.
I added a note about it in the DC loss.
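A toy shape walk-through of the fix (sizes are made up; only the flattening order matters): both paths produce a (batch, freq * frames, emb) shaped tensor, but only the transposed one flattens the bins frequency-major, matching the ordering the comment in the code describes.

```python
import torch

batch, n_frames, n_freq, emb = 1, 3, 2, 4  # toy sizes for illustration
proj = torch.arange(batch * n_frames * n_freq * emb, dtype=torch.float32)
proj = proj.view(batch, n_frames, n_freq * emb)  # layout as output by the network

# With the transpose: bins flattened frequency-major, as the loss expects
aligned = proj.view(batch, n_frames, -1, emb).transpose(1, 2).reshape(batch, -1, emb)
# Without it: same values, but flattened frame-major
misaligned = proj.view(batch, n_frames, -1, emb).reshape(batch, -1, emb)

assert aligned.shape == misaligned.shape == (batch, n_freq * n_frames, emb)
assert not torch.equal(aligned, misaligned)  # rows pair up with different TF bins
```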

Comment on lines +146 to +155
try:
    # Last best model summary
    with open(os.path.join(exp_dir, 'best_k_models.json'), "r") as f:
        best_k = json.load(f)
    best_model_path = min(best_k, key=best_k.get)
except FileNotFoundError:
    # Get last checkpoint
    all_ckpt = os.listdir(os.path.join(exp_dir, 'checkpoints/'))
    all_ckpt.sort()
    best_model_path = os.path.join(exp_dir, 'checkpoints', all_ckpt[-1])
mpariente (Collaborator Author)
Also, this would be a way to bypass the best_k_models.json.
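For reference, best_k_models.json maps checkpoint paths to their validation losses, so min over the dict with key=best_k.get picks the lowest-loss checkpoint. A self-contained sketch (paths and loss values are invented):

```python
import json
import os
import tempfile

# Invented contents; the real file is written from checkpoint.best_k_models
best_k = {
    "checkpoints/epoch=10.ckpt": 0.42,
    "checkpoints/epoch=14.ckpt": 0.37,
    "checkpoints/epoch=17.ckpt": 0.39,
}
with tempfile.TemporaryDirectory() as exp_dir:
    with open(os.path.join(exp_dir, "best_k_models.json"), "w") as f:
        json.dump(best_k, f, indent=0)
    with open(os.path.join(exp_dir, "best_k_models.json"), "r") as f:
        loaded = json.load(f)
    # min iterates over the keys (paths), compared by their loss values
    best_model_path = min(loaded, key=loaded.get)
```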

with open(os.path.join(exp_dir, "best_k_models.json"), "w") as f:
    json.dump(checkpoint.best_k_models, f, indent=0)
# Save last model for convenience
torch.save(system.model.state_dict(),
           os.path.join(exp_dir, 'final.pth'))
mpariente (Collaborator Author)
And this would be another one. But if training didn't finish, this model is never saved.
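The convenience of the final state_dict is that evaluation can reload it into a fresh model without any Lightning checkpoint machinery. A minimal sketch with a stand-in nn.Linear (the real model is the recipe's Chimera network):

```python
import os
import tempfile

import torch
from torch import nn

model = nn.Linear(4, 2)  # stand-in for system.model
with tempfile.TemporaryDirectory() as exp_dir:
    path = os.path.join(exp_dir, "final.pth")
    torch.save(model.state_dict(), path)
    # Reload into a freshly constructed instance for evaluation
    fresh = nn.Linear(4, 2)
    fresh.load_state_dict(torch.load(path))
```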
