New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The poor performance of DeepClustering #138
Comments
Hey, thanks for opening an issue. data:
n_src: 2
sample_rate: 8000
train_dir: data/2speakers/wav8k/min/tr
valid_dir: data/2speakers/wav8k/min/cv
filterbank:
kernel_size: 256
n_filters: 256
stride: 64
main_args:
exp_dir: exp/train_chimera_dcalone_newlr/
help: null
masknet:
dropout: 0.3
embedding_dim: 40
hidden_size: 600
n_layers: 4
n_src: 2
rnn_type: lstm
take_log: true
optim:
lr: 0.0001
optimizer: adam
weight_decay: 0.0
positional arguments: {}
training:
batch_size: 32
early_stop: true
epochs: 200
half_lr: true
loss_alpha: 1.0
num_workers: 8 Most importantly, the learning rate is Here are the metrics {
"si_sdr": 9.846718255569847,
"si_sdr_imp": 9.84787033402749,
"sdr": 10.364474047942608,
"sdr_imp": 10.213430163640966,
"sir": 19.018816263769104,
"sir_imp": 18.867772082693932,
"sar": 11.336185832787844,
"sar_imp": -64.43806377592414,
"stoi": 0.8787521784931114,
"stoi_imp": 0.1407063969268085
} My best val loss was around 2930 and I trained for 130 epochs. I you come over slack, I can share the pretrained model folder (under research only license) to you. |
thanks a lot, I am to try this config. The results may come out tomorrow. If it is fine, I will close this issue. One more question, |
Yes, I did. |
Sorry for bothering again. After the 1st epoch, my loss was around 6000. Is this normal? |
Sounds about right. |
It seemed that I could not produce nice results. After adopting the same settings, the sisdr is 8.3., still 1dB lower. I don't know the reason since I used the newest code but the software version might be different . Please comment if you have any suggestions. Otherwise, I am to close the issue. Thanks a lot. |
Could you please use version 0.2.0 ( Thanks for reporting your problems |
@hangtingchen Could you try it please? |
font{
line-height: 1.6;
}
ul,ol{
padding-left: 20px;
list-style-position: inside;
}
I am a little busy recently. But yes, I am trying, and I will tell you the results if the model is finished training. Maybe 2-3 days later.
font{
line-height: 1.6;
}
Sincerely,Hangting Chen
On 6/12/2020 17:32,Pariente Manuel<notifications@github.com> wrote:
@hangtingchen Could you try it please?
—You are receiving this because you were mentioned.Reply to this email directly, view it on GitHub, or unsubscribe.
|
font{
line-height: 1.6;
}
ul,ol{
padding-left: 20px;
list-style-position: inside;
}
Dear Pariente Manuel,
Sorry for the late relay. We are still unable to reproduce the work.
The environment is Pytorch_lightning 0.6.0 and asteriod 0.2.0, 1 gpuWe follow the same settings, but the final loss is about 3.5e+3, 20% higher than 2.9e+3.We don't know the reason, just report the results to you.
font{
line-height: 1.6;
}
Sincerely,Hangting Chen
On 6/12/2020 17:32,Pariente Manuel<notifications@github.com> wrote:
@hangtingchen Could you try it please?
—You are receiving this because you were mentioned.Reply to this email directly, view it on GitHub, or unsubscribe.
|
Maybe you're using wsj1? |
Hi An example : wv2 spectrogram |
Oh that's a very good point, could you please open a separate issue please? |
Hi
First thanks a lot for such an excellent tool for speech separation. I have tried the deep clustering part of wsj0-mix
https://github.com/mpariente/asteroid/tree/master/egs/wsj0-mix/DeepClustering
My performance was poor (si-sdr=3.5, sdr=4.5 in 35 epochs with 1 gpu for training). As reported here, the sdr is expected to be closed to 10dB. I am wondering the reason of the failure. Is there any tricks for training, or more epochs are needed for improvement?
Thanks a lot.
The text was updated successfully, but these errors were encountered: