Train First Speaker Verification Model #19

juanmc2005 · 2019-05-23T15:31:45Z

Use SpeakerModel and VoxCeleb1 to train a first speaker verification model.
Cross entropy takes priority over the rest of the losses as discussed in previous meetings.
Make sure to use EER as the validation metric.

juanmc2005 · 2019-05-24T15:35:46Z

Cannot directly use VoxCeleb1 data with SincNet. We need to split the samples in chunks, like they explain in the paper (at least at the beginning).
This missing chunks are most likely causing an out of memory issue.

@hbredin any way of chunking audio segments already implemented in pyannote?

hbredin · 2019-05-25T08:30:31Z

Not sure what you need.

SpeechSegmentGenerator already yields audio chunks, whose duration is controlled by the duration parameter.

Can you clarify your needs?

juanmc2005 · 2019-05-25T09:14:00Z

Nevermind, I misunderstood the
duration parameter. I will try changing that on Monday.
Thanks!

juanmc2005 · 2019-05-28T12:45:10Z

@hbredin Please tell me if I can do something to help solve the 3199 issue.
If you want to force it to see what it looks like, you can use the sv-train2 branch and run:
python -W ignore main.py --task speaker --loss softmax --epochs 1500 --no-plot --no-save --batch-size 100 --log-interval 5
The problem occurs around 25% of the first epoch

hbredin · 2019-05-28T12:53:59Z

I am not sure why this happens.

One way of understanding this behavior is to create a simple script that simply iterates forever on SpeechSegmentGenerator and stops as soon as the number of samples is not 3200.

You can edit SpeechSegmentGenerator temporarily so that it also returns the value of sub_segment and files[i].

hbredin · 2019-05-28T12:54:32Z

Starting from here, we will be able to investigate what is happening

juanmc2005 · 2019-05-28T12:57:23Z

Got it. I'm switching to STS for the time being, to integrate the model and dataset.
After that I'll start working this out.
Thanks!

juanmc2005 · 2019-06-04T09:03:45Z

Will unblock and use 0-padding for samples with wrong dimensions while we still look for what's causing this problem.

juanmc2005 · 2019-06-07T09:59:45Z

The validation code is too expensive to run after each epoch, will use a separate script to run validations in parallel when a model is saved. This will be done as part of another issue: #25
The remaining task for this issue is being able to train a model without validation.

juanmc2005 · 2019-06-25T15:00:10Z

Update: Validation can be done in-training for VoxCeleb1, but it would be useful to parallelize anyway to tackle VoxCeleb2.
Models can be trained using cross entropy, although the results are not very good. Will close this issue and open another one to address that problem.

juanmc2005 created this issue from a note in SV - STS (To do) May 23, 2019

juanmc2005 added the enhancement New feature or request label May 23, 2019

juanmc2005 moved this from To do to In progress in SV - STS May 23, 2019

juanmc2005 self-assigned this May 23, 2019

juanmc2005 moved this from In progress to Blocked in SV - STS May 28, 2019

juanmc2005 added the blocked Development has stopped because of a problem label May 28, 2019

juanmc2005 removed the blocked Development has stopped because of a problem label Jun 4, 2019

juanmc2005 moved this from Blocked to In progress in SV - STS Jun 4, 2019

juanmc2005 closed this as completed Jun 25, 2019

juanmc2005 moved this from In progress to Done in SV - STS Jun 25, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Train First Speaker Verification Model #19

Train First Speaker Verification Model #19

juanmc2005 commented May 23, 2019 •

edited

Loading

juanmc2005 commented May 24, 2019

hbredin commented May 25, 2019

juanmc2005 commented May 25, 2019

juanmc2005 commented May 28, 2019

hbredin commented May 28, 2019 •

edited

Loading

hbredin commented May 28, 2019

juanmc2005 commented May 28, 2019

juanmc2005 commented Jun 4, 2019

juanmc2005 commented Jun 7, 2019

juanmc2005 commented Jun 25, 2019

Train First Speaker Verification Model #19

Train First Speaker Verification Model #19

Comments

juanmc2005 commented May 23, 2019 • edited Loading

juanmc2005 commented May 24, 2019

hbredin commented May 25, 2019

juanmc2005 commented May 25, 2019

juanmc2005 commented May 28, 2019

hbredin commented May 28, 2019 • edited Loading

hbredin commented May 28, 2019

juanmc2005 commented May 28, 2019

juanmc2005 commented Jun 4, 2019

juanmc2005 commented Jun 7, 2019

juanmc2005 commented Jun 25, 2019

juanmc2005 commented May 23, 2019 •

edited

Loading

hbredin commented May 28, 2019 •

edited

Loading