Fix VITS upsampling asserts #1550

Edresson · 2022-05-02T18:41:37Z

Fix upsampling VITS asserts that was broken.

erogol · 2022-05-03T08:42:58Z

I don't even see an assert fix in this PR? Do I miss something ?

Edresson · 2022-05-03T10:42:45Z

I don't even see an assert fix in this PR? Do I miss something ?

It fixes the upsampling asserts during the training. Without this, we can't train the model, because depending on the length of the audio the spectrogram length multiplied by the upsampling factor is not equal to the mel-spectrogram length. So I removed some frames of the audio like @WeberJulian did for the bandwidth extension model

We have two alternatives to the training works:

Apply the changes in this PR (the better in my opinion)
We can remove the upsampling asserts

erogol · 2022-05-03T15:07:54Z

I don't see an assert removed in the changes.

Edresson · 2022-05-03T15:21:50Z

I don't see an assert removed in the changes.

Because I have chosen to apply the changes in this PR. This PR fixes the asserts errors during the training keeping the asserts.

erogol · 2022-05-07T00:47:32Z

I don't see an assert removed in the changes.

Because I have chosen to apply the changes in this PR. This PR fixes the asserts errors during the training keeping the asserts.

I don't understand, sorry. Where are the asserts? What assert did you change? Do you mean assert python statement?

Edresson · 2022-05-07T11:47:14Z

I don't see an assert removed in the changes.

Because I have chosen to apply the changes in this PR. This PR fixes the asserts errors during the training keeping the asserts.

I don't understand, sorry. Where are the asserts? What assert did you change? Do you mean assert python statement?

These asserts 1 and 2 in some cases (depending on the audio length) break the upsampling training. So to fix this bug I submitted this PR.

erogol · 2022-05-07T12:53:38Z

I don't see an assert removed in the changes.

Because I have chosen to apply the changes in this PR. This PR fixes the asserts errors during the training keeping the asserts.

I don't understand, sorry. Where are the asserts? What assert did you change? Do you mean assert python statement?

These asserts 1 and 2 in some cases (depending on the audio length) break the upsampling training. So to fix this bug I submitted this PR.

Can you link to your exact changes in the code which fix those asserts? I don't understand how you fix those lines without touching those lines

Edresson · 2022-05-07T17:17:29Z

I don't see an assert removed in the changes.

Because I have chosen to apply the changes in this PR. This PR fixes the asserts errors during the training keeping the asserts.

I don't understand, sorry. Where are the asserts? What assert did you change? Do you mean assert python statement?

These asserts 1 and 2 in some cases (depending on the audio length) break the upsampling training. So to fix this bug I submitted this PR.

Can you link to your exact changes in the code which fix those asserts? I don't understand how you fix those lines without touching those lines

My bad I accidentally included the reinit encoder/duration predictor. Now I kept on this PR just the code that solves the asserts and added the reinit encoder/duration predictor in the PR #1562.

I don't understand how you fix those lines without touching those lines

These asserts are about the audio/spectrogram length, so if we guarantee that the audio/spectrogram is divisible by the upsampling factor, the problem is solved.

Fix style

* Add reinit encoder and duration predictor option * Add .data to prevent any overlooked autograd hook

Edresson changed the base branch from dev to fix-gan May 2, 2022 18:42

Edresson force-pushed the fix-upsampling-asserts branch from d7e079d to ecff669 Compare May 2, 2022 18:43

Edresson changed the base branch from fix-gan to fix_mas May 2, 2022 23:39

Edresson changed the base branch from fix_mas to dev May 2, 2022 23:40

Edresson changed the base branch from dev to fix-gan May 2, 2022 23:57

Base automatically changed from fix-gan to dev May 5, 2022 00:55

Edresson requested a review from erogol May 6, 2022 18:45

erogol force-pushed the dev branch from 0bd7a4a to a34076a Compare May 7, 2022 11:30

Edresson force-pushed the fix-upsampling-asserts branch 2 times, most recently from a66f13b to b51e54c Compare May 7, 2022 17:13

Edresson added 2 commits May 12, 2022 09:08

Fix the VITS upsampling asserts

1827110

Fix style

Add reinit text encoder and duration predictor parameter (#1562)

175ca06

* Add reinit encoder and duration predictor option * Add .data to prevent any overlooked autograd hook

Edresson force-pushed the fix-upsampling-asserts branch from d2f8027 to 175ca06 Compare May 12, 2022 12:09

erogol merged commit e45ae57 into dev May 12, 2022

erogol deleted the fix-upsampling-asserts branch May 12, 2022 12:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix VITS upsampling asserts #1550

Fix VITS upsampling asserts #1550

Edresson commented May 2, 2022 •

edited

Loading

erogol commented May 3, 2022 •

edited

Loading

Edresson commented May 3, 2022 •

edited

Loading

erogol commented May 3, 2022

Edresson commented May 3, 2022

erogol commented May 7, 2022 •

edited

Loading

Edresson commented May 7, 2022 •

edited

Loading

erogol commented May 7, 2022

Edresson commented May 7, 2022 •

edited

Loading

Fix VITS upsampling asserts #1550

Fix VITS upsampling asserts #1550

Conversation

Edresson commented May 2, 2022 • edited Loading

erogol commented May 3, 2022 • edited Loading

Edresson commented May 3, 2022 • edited Loading

erogol commented May 3, 2022

Edresson commented May 3, 2022

erogol commented May 7, 2022 • edited Loading

Edresson commented May 7, 2022 • edited Loading

erogol commented May 7, 2022

Edresson commented May 7, 2022 • edited Loading

Edresson commented May 2, 2022 •

edited

Loading

erogol commented May 3, 2022 •

edited

Loading

Edresson commented May 3, 2022 •

edited

Loading

erogol commented May 7, 2022 •

edited

Loading

Edresson commented May 7, 2022 •

edited

Loading

Edresson commented May 7, 2022 •

edited

Loading