Hi!
Could you share the loss images from training, to get an idea of what they should look like?
I'm trying to train a new single-speaker model, but my model can't articulate words at early stages (epoch 500), even though the attention matrix looks diagonal.
I attach the config file in case it might help.
Thanks!!
How many hours did you train on?
In my case, it took about 5–8 hours of data to train, but I haven't tried with less.
These are my loss images after training for 300 epochs.
Hey, thanks for sharing!
Never mind, it was a problem with my data. I had resampled my 22.05 kHz data to 44.1 kHz, and there were some artifacts in the high frequencies; that was the problem. Changing the frequency back to 22.05 kHz solved it, and now it sounds great!
In case it helps anybody: I trained with 20 hours of data, and below I attach the generator losses, which are higher than yours but still sound great to me.
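For anyone hitting the same issue, the fix above (matching the audio sample rate to what the model expects) can be sketched roughly like this. This is a minimal example, assuming mono float waveforms and a 44.1 kHz → 22.05 kHz conversion; `scipy.signal.resample_poly` applies an anti-aliasing low-pass filter during resampling, which helps avoid the high-frequency artifacts described:

```python
from math import gcd

import numpy as np
from scipy.signal import resample_poly

ORIG_SR = 44100    # assumption: source files are at 44.1 kHz
TARGET_SR = 22050  # assumption: the training config expects 22.05 kHz


def resample_audio(wav: np.ndarray, orig_sr: int = ORIG_SR,
                   target_sr: int = TARGET_SR) -> np.ndarray:
    """Resample a mono waveform with polyphase filtering.

    resample_poly low-pass filters while resampling, so the output
    contains no spurious content above the new Nyquist frequency.
    """
    g = gcd(orig_sr, target_sr)
    return resample_poly(wav, target_sr // g, orig_sr // g)


# Example: one second of a 440 Hz tone at 44.1 kHz
t = np.linspace(0, 1, ORIG_SR, endpoint=False)
tone = np.sin(2 * np.pi * 440 * t).astype(np.float32)
out = resample_audio(tone)
print(len(out))  # 22050 samples, i.e. one second at the target rate
```

After resampling, make sure the `sample_rate` in the training config matches the rate of the actual audio files, since a mismatch there can silently corrupt the mel features.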
config.json