training error #8

MingZJU · 2021-11-08T06:04:28Z

Thanks for your sharing!

I tried both naive and main branches using your checkpoints, it seems the former one is much better. So I trained AISHELL3 models with small changes on your code and the synthesized waves are good for me.

However when I add my own data into AISHELL3, some error occurred:
Training: 0%| | 3105/900000 [32:05<154:31:49, 1.61it/s]
Epoch 2: 69%|██████████████████████▏ | 318/459 [05:02<02:14, 1.05it/s]
File "train.py", line 211, in
main(args, configs)
File "train.py", line 87, in main
output = model(*(batch[2:]))
File "/opt/conda/lib/python3.8/site-packages/torch/nn/modules/module.py", line 889, in _call_impl
result = self.forward(*input, **kwargs)
File "/opt/conda/lib/python3.8/site-packages/torch/nn/parallel/data_parallel.py", line 165, in forward
return self.module(*inputs[0], **kwargs[0])
File "/opt/conda/lib/python3.8/site-packages/torch/nn/modules/module.py", line 889, in _call_impl
result = self.forward(*input, **kwargs)
File "/workspace/StyleSpeech-naive/model/StyleSpeech.py", line 83, in forward
) = self.variance_adaptor(
File "/opt/conda/lib/python3.8/site-packages/torch/nn/modules/module.py", line 889, in _call_impl
result = self.forward(*input, **kwargs)
File "/workspace/StyleSpeech-naive/model/modules.py", line 404, in forward
x = x + pitch_embedding
RuntimeError: The size of tensor a (52) must match the size of tensor b (53) at non-singleton dimension 1

I only replaced two speakers and preprocessed data the same as the in readme.

Do you have any advice for this error ? Any suggestion is appreciated.

MingZJU · 2021-11-09T05:55:16Z

Perhaps because of the lexicon, I'll fix and try again.

keonlee9420 · 2021-11-09T07:58:56Z

Hi @MingZJU , sorry for the late response. Thanks for sharing your experiments with AISHELL3 dataset. Hope to see it with PR. For the error you mentioned, I think it's from preprocessing stage. Please double-check that the length of the input audio and all the other audio-related features have the same length during data loading.

MingZJU · 2021-11-09T08:21:47Z

Thanks @keonlee9420 . I will check the audio and features and try again recently.

MingZJU · 2021-11-17T11:54:15Z

Solved. The error was caused by mfa. I installed mfa following the official instructions and it works good.

keonlee9420 · 2021-11-18T01:37:14Z

Great to hear that! thanks for sharing. If you'd like to share your experience further, then please make PR with it. It will be helpful for all users who need Chinese dataset.

sirius0503 · 2022-05-04T08:10:13Z

Solved. The error was caused by mfa. I installed mfa following the official instructions and it works good.

@MingZJU : I am getting the same error, can you explain how did you solve it? I am using conda installed mfa.

keonlee9420 closed this as completed Nov 18, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

training error #8

training error #8

MingZJU commented Nov 8, 2021 •

edited

MingZJU commented Nov 9, 2021

keonlee9420 commented Nov 9, 2021

MingZJU commented Nov 9, 2021

MingZJU commented Nov 17, 2021

keonlee9420 commented Nov 18, 2021

sirius0503 commented May 4, 2022

training error #8

training error #8

Comments

MingZJU commented Nov 8, 2021 • edited

MingZJU commented Nov 9, 2021

keonlee9420 commented Nov 9, 2021

MingZJU commented Nov 9, 2021

MingZJU commented Nov 17, 2021

keonlee9420 commented Nov 18, 2021

sirius0503 commented May 4, 2022

MingZJU commented Nov 8, 2021 •

edited