multilingual VITS:speaker_wav #2488
-
I have trained a multilingual vits_tts model(only using chinese multi-speaker dataset AISHELL3). Now, I am trying to synthesize chinese speech using a new speaker's voice by inputting speaker_wav: tts --text "wo3 shi4 quan2 shi4 jie4 zui4 mei3 de5 ren2 " However, I am encountering the following error message: Traceback (most recent call last): What should I do to fix the problem?Thanks! p.s: I was able to successfully synthesize the speech using the speaker_idx from the training set. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 4 replies
-
Give speaker encoder path and config |
Beta Was this translation helpful? Give feedback.
go to train_yourtts.py in recipe and read what speaker encoder is.