Extending to n+1 target speakers using pretrained Cotatron #33

shaun-mathew · 2022-03-05T23:56:22Z

Hello,

How would I extend this model to n+1 target speakers for any-to-many or many-to-many conversion? When I extend speakers_list in the global config.yaml to cover both the LibriTTS speakers and the speakers in our dataset, and then load the pretrained Cotatron weights, training the decoder fails with an embedding size mismatch, because the embedding dimensions are derived from speakers_list. Should I simply keep speakers_list unchanged (LibriTTS + VCTK only, without our dataset's speaker names) and train the decoder/synthesizer on the combined data (LibriTTS + our dataset)?
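For illustration, here is a minimal sketch of one alternative that is sometimes used in this situation: build a larger speaker embedding table, copy the pretrained rows for the known speakers, and freshly initialize rows for the new ones. This is an assumption, not something confirmed for this repository. The function name `extend_speaker_table`, the speaker names, and the toy 3-dimensional vectors are all hypothetical, and plain Python lists stand in for the real weight tensor.

```python
import random

def extend_speaker_table(old_rows, old_speakers, new_speakers, seed=0):
    """Build an embedding table for new_speakers, reusing pretrained rows.

    Hypothetical helper: rows for speakers already in old_speakers are
    copied, rows for unseen speakers get a small random initialization.
    """
    dim = len(old_rows[0])
    rng = random.Random(seed)
    index = {name: i for i, name in enumerate(old_speakers)}
    table = []
    for name in new_speakers:
        if name in index:
            # Known speaker: keep the pretrained embedding row.
            table.append(list(old_rows[index[name]]))
        else:
            # New speaker: start from small random values.
            table.append([rng.gauss(0.0, 0.01) for _ in range(dim)])
    return table

# Toy stand-ins for the pretrained table and the speaker lists.
old_speakers = ["spk_a", "spk_b"]
old_rows = [[0.1, 0.2, 0.3], [0.4, 0.5, 0.6]]
new_speakers = old_speakers + ["spk_new"]

new_rows = extend_speaker_table(old_rows, old_speakers, new_speakers)
print(len(new_rows), "rows of width", len(new_rows[0]))
```

With a real checkpoint, the same idea would apply to whichever weight entry holds the speaker embedding. New speakers would have to be appended after the existing ones (or matched by name, as above) so that the pretrained rows keep their meaning.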

Thanks
