Replies: 1 comment
-
>>> georroussos |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
>>> Hassan_Jalil
[August 28, 2020, 8:05pm]
So I have been going through the Issues on Github and questions here on
discourse, and my understanding is, if I want to train Mozilla TTS on my
own voice in English, the best approach is to fine tune the pertained
model with new dataset. slash
Now I have a few questions regarding this
1. How much data is needed for fine tuning, considering new data is
also in English but will have a slightly different ascent (I am from
Pakistan) and voice is male. Is 4-5 hours of clean good data enough
?
2. So clean the dataset, give it similar structure to LJ Speech, update
the config and start training ? slash
Can some one provide some basic how to on getting started with Fine
Tuning a pretrained model with my own dataset.
3. For Dataset we require only Audio and Transcript right ? we dont
need alignment?
Thank you. I know these are noobish questions, but I am starting out and
I couldnt find answer to these questions.
[This is an archived TTS discussion thread from discourse.mozilla.org/t/data-requirements-for-fine-tuning-lj-speech-to-learn-my-voice-in-english]
Beta Was this translation helpful? Give feedback.
All reactions