-
Notifications
You must be signed in to change notification settings - Fork 111
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Advice on prepping datasets other than LJspeech? #6
Comments
Hey, just prepare your dataset in the LJSpeech format:
If the language differs from English, make sure you set the correct language in the hparams.py file:
Then just follow the steps from the README with preprocessing the folder, everything should be done automatically including splitting of the dataset into train/val etc. I updated the README to be clearer on this. Best of luck! |
Thanks for your help! I'm looking forward to giving it a try. |
Where find list is supported languages? |
@paklau99988 you can find the list of languages from here |
Hi, I'm trying to prep my own dataset to train on the ForwardTacotron model--could you give any insight as to what train_tacotron.py or train_forward.py is expecting in terms of training data organization? Like, the old NVIDIA TT2 repo expects two text files formatted in a certain way and a path to the WAV files in the arguments. Is there something similar for this repo?
The text was updated successfully, but these errors were encountered: