Advice on prepping datasets other than LJspeech? #6

Closed
scripples opened this issue Jun 7, 2020 · 4 comments

scripples commented Jun 7, 2020

Hi, I'm trying to prep my own dataset to train the ForwardTacotron model. Could you give any insight into how train_tacotron.py or train_forward.py expects the training data to be organized? For example, the old NVIDIA TT2 repo expects two text files formatted in a certain way, plus a path to the WAV files in the arguments. Is there something similar for this repo?

cschaefer26 commented Jun 8, 2020

Hey, just prepare your dataset in the LJSpeech format:

|- dataset_folder/
|   |- metadata.csv
|   |- wav/
|       |- file1.wav
|       |- ...
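
For anyone starting from raw clips and transcripts, here is a minimal sketch of building that layout. It assumes the usual LJSpeech convention for metadata.csv (one pipe-separated row per clip, with the wav file name without extension first and the transcript after it). The `write_metadata` helper and its arguments are made up for illustration and are not part of this repo, so check the README for the exact columns the preprocessing expects.

```python
from pathlib import Path


def write_metadata(dataset_dir, transcripts):
    """Write an LJSpeech-style metadata.csv with one `file_id|text` row per clip.

    `transcripts` maps a file id (wav name without extension) to its text.
    Returns the ids that have no matching wav file; those rows are skipped.
    """
    dataset_dir = Path(dataset_dir)
    wav_dir = dataset_dir / "wav"
    missing = []
    lines = []
    for file_id, text in sorted(transcripts.items()):
        if not (wav_dir / f"{file_id}.wav").is_file():
            missing.append(file_id)
            continue
        # '|' separates columns, so strip it from the text; also collapse
        # newlines and repeated spaces so each clip stays on a single row.
        clean_text = " ".join(text.replace("|", " ").split())
        lines.append(f"{file_id}|{clean_text}\n")
    with open(dataset_dir / "metadata.csv", "w", encoding="utf-8") as f:
        f.writelines(lines)
    return missing
```

Calling `write_metadata("dataset_folder", {"file1": "Some transcript."})` writes `dataset_folder/metadata.csv` and returns the ids that still need audio.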

If the language differs from English, make sure you set the correct language in the hparams.py file:

language = 'fr'
tts_cleaner_name = 'basic_cleaners'
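
For context on the cleaner setting: in the keithito/tacotron text cleaners that many Tacotron repos borrow from, `basic_cleaners` lowercases the text and collapses whitespace without transliterating to ASCII, which is why it is the usual pick for non-English data. A rough sketch of that behavior follows. The `basic_clean` name is hypothetical, the trailing `strip()` is my own addition, and this repo's implementation may differ.

```python
import re

# Matches any run of whitespace (spaces, tabs, newlines).
_whitespace_re = re.compile(r"\s+")


def basic_clean(text):
    """Lowercase and collapse whitespace; accented characters are kept as-is."""
    return _whitespace_re.sub(" ", text.lower()).strip()
```

Accented characters pass through unchanged, so the transcript in metadata.csv should already be spelled the way you want it read.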

Then just follow the preprocessing steps in the README for that folder. Everything else should happen automatically, including splitting the dataset into train/val.
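
If you want a feel for what an automatic train/val split amounts to, here is a minimal sketch. It is not the repo's code: the `split_train_val` name, the `n_val` parameter, and the seed are all assumptions made for illustration, and the actual split size is whatever the repo's preprocessing and hparams define.

```python
import random


def split_train_val(file_ids, n_val, seed=42):
    """Shuffle ids reproducibly and hold out the first `n_val` for validation."""
    ids = sorted(file_ids)  # sort first so the result does not depend on input order
    random.Random(seed).shuffle(ids)
    return ids[n_val:], ids[:n_val]
```

Using a fixed seed keeps the held-out clips the same between runs, which makes validation numbers comparable.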

I updated the README to be clearer on this. Best of luck!

scripples commented

Thanks for your help! I'm looking forward to giving it a try.

ghost commented Oct 4, 2020

Where can I find the list of supported languages?

prajwaljpj commented

@paklau99988 You can find the list of languages here
