what should the development set's content be in a speech dataset and g2p? #36
Comments
Hi @albluc24, I don't think you need to worry about the g2p model for Italian. As for the development set, move at least a couple of files (2-5) from the training folder to the development folder. Also, in case you missed it in the documentation, you must not add files to the project's data folder directly. Run the import step instead; it automatically generates the resampled wav files, the .lab files and a whole bunch of other outputs (including rendered spectrograms in .png format). Before starting training, check a couple of these files to make sure they have the correct content. You can use the available vocoders (to avoid training your own, which takes months on low-end hardware), but then you need to specify --target-sample-rate=16000 for each step. Let me know if you need anything else, and feel free to post a fragment of your lab file if you want me to check it for correctness. Best,
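For anyone following along, moving a handful of utterances from the training folder to the development folder can be scripted. The snippet below is only a minimal sketch, not part of TTS-Cube: the function name `move_to_dev`, the folder arguments and the assumption that every `.wav` has a `.txt` transcript with the same base name are illustrative and should be adapted to your own layout.

```python
import shutil
from pathlib import Path


def move_to_dev(train_dir, dev_dir, count=3):
    """Move up to `count` wav/txt pairs from train_dir to dev_dir.

    Hypothetical helper: assumes each utterance is stored as
    <name>.wav plus <name>.txt in the same folder.
    """
    train_dir, dev_dir = Path(train_dir), Path(dev_dir)
    dev_dir.mkdir(parents=True, exist_ok=True)
    moved = []
    for wav in sorted(train_dir.glob("*.wav")):
        if len(moved) >= count:
            break
        txt = wav.with_suffix(".txt")
        if not txt.exists():
            # Skip audio without a transcript rather than moving half a pair.
            continue
        shutil.move(str(wav), str(dev_dir / wav.name))
        shutil.move(str(txt), str(dev_dir / txt.name))
        moved.append(wav.stem)
    return moved
```

Calling `move_to_dev("my_corpus/train", "my_corpus/dev", count=3)` (both paths are placeholders) would leave three complete pairs in the development folder. This operates on the raw corpus folders, before the import step is run.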
Hi, and thanks for the reply!
Can you paste the command you used for the import step?
Sure:
I think you have some folders missing. In the TTS-Cube folder do: mkdir -p data/processed/train |
This is my first GitHub issue, so please forgive me if there are any mistakes.
The problem I'm having right now is simply that I don't understand what the development folder of a training set should contain.
What I've done.
I've downloaded the M-AILABS Italian training set and split the CSV into txt files, so that each one corresponds to a wav file; that's the training set. My question is: what should I put in the other folder?
The readme says that there should be no more than 5 files in there, but when I start training with an empty dev folder, it gives me an error about a lab file that was not found.
I have the same doubt about the g2p part, but since I'm not going to use that feature, it's secondary for me. The same goes for adding custom things to the lab file; I haven't added any.
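The CSV split described above could be done along the following lines. This is a rough sketch under stated assumptions, not the exact script used here: it assumes the metadata file is pipe-separated with the wav base name in the first column and the transcript in the second, and the function name `split_metadata` and its `text_column` parameter are made up for illustration.

```python
import csv
from pathlib import Path


def split_metadata(csv_path, out_dir, text_column=1):
    """Write one <name>.txt transcript per row of a pipe-separated CSV.

    Assumed row layout (check your own file): name|text|normalized text
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = 0
    with open(csv_path, encoding="utf-8", newline="") as f:
        # QUOTE_NONE keeps quotation marks in the transcript untouched.
        reader = csv.reader(f, delimiter="|", quoting=csv.QUOTE_NONE)
        for row in reader:
            if len(row) <= text_column or not row[0].strip():
                continue  # skip blank or malformed rows
            stem = row[0].strip()
            text = row[text_column].strip()
            (out_dir / f"{stem}.txt").write_text(text + "\n", encoding="utf-8")
            written += 1
    return written
```

Writing the transcripts into the same folder as the wav files gives the one-txt-per-wav layout described above; pass `text_column=2` if the third column of your file holds the text you want to train on.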