Add espnet2 TTS recipe on M-AILABS #4701
Codecov Report
@@ Coverage Diff @@
## master #4701 +/- ##
=======================================
Coverage 80.45% 80.45%
=======================================
Files 527 527
Lines 46215 46219 +4
=======================================
+ Hits 37181 37185 +4
Misses 9034 9034
Looks great! Some minor suggestions: given that the In addition, could you also update the entry in
Thanks for the suggestion!
Hmm... it seems that some tests regarding
@siddhu001, could you check this?
@sw005320 It seems to be a problem with Hugging Face.
I think it is caused by a Hugging Face version update. I am looking into this. @Takaaki-Saeki if you have any suggestions, please let me know!
Hi @siddhu001, I think it is related to the version of PyTorch. But this arg is not implemented in PyTorch v1.5.0 (see this), and it was introduced in v1.6.0 (see this). So could you take a look at the PyTorch version?
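For reference, a minimal sketch of how the installed PyTorch version could be compared against v1.6.0 before relying on an argument introduced in that release. The helper names `parse_version` and `version_at_least` are hypothetical and not part of ESPnet or PyTorch; the sketch only looks at the leading numeric components of the version string and ignores pre-release or local build suffixes.

```python
import re


def parse_version(version: str) -> tuple:
    """Return the leading numeric components, e.g. '1.5.0+cu101' -> (1, 5, 0).

    Hypothetical helper: suffixes such as '+cu101' or 'a0' are ignored.
    """
    match = re.match(r"(\d+(?:\.\d+)*)", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return tuple(int(part) for part in match.group(1).split("."))


def version_at_least(installed: str, required: str) -> bool:
    """Check whether `installed` is the same as or newer than `required`."""
    a, b = parse_version(installed), parse_version(required)
    # Pad the shorter tuple with zeros so that '1.6' compares equal to '1.6.0'.
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b
```

With PyTorch installed, this could be called as `version_at_least(torch.__version__, "1.6.0")` to decide whether to skip a test or fall back to an older code path.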
Many thanks! Merged.
This PR adds a TTS recipe on M-AILABS.
TTS recipes on M-AILABS were supported in espnet1 but not in espnet2.
This recipe uses only the en_US single-speaker data.
I have uploaded a pretrained Tacotron2 model to HuggingFace.
I also prepared a script for multilingual data preparation, local/data_multilingual.sh, to train a TTS model on the whole dataset.