Add espnet2 TTS recipe on M-AILABS #4701

Takaaki-Saeki · 2022-10-09T21:30:37Z

This PR adds a TTS recipe on M-AILABS.
TTS recipes on M-AILABS were supported in espnet1 but not in espnet2.

This recipe uses only the en_US single-speaker data.
I have uploaded a pretrained Tacotron2 model to HuggingFace.

I also prepared a script for multilingual data preparation, local/data_multilingual.sh, to train a TTS model on the whole dataset.

codecov · 2022-10-09T22:21:23Z

Codecov Report

Merging #4701 (0418427) into master (e9d583b) will increase coverage by 0.00%.
The diff coverage is 100.00%.

@@           Coverage Diff           @@
##           master    #4701   +/-   ##
=======================================
  Coverage   80.45%   80.45%           
=======================================
  Files         527      527           
  Lines       46215    46219    +4     
=======================================
+ Hits        37181    37185    +4     
  Misses       9034     9034

| Flag | Coverage Δ | |
|---|---|---|
| test_integration_espnet1 | `66.37% <ø> (ø)` | |
| test_integration_espnet2 | `49.05% <0.00%> (-0.01%)` | ⬇️ |
| test_python | `68.66% <100.00%> (+<0.01%)` | ⬆️ |
| test_utils | `23.30% <ø> (ø)` | |

Flags with carried forward coverage won't be shown.

| Impacted Files | Coverage Δ | |
|---|---|---|
| espnet2/text/phoneme_tokenizer.py | `82.15% <100.00%> (+0.26%)` | ⬆️ |


ftshijt · 2022-10-10T00:55:26Z

Looks great! One minor suggestion: given that local/data.sh is called automatically in the TEMPLATE, how about combining data.sh and data_multilingual.sh, with an additional argument? That way, we can change the configuration simply by changing --local_data_opts.

In addition, could you also update the entry in egs2/TEMPLATE/README.md?

Takaaki-Saeki · 2022-10-10T13:27:56Z

Thanks for the suggestion!
I have merged the en_us and multilingual recipes.
Also updated the readme.

Takaaki-Saeki · 2022-10-10T20:59:59Z

Hmm... it seems that some tests regarding espnet2/slu/test_transcript_espnet_model.py are not successful.

sw005320 · 2022-10-11T12:04:00Z

> Hmm... it seems that some tests regarding espnet2/slu/test_transcript_espnet_model.py are not successful.

@siddhu001, could you check this?
Have some interfaces changed?

siddhu001 · 2022-10-12T17:34:16Z

@sw005320 It seems to be a problem with Hugging Face.

E       TypeError: register_buffer() got an unexpected keyword argument 'persistent'
tools/venv/lib/python3.7/site-packages/transformers/models/bert/modeling_bert.py:198: TypeError

I think it is because of some huggingface version update. I am looking into this. @Takaaki-Saeki if you have any suggestions, please let me know!

Takaaki-Saeki · 2022-10-12T20:41:00Z

Hi @siddhu001,

I think it is related to the version of PyTorch.
The BertEmbeddings class passes persistent=False to nn.Module.register_buffer().
See this.

But this argument is not implemented in PyTorch v1.5.0 (see this); it was introduced in v1.6.0 (see this).

So could you take a look at the PyTorch version used in the failing test?
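For illustration, the version check implied above can be sketched in a few lines of Python. This is a minimal sketch, not code from the PR: the helper names `parse_major_minor` and `supports_persistent_buffer` are hypothetical, and the 1.6 threshold is taken from the comment above rather than verified independently.

```python
import re

# Assumption (from the comment above): register_buffer() accepts the
# `persistent` keyword starting with PyTorch v1.6.0.
MIN_VERSION = (1, 6)


def parse_major_minor(version: str) -> tuple:
    """Extract (major, minor) from a string such as '1.12.1+cu113'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def supports_persistent_buffer(version: str) -> bool:
    """Return True if this PyTorch version should accept `persistent`."""
    return parse_major_minor(version) >= MIN_VERSION


# Hypothetical usage in an environment where PyTorch is installed:
#   import torch
#   supports_persistent_buffer(torch.__version__)
```

Comparing `(major, minor)` tuples rather than raw strings avoids the usual pitfall where `"1.10"` sorts before `"1.6"` lexicographically.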

ftshijt · 2022-10-30T03:30:05Z

Many thanks! Merged.

Add TTS recipe for M-AILABS (en_us, single speaker).

e2ff65a

mergify bot added the ESPnet2 label Oct 9, 2022

Takaaki-Saeki changed the title ~~Add TTS recipe on M-AILABS~~ Add espnet2 TTS recipe on M-AILABS Oct 9, 2022

Merge branch 'master' into recipe_mailab

4837f45

Takaaki-Saeki marked this pull request as ready for review October 9, 2022 21:48

add readme.md

184ef17

mergify bot added the README label Oct 9, 2022

sw005320 added this to the v.202211 milestone Oct 10, 2022

sw005320 added Recipe TTS Text-to-speech labels Oct 10, 2022

merge en_us and multilingual training

a820129

Takaaki-Saeki added 4 commits October 10, 2022 23:22

Add italian and polish phoneme tokenizers

29e2b6b

add tokenizer info to readme

a2d6355

fix typo on config

749f759

fix typo in config

1154f1b

fix data_prep

de60cd9

mergify bot added the ESPnet1 label Oct 12, 2022

refactor data_prep.sh

1e4e9ef

kan-bayashi and others added 2 commits October 29, 2022 10:42

Merge branch 'master' into recipe_mailab

7c39110

Merge branch 'master' into recipe_mailab

0418427

ftshijt merged commit a25ad80 into espnet:master Oct 30, 2022
