
Problems in tokenizing LibriTTS #5

Closed · jry-king opened this issue Feb 7, 2023 · 10 comments

@jry-king commented Feb 7, 2023

Thanks for your reproduction of the VALL-E paper! When I tried to prepare the LibriTTS data with prepare.sh, I ran into this problem:
[screenshot of the error]
I'm only using 4 of the 7 LibriTTS tar files (train-clean-100, train-clean-360, test-clean, and dev-clean). Could you give me some suggestions about what's going on? Thanks!

@lifeiteng (Owner)

datasets fix a23f7c9

@jry-king (Author) commented Feb 8, 2023

It looks like the command is fixed, but the "words count mismatch" warning is still there. The only thing I found is that it might be caused by some inconsistency between the text files and the phonemes. Will it actually affect the result?

@lifeiteng (Owner)

I haven't figured out these warnings yet; I checked the results and there is nothing wrong.

@pandamq commented Feb 23, 2023

@jry-king Hi, I hit the same warning. I assume the reason is that the espeak tokenizer frequently merges several words into a single phone sequence, e.g. "of the" into "ʌvðə".
Do you have a solution to fix it, or can we ignore the warning and still train the model correctly? Thanks!

@lifeiteng (Owner)

> @jry-king Hi, I hit the same warning. I assume the reason is that the espeak tokenizer frequently merges several words into a single phone sequence, e.g. "of the" into "ʌvðə". Do you have a solution to fix it, or can we ignore the warning and still train the model correctly? Thanks!

Can you provide more cases?

We need to dig into https://github.com/bootphon/phonemizer.

@pandamq commented Feb 26, 2023

> @jry-king Hi, I hit the same warning. I assume the reason is that the espeak tokenizer frequently merges several words into a single phone sequence, e.g. "of the" into "ʌvðə". Do you have a solution to fix it, or can we ignore the warning and still train the model correctly? Thanks!

It's hard to count them all, but here are some cases (a sketch reproducing them follows this list):
"on the floor" → "ɔnðə floːɹ"
"had been" → "hɐdbɪn"
"of a" → "əvə"
"would have" → "wʊdhɐv"
"there is" → "ðɛɹɪz"

@lifeiteng (Owner)

These errors should be fixed:

ɜː -> ɜ ː
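
If that notation means the length mark ː was being split off its vowel during tokenization, a post-pass along these lines could re-attach it (a hedged illustration, not the repository's actual fix):

```python
# Assumption: a naive character-level split detaches the IPA length mark 'ː'
# from its vowel ("ɜː" -> ["ɜ", "ː"]). This pass glues it back on.
def merge_length_marks(symbols):
    merged = []
    for s in symbols:
        if s == "ː" and merged:
            merged[-1] += s  # re-attach the length mark to the previous phone
        else:
            merged.append(s)
    return merged

assert merge_length_marks(list("fɜːst")) == ["f", "ɜː", "s", "t"]
```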

@lifeiteng (Owner)

@jry-king @mingqilearning Thank you for reporting these issues. It would be great if you guys could fix them.

Training looks good now #37

@pandamq commented Mar 3, 2023

> @jry-king @mingqilearning Thank you for reporting these issues. It would be great if you guys could fix them.
>
> Training looks good now #37

I tried to fix it but failed. Does "training looks good" mean the warning doesn't affect training and inference?
It would be great if you could share your phonemizer and espeak-ng versions.
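
For anyone comparing setups, one quick way to print the installed version (the versions actually used in this thread are not stated):

```python
# Print the installed phonemizer version. espeak-ng's own version can be
# checked from the shell with `espeak-ng --version`.
from importlib.metadata import version
print("phonemizer", version("phonemizer"))
```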

@lifeiteng (Owner)

espeak-ng/espeak-ng#1703
