[Bug] Phoneme extraction with punctuations is wrongly delimited #771

erogol · 2021-08-29T21:03:40Z

Describe the bug
Punctuation marks in extracted phonemes are delimited incorrectly.

For instance, the sentence "tuː foːɹ paʊndz , bʌt hɛviɚ aɪɚnz ," should be "tuː foːɹ paʊndz, bʌt hɛviɚ aɪɚnz,"

So punctuation marks do not need a preceding space.

I think the current implementation causes unnatural silences in the trained models.
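As a rough illustration of the expected output, the sketch below strips the whitespace that precedes a punctuation mark in an already-phonemized string. It is not the gruut or 🐸 TTS implementation; the helper name `attach_punctuation` and the punctuation set are assumptions made for this example only.

```python
import re

# Hypothetical post-processing helper; not part of gruut or 🐸 TTS.
# Assumed punctuation set. Note that the ASCII colon ":" differs from
# the IPA length mark "ː", so long vowels are left untouched.
_PUNCTUATION = ",.;:!?"
_SPACE_BEFORE_PUNCT = re.compile(r"\s+([" + re.escape(_PUNCTUATION) + r"])")


def attach_punctuation(phonemes: str) -> str:
    """Remove whitespace that directly precedes a punctuation mark."""
    return _SPACE_BEFORE_PUNCT.sub(r"\1", phonemes)


if __name__ == "__main__":
    print(attach_punctuation("tuː foːɹ paʊndz , bʌt hɛviɚ aɪɚnz ,"))
```

A workaround like this only patches the string after the fact; fixing the delimiting in the tokenizer itself avoids losing the original whitespace in the first place.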


synesthesiam · 2021-09-01T02:12:16Z

I'm reworking parts of gruut's tokenization pipeline to preserve whitespace. I'll delay updating the current pull request until these changes are in.

skol101 · 2021-09-19T21:36:32Z

@erogol would love to see it in the next minor version update, please.

erogol · 2021-09-19T22:20:33Z

It is pretty much in the hands of @synesthesiam

synesthesiam · 2021-09-19T23:47:32Z

I'm working on it! I have this pesky job that keeps taking my time 😉

This is taking longer than expected since I'm adding it and preliminary SSML support at the same time. It didn't seem worth it to me to redo the existing gruut tokenizer (to add proper whitespace preservation) only to scrap it later for SSML.

I will be completing the changes to gruut this week, and my goal is to have it integrated and tested with 🐸 TTS by 1 Oct 👍

stale · 2021-10-20T00:21:02Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look at our discussion channels.

erogol · 2021-10-22T18:52:41Z

I think Gruut 2 addresses this, right @synesthesiam?

synesthesiam · 2021-10-22T19:32:26Z

Yes, the punctuation doesn't have whitespace artificially added now.

stale · 2021-11-21T19:33:26Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. You might also look at our discussion channels.

erogol · 2021-11-23T17:48:22Z

I'm closing this as it's been fixed by the latest Gruut.

erogol added the "bug" label (Something isn't working) Aug 29, 2021

erogol assigned synesthesiam Aug 29, 2021

synesthesiam mentioned this issue Sep 29, 2021

[Bug] Gruut espeak inconsistencies makes the training harder. #680

Closed

synesthesiam mentioned this issue Oct 14, 2021

Update gruut to version 2.0 #882

Merged

stale bot added the "wontfix" label (This will not be worked on but feel free to help.) Oct 20, 2021

stale bot removed the "wontfix" label (This will not be worked on but feel free to help.) Oct 22, 2021

stale bot added the "wontfix" label (This will not be worked on but feel free to help.) Nov 21, 2021

erogol closed this as completed Nov 23, 2021
