Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Several issues with fr voice #852

Closed
npitre opened this issue Dec 12, 2020 · 0 comments
Closed

Several issues with fr voice #852

npitre opened this issue Dec 12, 2020 · 0 comments

Comments

@npitre
Copy link

npitre commented Dec 12, 2020

The following test strings produce garbled output, and they sometimes
even affect further synthesis requests to come after them. This looks
like some buffer overflow somewhere when unicode symbols are involved.

Example of problematic strings:

$ copyright="$(echo -ne "\xc2\xa9")"
$ espeak-ng -v fr -q -x "$copyright 2020"
sEb'Ol(base) @@@ @@@ @@@
$ espeak-ng -v fr -q -x "$copyright 2020, Tous droits réservés."
sE
b'Ol(base) @@@ @@@ @@@
@@@ @@@@@ @@@@@@@@

$ sapin="$(echo -ne "\xf0\x9f\x8e\x84")"
$ espeak-ng -v fr -q -x "$sapin"
sap'E~ d@- nO'El
$ espeak-ng -v fr -q -x "$sapin $sapin"
(en)kr'Ism@skr'Ism@s(fr)

$ espeak-ng -v fr -q -x "$sapin $copyright"
sap'E~ d@-- nO'El sE~b'Ol(base) nO'El

And some more with no unicode characters at all:

$ espeak-ng -v fr -q -x "PR"
profEs'Wr dOlart'Ekst

$ espeak-ng -v fr -q -x "1ᵉʳ"
'W~ l,Etr_|,W~_|d'e_|k,atr_|n,Wf_:'Er

@npitre npitre closed this as completed Dec 12, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant