German: Espeak is unable to speak grouped numbers #5235

nvaccessAuto · 2015-07-20T18:56:04Z

Reported by bdorer on 2015-07-20 18:56
This is reproduceable with numbers like 1.000.000, 2.000.000 and so on. Espeak ignores all groups of 0 behind thousand.

I don't know wheather this is espeaks fault as it is reproduceable with espeaks sapi5 version and SVox Pico. I'll test with Vocalizer and report.

nvaccessAuto · 2015-07-23T10:04:00Z

Comment 1 by The_Dark_Man on 2015-07-23 10:04
I think, this bug is only in the German version of eSpeak. In English, they write 2,000,000.

nvaccessAuto · 2015-08-07T11:31:16Z

Comment 2 by jteh on 2015-08-07 11:31
This isn't a bug in eSpeak. It's a problem in the NVDA symbols.dic for German. A complex symbol rule probably needs to be added for the thousands separator. This isn't necessary for English because comma is usually past through to the synth unchanged. The date separator in the German symbols is also causing problems here, though you might be able to get around this by making sure the thousands separator rule is above it so it takes precedence.

nvaccessAuto · 2015-08-07T11:36:57Z

Comment 3 by jteh on 2015-08-07 11:36
Something like the following complex symbol rule should do the trick (untested):

thousands separator (?<=\d)\.<?=\d{3})

and in symbols:

thousands separator punkt   all norep

nvaccessAuto · 2015-08-08T08:52:23Z

Comment 4 by chrislm (in reply to comment description) on 2015-08-08 08:52
Replying to bdorer:

I don't know wheather this is espeaks fault as it is reproduceable with espeaks sapi5 version and SVox Pico.

Running "espeak -vde" in command line those numbers are spoken correctly with eSpeak sapi5, also using espeakedit in german language.
SvPiko seems to not read beyond the six digit in any language.
Could a native german speaker test this regexpt?
Pattern:
(?<=\d)(.?)(\d{3})
R:
\2

nvaccessAuto · 2015-08-10T17:08:55Z

Comment 5 by bdorer (in reply to comment 3) on 2015-08-10 17:08
Replying to jteh:

Something like the following complex symbol rule should do the trick (untested):
thousands separator   (?<=\d)\.<?=\d{3})
and in symbols:
thousands separator   punkt   all norep

hmm, this rule isn't complete. NVDA reports mismatching of braces. I tried the following:

thousands separator (?<=\d)\.(<?=\d{3})

If you meant that, this rule isn't working as expected. Furthermore if I spell 30.000.000 NVDA says 3 dot 000thausand.

nvaccessAuto · 2015-08-10T17:13:26Z

Comment 6 by bdorer (in reply to comment 4) on 2015-08-10 17:13
Replying to chrislm:

Replying to bdorer:

I don't know wheather this is espeaks fault as it is reproduceable with espeaks sapi5 version and SVox Pico.

Running "espeak -vde" in command line those numbers are spoken correctly with eSpeak sapi5, also using espeakedit in german language.

SvPiko seems to not read beyond the six digit in any language.

Could a native german speaker test this regexpt?

Pattern:

(?<=\d)(.?)(\d{3})

R:

\2

hmm, I don't know how to test your regular expression as I am not familiar with it. Which part should I use as pattern and which part as Replacement?
Thanks for your help.

nvaccessAuto · 2015-08-10T19:58:07Z

Comment 7 by chrislm (in reply to comment 6) on 2015-08-10 19:58
Replying to bdorer:

I don't know how to test your regular expression as I am not familiar with it. Which part should I use as pattern and which part as Replacement?

Sorry, I mean to test it in a speech dictionary.
From Preferences menu open a Temporary dictionary, insert pattern and replacement and choose Regular expression in the radio button.
Thanks.

nvaccessAuto · 2015-08-10T20:09:45Z

Comment 8 by bdorer (in reply to comment 7) on 2015-08-10 20:09

Sorry, I mean to test it in a speech dictionary.

From Preferences menu open a Temporary dictionary, insert pattern and replacement and choose Regular expression in the radio button.

Thanks.

Sure, but I didn't understand R as an abbreviation for replacement as you wrote pattern and not p.
Your rule fixes it. Now I need this as an regexp for complexSymbols like

thousands separator (?<=\d)\.(<?=\d{3})

This one isn't working for me.

nvaccessAuto · 2015-08-10T21:34:50Z

Comment 9 by chrislm (in reply to comment 8) on 2015-08-10 21:34
Replying to bdorer:

thousands separator (?<=\d).(<?=\d{3})
This one isn't working for me.

Try so:

thousands separator (?<=\d)\.(?=\d{3})

nvaccessAuto · 2015-08-10T22:08:24Z

Comment 10 by bdorer on 2015-08-10 22:08
Thanks! This regexp is doing the job.
@jamie may it's worth documenting such regexps for other languages on the wiki?

nvaccessAuto · 2015-08-10T23:27:38Z

Comment 11 by jteh on 2015-08-10 23:27
Ug. Yeah, that's the expression i meant; sorry about the typos.

Yeah, we should probably document this somewhere. Perhaps we could add a Tips section to TranslatingSymbols.

nvaccessAuto · 2015-08-11T10:31:00Z

Comment 12 by chrislm on 2015-08-11 10:31
this ticket can be used for other thousands separators?
Sometimes is used a space character as separator, for example in many articles on Wikipedia.
Probably enter a standard space in a symbols rule may cause problems, but the specific character below It is widely used as a thousands separator.

Character: " "
Name: "thin space"
UNICODE: "u+2009"

nvaccessAuto · 2015-08-11T14:59:37Z

Comment 13 by bdorer on 2015-08-11 14:59
hmm, I think so. Espeak for example accepts spaces as thousands separator for german with no problem.

nvaccessAuto · 2015-08-19T22:42:06Z

Comment 14 by bdorer on 2015-08-19 22:42
Bah, @jteh would it be possible to merge symbols.dic of SVN-Rev 23136? There was a typo which I fixed now. Sorry for my inconviniance!

nvaccessAuto · 2015-08-19T22:53:37Z

Comment 15 by jteh on 2015-08-19 22:53
Sorry, but we can only accept critical changes for 2015.3 now (i.e. fixes for crashes or serious security issues). Is this really a critical change?

nvaccessAuto · 2015-08-19T22:59:34Z

Comment 16 by bdorer on 2015-08-19 22:59
in this case, it isn't but it is confusing for many Germans.

nvaccessAuto · 2015-08-19T23:05:48Z

Comment 17 by jteh on 2015-08-19 23:05
Can you explain the impact? That is, what will happen if we don't take this?

nvaccessAuto · 2015-08-19T23:09:00Z

Comment 18 by bdorer on 2015-08-19 23:09
well, many synths don't say thousand and million and so on on grouped numbers.

nvaccessAuto · 2015-08-19T23:10:14Z

Comment 19 by jteh on 2015-08-19 23:10
Let me put this another way: is there any regression from 2015.2 without this change? That is, is there something in 2015.2 that worked but is now broken because of this mistake?

nvaccessAuto · 2015-08-19T23:27:25Z

Comment 20 by bdorer on 2015-08-19 23:27
well, synths wich spoke groups of thousands correct in 2015.2 have now a wrong speech as they don't say thousand and so on in grouped numbers

nvaccessAuto · 2015-08-19T23:39:43Z

Comment 21 by jteh on 2015-08-19 23:39
This doesn't seem to match my testing. eSpeak German reports thousands and millions when I do, for example, 1.000 or 1.000.000.

nvaccessAuto · 2015-08-20T05:46:52Z

Comment 22 by bdorer on 2015-08-20 05:46
sure but it doesn't work for Microsoft sapi5 Hedda and vocalizer for nvda which are used on many computers.
I don't have more sapi5 voices to test.

nvaccessAuto · 2015-08-24T02:13:13Z

Comment 23 by James Teh <jamie@... on 2015-08-24 02:13
In [81824fa]:

German symbols: Fix typo which was breaking the thousands separator.

Re #5235.

nvaccessAuto · 2015-08-24T23:22:48Z

Comment 24 by jteh on 2015-08-24 23:22
This can be closed, since it was fixed in the German symbols.

I'd be reluctant to do this for spaces, as it might match when it shouldn't. We've certainly had problems with this in the past in spreadsheets, etc. where coordinates get mixed with numbers, and while that particular case is fixed, there could be others. However, this is really up to the German community to decide.
Changes:
Changed title from "Espeak is unable to speak grouped numbers" to "German: Espeak is unable to speak grouped numbers"
Milestone changed from None to 2015.3
State: closed

Re #5235.

The-Dark-Man · 2016-06-05T13:52:01Z

The Issue is back.

Christianlm · 2016-06-06T08:45:44Z

the level for tousan separator in german symbols has been change to "none".
Set a higher level such as "all" to solve it.

The-Dark-Man · 2016-06-06T18:16:34Z

It works. Thank you!

bdorer · 2016-06-07T15:49:39Z

Hi, I fixed this again. Sorry for theinkonvinience

Am 06.06.2016 um 20:16 schrieb The-Dark-Man:

It works. Thank you!

—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
#5235 (comment),
or mute the thread
https://github.com/notifications/unsubscribe/AKun5bYGlbpoys2GM5nELt0_r8Kg4Q0jks5qJGQEgaJpZM4IuW-C.

nvaccessAuto added bug component/i18n existing localisations or internationalisation labels Nov 10, 2015

nvaccessAuto assigned jcsteh Nov 10, 2015

nvaccessAuto added this to the 2015.3 milestone Nov 10, 2015

nvaccessAuto closed this as completed Nov 10, 2015

jcsteh added a commit that referenced this issue Nov 23, 2015

German symbols: Fix typo which was breaking the thousands separator.

81824fa

Re #5235.

The-Dark-Man mentioned this issue Jun 4, 2016

Switch to eSpeak NG in NVDA distribution #5651

Closed

jcsteh mentioned this issue Jun 23, 2016

The issue #5235 is back #6039

Closed

DrSooom mentioned this issue Sep 6, 2019

Question mark is not verbalized #10164

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

German: Espeak is unable to speak grouped numbers #5235

German: Espeak is unable to speak grouped numbers #5235

nvaccessAuto commented Jul 20, 2015

nvaccessAuto commented Jul 23, 2015

nvaccessAuto commented Aug 7, 2015

nvaccessAuto commented Aug 7, 2015

nvaccessAuto commented Aug 8, 2015

nvaccessAuto commented Aug 10, 2015

nvaccessAuto commented Aug 10, 2015

nvaccessAuto commented Aug 10, 2015

nvaccessAuto commented Aug 10, 2015

nvaccessAuto commented Aug 10, 2015

nvaccessAuto commented Aug 10, 2015

nvaccessAuto commented Aug 10, 2015

nvaccessAuto commented Aug 11, 2015

nvaccessAuto commented Aug 11, 2015

nvaccessAuto commented Aug 19, 2015

nvaccessAuto commented Aug 19, 2015

nvaccessAuto commented Aug 19, 2015

nvaccessAuto commented Aug 19, 2015

nvaccessAuto commented Aug 19, 2015

nvaccessAuto commented Aug 19, 2015

nvaccessAuto commented Aug 19, 2015

nvaccessAuto commented Aug 19, 2015

nvaccessAuto commented Aug 20, 2015

nvaccessAuto commented Aug 24, 2015

nvaccessAuto commented Aug 24, 2015

The-Dark-Man commented Jun 5, 2016

Christianlm commented Jun 6, 2016

The-Dark-Man commented Jun 6, 2016

bdorer commented Jun 7, 2016

German: Espeak is unable to speak grouped numbers #5235

German: Espeak is unable to speak grouped numbers #5235

Comments

nvaccessAuto commented Jul 20, 2015

nvaccessAuto commented Jul 23, 2015

nvaccessAuto commented Aug 7, 2015

nvaccessAuto commented Aug 7, 2015

nvaccessAuto commented Aug 8, 2015

nvaccessAuto commented Aug 10, 2015

nvaccessAuto commented Aug 10, 2015

nvaccessAuto commented Aug 10, 2015

nvaccessAuto commented Aug 10, 2015

nvaccessAuto commented Aug 10, 2015

nvaccessAuto commented Aug 10, 2015

nvaccessAuto commented Aug 10, 2015

nvaccessAuto commented Aug 11, 2015

nvaccessAuto commented Aug 11, 2015

nvaccessAuto commented Aug 19, 2015

nvaccessAuto commented Aug 19, 2015

nvaccessAuto commented Aug 19, 2015

nvaccessAuto commented Aug 19, 2015

nvaccessAuto commented Aug 19, 2015

nvaccessAuto commented Aug 19, 2015

nvaccessAuto commented Aug 19, 2015

nvaccessAuto commented Aug 19, 2015

nvaccessAuto commented Aug 20, 2015

nvaccessAuto commented Aug 24, 2015

nvaccessAuto commented Aug 24, 2015

The-Dark-Man commented Jun 5, 2016

Christianlm commented Jun 6, 2016

The-Dark-Man commented Jun 6, 2016

bdorer commented Jun 7, 2016