Add Edward voice to NVDA #8177

michaelDCurran · 2018-04-13T07:36:27Z

Link to issue number:

Summary of the issue:

The Edward voice from speechPlayerInEspeak has proven extremely popular with people who have had issues with espeak in the past. Many people have asked for Edward to be included in NVDA.

Description of how this pull request fixes the issue:

This PR replaces eSpeak's Klatt implementation with that of nvSpeechPlayer, and includes the Edward variant for use with eSpeak.

Testing performed:

Ran NVDA. Switched to Edward voice. Read some text.

Known issues with pull request:

This changes the sound of all Klatt voices in eSpeak, not just the Edward voice, as speechPlayer now handles generate of all Klatt frames. I don't think this should be a problem as the older Klatt voices in most cases were distorted and had strange artifacts.

Change log entry:

New Features:
- The Edward voice from SpeechPlayerInEspeak is now available as a variant while using eSpeak with NVDA. No extra add-on is required. (Include Edward voice from NV Speechplayer with NVDA for Windows 10s #7870)

This implementation is clean enough to show improvements to espeak's klatt sound, but there are still some buffering issues at certain speeds.

…least 1 sample long. Sometimes after subtracting fades it can be 0 or negative!

…peak voices

… to its fade in, don't bother playing that frame at all. Previously its length was forced to 1 ms, but this would cause a jump from the first frame to the second frame if there was a fade out. And instead of submitting a null frame for the fade out, submit the same second frame but with volume of 0, so that even if we didn't play the second frame, at least the fade out would fade towards it.

… to 0 at the end of audio. preFormantGain was not enough as resonators can produce tails. Stops apparent clipping at the end of some utterances.

…sals sound perhaps a little less... nasally?

…r in eSpeak

…was what was in the survey

…g where no klatt output could be heard (instead just espeak consonants) when starting NVDA or sometimes a little way through, due to an uninitialized variable.

…utput (a voiced consonant), decrease the output gain of the klatt frames to be queued by a factor of 5. this is similar to how speechPlayer did it and helps to stop perceiving voiced consonants as being too long (e.g. language)

…e offset and max values. Stops noise junk at the end of words such as 'az' when at slow speeds.

…tly too long.

… there is a waveFile that is being / was being mixed. Previously the volume went straight back up as soon as the end of the wave file was reached. This change slightly improves the grunt heard at the end of voiced consonants at high rates.

…consonants a little.

…layer add-on to use them.

LeonarddeR · 2018-04-13T11:54:54Z

@michaelDCurran: Has it ever been considered to file this upstream for espeak-ng?

ehollig · 2018-04-13T14:30:59Z

This may close #7870 and possibly related to #4578 and #5272

michaelDCurran · 2018-04-13T23:24:35Z

The maintainer of eSpeak-ng wishes to eventually address the underlying issues in the existing Klatt implementation, and has no interest in integrating speechPlayer (which is c++) into the project at this stage. So for the foreseeable future, the best way for people to get Edward is to integrate speechPlayer in NVDA.

feerrenrut · 2018-04-15T23:51:12Z

nvdaHelper/espeak/sconscript

@@ -138,8 +141,13 @@ espeakLib=env.SharedLibrary(
 		# com\comentrypoints.c
 		# com\ttsengine.cpp
 		# We do not use the ASYNC compile option in espeak.
+		speechPlayerCpp,


I have been trying to keep this list in alphabetical order, it makes it easier to spot missing entries when updating espeak. Could you move this up, before synthdata.c. Maybe you could also add a comment to say that loose files should be added in alphabetical order.

feerrenrut · 2018-04-16T00:10:00Z

nvdaHelper/espeak/speechPlayer.cpp

+extern unsigned char *out_end;
+
+speechPlayer_handle_t speechPlayerHandle=NULL;
+#define minFadeLength 110


Please make minFadeLength a constant rather than a #define

feerrenrut · 2018-04-16T00:10:32Z

nvdaHelper/espeak/speechPlayer.cpp

+extern unsigned char *out_ptr;
+extern unsigned char *out_end;
+
+speechPlayer_handle_t speechPlayerHandle=NULL;


Please use nullptr instead of NULL

feerrenrut · 2018-04-16T00:15:49Z

nvdaHelper/espeak/speechPlayer.cpp

+#define minFadeLength 110
+
+inline bool needsMixWaveFile() {
+	return wdata.n_mix_wavefile>0;


Is wdata.n_mix_wavefile a BOOL? If so, I would prefer to see this explicitly cast to boolean: static_cast<bool>(wdata.n_mix_wavefile)

feerrenrut · 2018-04-16T00:16:32Z

nvdaHelper/espeak/speechPlayer.cpp

+	return wdata.n_mix_wavefile>0;
+}
+
+unsigned int mixWaveFile(unsigned int maxNumSamples, sample* sampleBuf) {


Could you add documentation for this? Specifically, what is the return?

feerrenrut · 2018-04-16T06:01:20Z

nvdaHelper/espeak/speechPlayer.cpp

+			signed char c=wdata.mix_wavefile[wdata.mix_wavefile_ix+wdata.mix_wavefile_offset];
+			val+=(c*256);
+		} else {
+			val=(signed char)wdata.mix_wavefile[wdata.mix_wavefile_ix+wdata.mix_wavefile_offset]*wdata.mix_wave_scale;


please explicitly cast with static_cast<signed char>(blah) you can capture blah in whatever type it starts in with: auto blah = wdata.mix.....

feerrenrut · 2018-04-16T06:05:32Z

nvdaHelper/espeak/speechPlayer.cpp

+}
+
+bool isKlattFrameFollowing() {
+	for(int i=(wcmdq_head+1)%N_WCMDQ;i!=wcmdq_tail;i=(i+1)%N_WCMDQ) {


It would be nice to have some comment to explain what's going on here.

feerrenrut · 2018-04-16T06:05:55Z

nvdaHelper/espeak/speechPlayer.cpp

+	return false;
+}
+
+void fillSpeechPlayerFrame(frame_t * eFrame, speechPlayer_frame_t* spFrame) {


magic numbers in this function.

feerrenrut · 2018-04-16T06:09:37Z

nvdaHelper/espeak/speechPlayer.cpp

+			speechPlayer_queueFrame(speechPlayerHandle,&spFrame2,minFadeLength/2,minFadeLength/2,-1,false);
+			spFrame2.outputGain=0;
+			speechPlayer_queueFrame(speechPlayerHandle,&spFrame2,minFadeLength/2,minFadeLength/2,-1,false);
+			//speechPlayer_queueFrame(speechPlayerHandle,NULL,1,1,-1,false);


remove this comment

feerrenrut · 2018-04-16T06:10:07Z

nvdaHelper/espeak/speechPlayer.cpp

+
+int Wavegen_Klatt2(int length, int resume, frame_t *fr1, frame_t *fr2){
+	if(!resume) {
+		speechPlayer_frame_t spFrame1={0};


Could you comment or clarify any of this?

valiant8086 · 2018-04-18T03:55:24Z

This is expected to not be available in the latest NVDA next as of this writing, right? It would need to be approved first? Sorry I'm a little behind on my terminology.

feerrenrut · 2018-04-18T03:58:21Z

@valiant8086 Once you see the incubating label applied, then this will become available in next.

michaelDCurran · 2018-05-02T05:33:55Z

nvdaHelper/espeak/speechPlayer.cpp

+		if(i>=maxNumSamples) break;
+		int val;
+		if(wdata.mix_wave_scale==0) {
+			val=wdata.mix_wavefile[wdata.mix_wavefile_ix+wdata.mix_wavefile_offset];


I think I should keep this as is, as sometimes mix_wavefile_ix is incremented, and the mix_wavefile_ix+mix_wavefile_offset is used both before and after at times. Surely the compiler would optimize the addition out where it can?

My comment was more about trying to improve the clarity of the code, to make it easier to see where the same value is being used, rather than about optimisation.

LeonarddeR · 2018-07-03T18:56:20Z

Is this still on the radar? It looks like last review actions date from like two months ago.

Adriani90 · 2019-01-05T14:49:24Z

@michaelDCurran are now all review actions done?

feerrenrut · 2019-06-11T07:09:26Z

We have discussed this again. From a product level I think it makes more sense to separate this from espeak. It appearing as it's own synthesizer will make it clearer that it has something different from espeak, and will also help us to separate issues that occur in this / espeak. That said, it would be best if it continued to use the existing espeak submodule, I wouldn't want to duplicate that.

ahicks92 · 2020-01-08T16:53:08Z

With the upcoming transition to Python 3, I believe that the add-on is going to break. It's been my primary synth for years so figured it's worth chiming in to say this would be valuable to me.

lukaszgo1 · 2020-02-16T21:07:03Z

@michaelDCurran Is further work on this pr planned in the near future? Some people liked the voice from the addon, and it can no longer be used in 2019.3.

michaelDCurran · 2020-03-04T23:22:21Z

NV Access has chosen to no longer work on this particular project. Although I did update nvaccess/nvSpeechPlayer to be Python3 compatible and the add-on to be NVDA 2019.3 compatible, there are some breaking changes in eSpeak that specifically cause speechPlayerInEspeak (Edward) to no longer correctly speak voiced consonants (z, v, j, g etc). This broke somewhere in late eSpeak 1.48 or 1.49. I don't personally have the time to look into this anymore. Though of course other members of the community would be most welcome to pick this up and try and further debug and solve the issues.

michaelDCurran added 30 commits July 30, 2014 20:09

embed speechPlayer inside espeak as the klatt synthesis engine.

d197b83

This implementation is clean enough to show improvements to espeak's klatt sound, but there are still some buffering issues at certain speeds.

convey espeak's amplitude to speechPlayer.

8505cb5

keep eSpeak at 1.48.03 like official NVDA

b5c33c2

ensure that the length of a frame given to speechPlayer is always at …

0537b95

…least 1 sample long. Sometimes after subtracting fades it can be 0 or negative!

turn up speechPlayr a little more by default to better match other eS…

6d76f15

…peak voices

Merge branch 'master' into speechPlayerInEspeak

0b540aa

Merge branch 'master' into speechPlayerInEspeak

175a67d

speechPlayer in eSpeak: use outputGain in speechPlayer to fade ontput…

09ae59a

… to 0 at the end of audio. preFormantGain was not enough as resonators can produce tails. Stops apparent clipping at the end of some utterances.

Merge branch 'master' into speechPlayerInEspeak

478d786

speechPlayer in eSpeak: drop nasal pole frequency a little to make na…

6e5b526

…sals sound perhaps a little less... nasally?

Add a new Edward eSpeak variant which should be used with speechPlaye…

f1437d0

…r in eSpeak

Edward variant: correct some formant settings to ensure its the same …

4ce4bff

…was what was in the survey

Use a newer version of speechPlayer which should hopefully fix the bu…

174a026

…g where no klatt output could be heard (instead just espeak consonants) when starting NVDA or sometimes a little way through, due to an uninitialized variable.

Merge branch 'master' into speechPlayerInEspeak

f30f9aa

Merge branch 'master' into speechPlayerInEspeak

0ac652b

Merge branch 'master' into speechPlayerInEspeak

eb786b0

Merge branch 'master' into speechPlayerInEspeak

32f2a64

Upgrade to eSpeak 1.48.13 (dev)

94c9cb9

Merge branch 'master' into speechPlayerInEspeak

0966def

Merge branch 'master' into speechPlayerInEspeak

0a412c9

Merge branch 'master' into speechPlayerInEspeak

f625467

speechPlayer.cpp: When mixing wave files, ensure to honor the waveFil…

03c4be2

…e offset and max values. Stops noise junk at the end of words such as 'az' when at slow speeds.

speechPlayer.cpp: fix a typo. Was possibly causing chunks to be sligh…

1b00050

…tly too long.

speechPlayer: implement the voicing attribute in eSpeak voice files.

66bd38d

Make edward voice a little more brighter and take down the volume of …

115d075

…consonants a little.

eSpeak 1.48.15

f546b6e

Merge branch 'espeak1.48.15' into speechPlayerInEspeak

6b707ed

michaelDCurran added 5 commits December 7, 2016 12:07

Change speechplayerInEspeak to use espeak-ng.

41406cd

Merge branch 'master' into speechPlayerInEspeak

cdb1cbe

Export speechPlayer functions from espeak dll to also allow nvSpeechP…

49e2f5a

…layer add-on to use them.

Merge branch 'master' into speechPlayerInEspeak

fed8769

Drop volume for sibilants slightly in Edward voice.

f33f40e

michaelDCurran requested a review from feerrenrut April 13, 2018 07:36

LeonarddeR mentioned this pull request Apr 14, 2018

change default NVDA voice and variant #5272

Closed

feerrenrut suggested changes Apr 16, 2018

View reviewed changes

Merge branch 'master' into speechPlayerInEspeak

63f2c13

michaelDCurran commented May 2, 2018

View reviewed changes

michaelDCurran added 2 commits May 3, 2018 02:53

Review actions.

2f8a196

More review actions.

39fe365

LeonarddeR mentioned this pull request Jan 5, 2019

New variant of Edward Voice #4578

Open

LeonarddeR added AfterPython3Transition and removed AfterPython3Transition labels Apr 24, 2019

feerrenrut removed the AfterPython3Transition label May 8, 2019

Merge remote-tracking branch 'origin/master' into speechPlayerInEspeak

dbc0448

Merge branch 'master' into speechPlayerInEspeak

f150dfe

michaelDCurran closed this Mar 4, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Edward voice to NVDA #8177

Add Edward voice to NVDA #8177

michaelDCurran commented Apr 13, 2018 •

edited by LeonarddeR

LeonarddeR commented Apr 13, 2018

ehollig commented Apr 13, 2018

michaelDCurran commented Apr 13, 2018 via email

feerrenrut Apr 15, 2018

feerrenrut Apr 16, 2018

feerrenrut Apr 16, 2018

feerrenrut Apr 16, 2018

feerrenrut Apr 16, 2018

feerrenrut Apr 16, 2018

feerrenrut Apr 16, 2018

feerrenrut Apr 16, 2018

feerrenrut Apr 16, 2018

feerrenrut Apr 16, 2018

valiant8086 commented Apr 18, 2018

feerrenrut commented Apr 18, 2018

michaelDCurran May 2, 2018

feerrenrut May 6, 2018

LeonarddeR commented Jul 3, 2018

Adriani90 commented Jan 5, 2019

feerrenrut commented Jun 11, 2019

ahicks92 commented Jan 8, 2020

lukaszgo1 commented Feb 16, 2020

michaelDCurran commented Mar 4, 2020

Add Edward voice to NVDA #8177

Add Edward voice to NVDA #8177

Conversation

michaelDCurran commented Apr 13, 2018 • edited by LeonarddeR

Link to issue number:

Summary of the issue:

Description of how this pull request fixes the issue:

Testing performed:

Known issues with pull request:

Change log entry:

LeonarddeR commented Apr 13, 2018

ehollig commented Apr 13, 2018

michaelDCurran commented Apr 13, 2018 via email

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

valiant8086 commented Apr 18, 2018

feerrenrut commented Apr 18, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

LeonarddeR commented Jul 3, 2018

Adriani90 commented Jan 5, 2019

feerrenrut commented Jun 11, 2019

ahicks92 commented Jan 8, 2020

lukaszgo1 commented Feb 16, 2020

michaelDCurran commented Mar 4, 2020

michaelDCurran commented Apr 13, 2018 •

edited by LeonarddeR