-
-
Notifications
You must be signed in to change notification settings - Fork 637
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ESpeak Voice Sounds Harsher in Master and Next Versions #5868
Comments
Is this the same as the command line flag that was added a couple of On 4/7/2016 5:51 PM, David Goldfield wrote:
Websites: email me at derek.riemer@colorado.edu mailto:derek.riemer@colorado.edu |
Actually, I cannot hear this on the internal realtek hardware, but a bglists@blueyonder.co.uk
|
I'm willing to see if there is a driver update for my sound card but I can tell you that this is not occurring in 2016.1. bglists@blueyonder.co.ukmailto:bglists@blueyonder.co.uk
— |
#3860 is exactly the same issue as what I'm hearing now, if that is helpful. |
Any ideas, @michaelDCurran? Seems we have #3860 again. Reading briefly, that was apparently due to badly compiled phoneme data. |
@dgoldfield What exact eSpeak settings are you using? I.e. rate, rate boost, variant, pitch. Also, what kind of sound card? Can you confirm that the issue is not seen in 2016.1? |
|
@dgoldfield Any possibility of getting a recording of both 2016.1 and next? Annoyingly I cannot reproduce the issue yet on at least 2 machines. I remember the old bug, and that was caused by using an incorrect version of eSpeakEdit. However now eSpeak has the ability to compile the phoneme data itself. It is very possible that there is a bug in the compilation code... but Nothing can be done until it can be reproduced. |
I can probably attach two audio samples, both from 2016.1 and a newer |
For me, if I do hear anything, it sounds as if next/master is slightly louder, and perhaps slightly compressed, compared to 2016.1. |
Next/master certainly also has slightly different EQing. Less trebble perhaps. Also some kind of low shelf. |
It would be useful to track down what is causing this difference. On my GitHub espeak branch, I have been able to compile espeak on Linux for a long time (all the tags should be buildable). They will likely require some work to get them to build on Windows, but that could be useful trying to track down the cause of the issue. Some things I want to test are:
This should help isolate where the issue is being introduced. |
I have recorded two separate .wav files. This system will not allow me to upload them, saying this type of file is not accepted. https://www.dropbox.com/sh/jx7d0kfac0pm2rh/AADyorRuZl3zCRf0Eiq8HQnXa?dl=0 The current build audio file is using 2016.1 and the master build audio file is using a master from April 21. In the file, I alt-tab into the Jarte text editor which contains the following sentence. Welcome to heading level 1. I am testing this synthesizer as I dialog with all of you about the various NVDA issues. I have NVDA read the file and then I navigate through some of it word by word and then character by character. I then alt-tab back into Audacity and stop the recording. |
Please try the latest NVDA Next snapshot (13300,9ab71476 or later). This contains the latest update to espeak-ng that apparently may fix the issue. On my system the difference I was hearing seem to have gone away. |
Congratulations to both NV Access and the ESpeak development team for this work. Yes, I believe it is fixed. At first, I wasn't sure as the two versions do have some differences but I think the differences I'm now hearing are changes to some of the phonemes. However, most of that harshness is gone and, in some ways, I think I'm even liking the new version a bit better. Thank you to all of you for your willingness to track this down. ESpeak is actually my preferred synthesizer when using NVDA and it's nice to know I won't need to switch to something else. Many thanks. |
The new ESpeak voices sound harsher, particularly with words with the letter V, such as "level." It sounds like it is saying "lebel" which becomes obvious when moving through headings on the Web and I hear items such as "heading lebel 1." Some words, such as "Internet" almost have a slight pop sound at the beginning, as though effects on my sound card were enabled. This is with English U.S. voices. This happened once before and it was addressed/I could try and locate the ticket if it would help.
The text was updated successfully, but these errors were encountered: