You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi devs.
I'm not a developer, but I've come with a suggestion which revolvs around making this tts engine accessible for assistive technology(screen readers for blind people, ebook/text readers etc), and this has 2 options:
Making a sapi5(speech application programming interface) version of this tts engine available for download and use.
Making an addon for the nvda(non visual desktop access) free opensource python made screen reader.
The voice must be responsive, meaning it has no delay before the speech and no lag in the middle of speech.
If this can't be achieved with a neural network, would making an hts version of the voice(training an hts on the ljspeech dataset) and then uploading it to rhvoice so they can add it with the rest of the voices or as a standalone product. Rhvoice's voices are limited when it comes to inflection, and they don't handle punctuations correctly. Also, the voice should be high quality and inteligible as well as responsive and free of lag. But rh voices are less clean than say, google tts on android. It's still hts but it sounds better, even though it still can't handle punctuations propperly, maybe this is a downside of hts.
Hope this feature/suggestion is considered. If you have anything I can answer, then please comment on the issue and we can hopefully discuss this. Thanks and have a good time.
The text was updated successfully, but these errors were encountered:
Hi devs.
I'm not a developer, but I've come with a suggestion which revolvs around making this tts engine accessible for assistive technology(screen readers for blind people, ebook/text readers etc), and this has 2 options:
The voice must be responsive, meaning it has no delay before the speech and no lag in the middle of speech.
If this can't be achieved with a neural network, would making an hts version of the voice(training an hts on the ljspeech dataset) and then uploading it to rhvoice so they can add it with the rest of the voices or as a standalone product. Rhvoice's voices are limited when it comes to inflection, and they don't handle punctuations correctly. Also, the voice should be high quality and inteligible as well as responsive and free of lag. But rh voices are less clean than say, google tts on android. It's still hts but it sounds better, even though it still can't handle punctuations propperly, maybe this is a downside of hts.
Hope this feature/suggestion is considered. If you have anything I can answer, then please comment on the issue and we can hopefully discuss this. Thanks and have a good time.
The text was updated successfully, but these errors were encountered: