Add support for additional TTS integrations through non-Microsoft focused SpeechService interface #2379
Labels
9. enhancement
The behaviour is as specified, but we would like to modify or extend the spec.
EDDI currently uses whatever built in Windows TTS system is installed. Unfortunately, the built in Windows TTS are not particularly good.
This feature request is to ask for a better more modular SpeechService class that allows other speech engines to "plugin" that do not rely on the Windows TTS interfaces and provide the same WAV stream as the existing class uses.
Examples of other engines could include (but are not limited to):
As a proof of concept, here is an Amazon polly implementation I created.
https://gist.github.com/druggedhippo/0a887973ee019dea1fc9e522f513b0f5
Example audio of Amazon Polly processing a EDDI TTS prompt in real-time:
https://imgur.com/zyoWmQg
The text was updated successfully, but these errors were encountered: