Feature request: Wyoming API #4
Comments
Hi @ser, thanks for the suggestion. Since the idea is for this very simple extension to remain simple, I would rather not add features that, IMHO, will see limited use. Actually, for this use case, I would recommend starting from something like cliblurt, which uses minimal resources (the GUI is optional) and is not GNOME-only (it should work under XFCE4, for example).
It is not only Pis: 90% of my computers, older PCs and laptops, are unable to handle speech recognition in a sensible time. So, in other words, do you plan to add any API, or have you decided to keep everything local? BTW, this local stack is very complex, to be honest; using a client-server architecture would simplify things a lot, even on the same machine.
Valid points. It may not be such an edge case after all. If you would like, you can then use that as a base to craft an appropriate "multipart/form-data" curl request to conform to the Wyoming protocol and call the referenced faster-whisper server. Setting up this little hack will likely remain complex, since it is not a monolithic app but rather relies on the built-in tools and flexibility of the Linux system. An installation script would help automate things a bit; we will see.
Hi @ser, the extension can now be set up to transcribe over the network using a whisper.cpp server. Talking to a faster-whisper server should be possible to implement in a similar fashion.
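For readers who want to try network transcription outside the extension, here is a minimal sketch of a "multipart/form-data" upload in Python, using only the standard library. It assumes the whisper.cpp example server is listening on `127.0.0.1:8080`, exposes an `/inference` endpoint that takes the audio in a form field named `file`, and answers with JSON containing a `text` key when `response_format` is `json`. The function names `build_multipart` and `transcribe` are made up for this illustration and are not part of the extension; check the server's own README before relying on any of these details.

```python
import json
import urllib.request
import uuid


def build_multipart(fields, file_field, filename, file_bytes, boundary=None):
    """Encode text fields plus one file as a multipart/form-data body.

    Returns (body_bytes, content_type_header_value).
    """
    boundary = boundary or uuid.uuid4().hex
    parts = []
    for name, value in fields.items():
        text_part = (
            f"--{boundary}\r\n"
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
            f"{value}\r\n"
        )
        parts.append(text_part.encode("utf-8"))
    file_header = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{file_field}"; '
        f'filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    )
    parts.append(file_header.encode("utf-8") + file_bytes + b"\r\n")
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))  # closing delimiter
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def transcribe(wav_path, url="http://127.0.0.1:8080/inference"):
    """POST a WAV file to the (assumed) server endpoint and return the text."""
    with open(wav_path, "rb") as f:
        audio = f.read()
    body, content_type = build_multipart(
        {"response_format": "json"}, "file", "audio.wav", audio
    )
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": content_type}, method="POST"
    )
    with urllib.request.urlopen(request, timeout=60) as response:
        reply = json.loads(response.read().decode("utf-8"))
    return reply.get("text", "")  # assumed response key
```

With a server running, `print(transcribe("recording.wav"))` would print the recognized text, where `recording.wav` is a placeholder for your own file. Pointing `url` at a more capable machine on the local network is what lets a weak client offload the recognition.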
Fantastic! I am now investigating how many additional resources a whisper.cpp server would take on top of the current faster-whisper.
So in the end I decided to write a Wyoming server that uses the Whisper API, to avoid the need for two STT services: https://github.com/ser/wyoming-whisper-api-client
It would be cool if the extension could communicate with a local Faster Whisper instance via the Wyoming protocol API:
https://github.com/rhasspy/wyoming-faster-whisper
The advantage is that voice recognition could work on cheap GNOME clients, with one more capable machine on the local network.
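To make the request more concrete, here is a rough sketch of what a Wyoming client would have to put on the wire. It is based on my reading of the rhasspy/wyoming README, so treat every detail as an assumption to verify: Wyoming is a plain TCP protocol (not HTTP) in which each event is one line of JSON with a `type`, optional `data`, and an optional `payload_length`, followed by that many raw bytes. A transcription is assumed to be the event sequence `transcribe`, `audio-start`, one or more `audio-chunk` events carrying raw PCM, then `audio-stop`, after which the server replies with a `transcript` event. The helper names below are hypothetical.

```python
import json


def encode_event(event_type, data=None, payload=b""):
    """Frame one event: a JSON header line, then the optional binary payload."""
    header = {"type": event_type, "data": data or {}}
    if payload:
        header["payload_length"] = len(payload)
    return json.dumps(header).encode("utf-8") + b"\n" + payload


def encode_transcription_request(pcm, rate=16000, width=2, channels=1,
                                 chunk_bytes=2048):
    """Build the full byte stream for one utterance of raw PCM audio."""
    audio_format = {"rate": rate, "width": width, "channels": channels}
    events = [
        encode_event("transcribe"),
        encode_event("audio-start", audio_format),
    ]
    # Split the audio into fixed-size chunks, one event per chunk.
    for offset in range(0, len(pcm), chunk_bytes):
        chunk = pcm[offset:offset + chunk_bytes]
        events.append(encode_event("audio-chunk", audio_format, chunk))
    events.append(encode_event("audio-stop"))
    return b"".join(events)
```

The resulting bytes could be written to a TCP socket connected to the Wyoming server, and the reply read back as a JSON line. The sketch only covers framing on the sending side and leaves out reading the `transcript` event.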