-
Notifications
You must be signed in to change notification settings - Fork 77
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Using a microphone #80
Comments
@adamnova added something like this to the demo in: #9 However, I didn't wanted to add this Naudio dependency on the full demo, but since then, each example is done in a different project where there is no problem to have NAudio. I think it makes sense to move it as a standalone example. Also, for the best mic support, continuous recognition is also a must: #25 I would also add some mic example for blazor, as that would be pretty cool. |
My demo was basically a proof of concept, it is not very usable in practice. Without the continuous recognition, all you get is somewhat repeating lines of text. |
It appears there is now continuous recognition here: https://github.com/sandrohanea/whisper.net/tree/main/examples/ContinuousRecognition Though it appears that's an example rather than part of core, is there a chance of getting a microphone sample now? |
I was able to get realtime transcription from the mic working on my M1 Mac using the code below, which uses OpenTK.OpenAL. This is stitched together from various SO posts, and could be improved, but may be helpful to others looking to do similar. I ended up having to get the CoreML model manually, unzipping, and putting it in the current folder. Ideally IMO Whisper.net would "just work" and download this model when on apple silicon, similar to how it does the base .bin model. The other "gotcha" I ran into was that I needed to specify a float[] buffer and ALFormat.MonoFloat32Ext capture.
|
Will close any issue related to streaming processing as linked to: #25 |
Is there a way in the library to use the microphone and not just transcribe an existing recording?
because the original library has
in whisper.cpp
The text was updated successfully, but these errors were encountered: