Replies: 1 comment 1 reply
-
answered on the Caster repo, looks like the correct level would be to write a module that depends on whisper and provides a dragonfly api. the whisper implementation will likely need to reuse components from KAG, in particular whisper doesn't seem to have VAD or any sort of guided decoding. If anyone here can point me to guidance about how to extract the guided decoding code from dragonfly/KAG, it may save me time; if not, I may figure it out. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
https://github.com/openai/whisper
would this be an appropriate place to integrate openai whisper, for handling of longer dictation? note that as of right now, whisper doesn't support streaming, and it looks to me like it may require transfer learning to be able to add streaming support to it.
(I've asked this question in other repos as well (Caster), and I'm crosslinking between the questions)
Beta Was this translation helpful? Give feedback.
All reactions