I've just got my custom fast-whisper model working on a Docker server and am looking at where I could implement this myself. I haven't changed the volume threshold settings for VAD yet, but I get a lot of junk tokens. With slow whisper I implemented a blacklist for phrases like "Thank you", "Thanks very much", etc. that the model tends to produce. I think transcribe() in transcriber.py is the place to pick out the phrases and expose them, but the process seems expensive, so I might need to look further.
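A minimal sketch of such a phrase blacklist, assuming the transcribed segments are available as plain strings. The names `JUNK_PHRASES`, `normalize` and `drop_junk` are hypothetical and not part of WhisperLive; the two phrases are just the examples mentioned above.

```python
import string

# Hypothetical blacklist; extend it with whatever junk phrases show up in practice.
JUNK_PHRASES = {"thank you", "thanks very much"}


def normalize(text: str) -> str:
    """Lowercase the text and strip surrounding whitespace and punctuation."""
    return text.strip().strip(string.punctuation + " ").lower()


def drop_junk(segments):
    """Return only the segments whose normalized text is not blacklisted."""
    return [s for s in segments if normalize(s) not in JUNK_PHRASES]
```

Matching on the whole normalized segment keeps the check to one set lookup per segment, and it avoids deleting a legitimate "thank you" that occurs inside a longer sentence.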
Looking at the code, I don't see how the library user is supposed to access the transcribed text?
It looks like it just gets printed?
WhisperLive/whisper_live/client.py
Line 123 in e1a42c2
I think a workaround would be to read the output.srt file, but maybe we could also just return the transcribed text as a string?
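The workaround could look roughly like the sketch below, assuming output.srt follows the usual SRT layout (index line, timestamp line, text lines, blank line). `srt_to_text` is a hypothetical helper, not something WhisperLive provides.

```python
import re
from pathlib import Path

# Matches an SRT timing line such as "00:00:01,000 --> 00:00:02,500".
TIMESTAMP = re.compile(
    r"^\d{2}:\d{2}:\d{2}[,.]\d{3}\s*-->\s*\d{2}:\d{2}:\d{2}[,.]\d{3}"
)


def srt_to_text(srt: str) -> str:
    """Join the subtitle text of an SRT document into a single string."""
    kept = []
    for line in srt.splitlines():
        line = line.strip()
        # Skip blank lines, cue indices and timing lines.
        # Caveat: a subtitle line made up only of digits is skipped as well.
        if not line or line.isdigit() or TIMESTAMP.match(line):
            continue
        kept.append(line)
    return " ".join(kept)


def read_transcript(path: str = "output.srt") -> str:
    """Read an SRT file (the path is an assumption) and return its plain text."""
    return srt_to_text(Path(path).read_text(encoding="utf-8"))
```

This only gives the text after the file has been written, so it does not replace having the client return the transcription directly.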