API: Return transcribed text #220

powellnorma · 2024-05-29T17:01:32Z

Looking at the code, I don't see how the library user is supposed to access the transcribed text?
It looks like it just gets printed?

WhisperLive/whisper_live/client.py

Line 123 in e1a42c2

utils.print_transcript(text)

I think a workaround would be to read the output.srt - But maybe we could also just return the transcribed text as string?

The text was updated successfully, but these errors were encountered:

makaveli10 · 2024-05-30T03:27:18Z

@powellnorma Thanks for using the library. I think you make a good point, we can bring this feature in an upcoming release.

tidymonkey81 · 2024-06-11T02:27:22Z

i've just got my custom fast-whisper model working on a docker server and am looking where i can implement this myself. i haven't changed volume threshold settings for VAD yet but i get a lot of junk tokens. with slow whisper i implemented a black list for phrases like "Thank you", "Thanks very much", etc that get thrown out by the model. I think i can see where to look at transcribe() in transcriber.py to maybe select phrases and so expose them but the process seems expensive so i might need to look further.

makaveli10 added the Feature Request New feature or request label May 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

API: Return transcribed text #220

API: Return transcribed text #220

powellnorma commented May 29, 2024

makaveli10 commented May 30, 2024

tidymonkey81 commented Jun 11, 2024

API: Return transcribed text #220

API: Return transcribed text #220

Comments

powellnorma commented May 29, 2024

makaveli10 commented May 30, 2024

tidymonkey81 commented Jun 11, 2024