Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

API: Return transcribed text #220

Open
powellnorma opened this issue May 29, 2024 · 2 comments
Open

API: Return transcribed text #220

powellnorma opened this issue May 29, 2024 · 2 comments
Labels
Feature Request New feature or request

Comments

@powellnorma
Copy link

Looking at the code, I don't see how the library user is supposed to access the transcribed text?
It looks like it just gets printed?

utils.print_transcript(text)

I think a workaround would be to read the output.srt - But maybe we could also just return the transcribed text as string?

@makaveli10 makaveli10 added the Feature Request New feature or request label May 30, 2024
@makaveli10
Copy link
Collaborator

@powellnorma Thanks for using the library. I think you make a good point, we can bring this feature in an upcoming release.

@tidymonkey81
Copy link

i've just got my custom fast-whisper model working on a docker server and am looking where i can implement this myself. i haven't changed volume threshold settings for VAD yet but i get a lot of junk tokens. with slow whisper i implemented a black list for phrases like "Thank you", "Thanks very much", etc that get thrown out by the model. I think i can see where to look at transcribe() in transcriber.py to maybe select phrases and so expose them but the process seems expensive so i might need to look further.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Feature Request New feature or request
Development

No branches or pull requests

3 participants