Whisper combined with Silero VAD, for improved long-form transcriptions

ANonEntity/WhisperWithVAD


Whisper with Silero VAD

This notebook combines Whisper with a separate voice activity detector (VAD), Silero VAD. This improves long-form transcriptions, at the cost of possibly missing a few lines. The notebook also attempts to filter out hallucinations and obvious mistranslations.
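To illustrate the general idea of pairing a VAD with a transcriber, here is a minimal sketch of one common preprocessing step: merging the speech segments a VAD reports into larger chunks before handing them to the speech model. This is not the notebook's actual code. The function name merge_speech_segments, the max_gap and max_chunk parameters, and the sample timestamps are all assumptions made for illustration.

```python
from typing import List, Tuple

Segment = Tuple[float, float]  # (start_seconds, end_seconds)


def merge_speech_segments(
    segments: List[Segment],
    max_gap: float = 0.5,     # hypothetical: join segments separated by at most this many seconds
    max_chunk: float = 30.0,  # hypothetical: upper bound on a merged chunk's length in seconds
) -> List[Segment]:
    """Merge nearby VAD speech segments into larger chunks."""
    chunks: List[Segment] = []
    for start, end in sorted(segments):
        if chunks:
            prev_start, prev_end = chunks[-1]
            close_enough = start - prev_end <= max_gap
            fits = end - prev_start <= max_chunk
            if close_enough and fits:
                # Extend the previous chunk instead of starting a new one.
                chunks[-1] = (prev_start, max(prev_end, end))
                continue
        chunks.append((start, end))
    return chunks


# Hypothetical VAD output: speech detected in these time ranges (seconds).
speech = [(0.0, 2.0), (2.3, 5.0), (12.0, 14.5), (14.8, 40.0), (95.0, 97.0)]
chunks = merge_speech_segments(speech)
```

In this sketch, everything outside the returned chunks would be treated as non-speech and skipped. A speech model given only those chunks has fewer silent stretches in which to hallucinate, while any speech the VAD failed to detect would be lost, which matches the trade-off described above.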

How to use

  • Open the notebook in Google Colab.
  • Run the Setup Whisper cell.
  • Upload your input audio to either the runtime itself, Google Drive, or a file hosting service with direct download links.
  • Set the audio_path and language variables, then run the Run Whisper cell. (Note: audio_path is set automatically if you use the Upload cell.)
  • Once it's done, the notebook will automatically download the generated SRT file.
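For readers unfamiliar with the SRT file mentioned in the last step, the sketch below shows how timed lines map onto the SubRip format: a running index, a start and end timestamp written as HH:MM:SS,mmm, the subtitle text, and a blank line between entries. The helper names srt_timestamp and to_srt and the sample lines are hypothetical. The notebook's own writer may differ.

```python
from typing import Iterable, Tuple


def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(lines: Iterable[Tuple[float, float, str]]) -> str:
    """Render (start, end, text) tuples as the contents of an SRT file."""
    blocks = []
    for index, (start, end, text) in enumerate(lines, start=1):
        blocks.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text.strip()}\n"
        )
    # Entries are separated by a blank line.
    return "\n".join(blocks)


# Hypothetical transcription result.
example = to_srt([(0.0, 2.5, "Hello."), (3661.25, 3663.0, "Goodbye.")])
```

Most video players and subtitle editors accept files in this layout, so the downloaded file can usually be loaded next to the source audio or video without conversion.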
