whisper-playground

A playground to use whisper python package for transcription. A dev container is used to set up all that is needed included whisper, pyannote, ffmpeg and pydub.

Requirements:

Docker installed locally
If you want to use diarization, pyannote package reuires Hugging Face access token

Open folder in VS Code
copy .devcontainer/env.sample into .devcontainer/.env file and update environment variables as needed.
Run 'Reopen in Container' from the command pallete

Transcription & Diarization

To transcribe a video or audio file, run:

python transcription.py --file "path/to/audio/or/video/file"

If you also want to add diarization and align the transcription with each speaker segment run"

python transcription.py --file "path/to/audio/or/video/file" --diarization True

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.devcontainer		.devcontainer
.vscode		.vscode
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
file_utils.py		file_utils.py
requirements.txt		requirements.txt
transcription.py		transcription.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

whisper-playground

Transcription & Diarization

About

Releases

Packages

Languages

License

limorl/whisper-playground

Folders and files

Latest commit

History

Repository files navigation

whisper-playground

Transcription & Diarization

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages