This is a Golang CLI tool for generating transcripts of video files. It uses ffmpeg, whisper and ollama.
go run *.go [--format] <file>
--format
: Optional flag to trigger formatting the transcript into paragraphs using an LLM
go run *.go --format video.mp4
This command will:
- Convert
video.mp4
to a WAV - Transcribe the audio using Whisper
- Format the transcript into paragraphs using an LLM
- Save the formatted transcript to
formatted_transcript.txt
I'm still working on the best way to distribute this. For now, you can clone the repository, ensure you have the required dependencies installed, and update the WhisperPath
and ModelPath
constants in the code to match your system.
To build the CLI:
go build -o subber
Then move the binary to somewhere on your PATH (e.g. /usr/local/bin
).