Skip to content

mikeesto/subber

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Subber

This is a Golang CLI tool for generating transcripts of video and audio files. It wraps FFmpeg and Whisper.

Usage

go run *.go [--format] <file>

Options

  • --format: Optional flag to trigger formatting the transcript into paragraphs using basic topic analysis with TF-IDF

Example

go run *.go --format video.mp4

This command will:

  1. Convert video.mp4 to a WAV
  2. Transcribe the audio using Whisper
  3. Format the transcript into paragraphs using topic analysis with TF-IDF
  4. Save the formatted transcript to formatted_transcript.txt

Some very rough benchmarks.

Installation

I'm still working on the best way to distribute this. For now, you can clone the repository, ensure you have the required dependencies installed, and update the WhisperPath and ModelPath constants in the code to match your system.

To build the CLI:

go build -o subber

Then move the binary to somewhere on your PATH (e.g. /usr/local/bin).

About

A small CLI tool for converting video & audio to a text transcription

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages