Local video dubbing with a web UI. Uses faster-whisper (CTranslate2) for transcription and supertonic for TTS — all on-device, no cloud.
dubb-cli dub <video> --to <lang> -o output.mp4
dubb-cli serve --port 8080
dubb-cli versiondubb-cli serve --port 8080
# Open http://localhost:8080/uiUpload a video, pick a language, download the result.
- Extract audio (ffmpeg)
- Transcribe (faster-whisper, CPU int8 with CTranslate2)
- Translate (Google Translate via deep-translator)
- Generate speech (supertonic TTS — 31 languages, 10 voices)
- Speed up video per-segment to match TTS pace
- Concatenate clips
- Python 3.10+
- ffmpeg
- 8 GB+ RAM recommended
https://dubb.dk1.intrane.fr/ui
Tested on an Intel i5-12400T (10 cores, 8 GB RAM VM). A 40 MB, 5-minute video completed in ~120 seconds (Whisper base model, English → English re-voice).