SubForge v1.0.0
·
17 commits
to master
since this release
SubForge v1.0.0
AI-powered video captioning tool with ASR, subtitle optimization, translation, and synthesis.
Updates (2026-05-31)
Fix: Eliminate VAD Padding-Induced Timestamp Overlaps
- Root cause: Silero VAD's
speech_pad_ms(300ms default) was blindly added to both sides of every speech segment. Adjacent segments separated by less than 600ms would overlap after padding, causing subtitle timestamp overlaps in bilingual output. - Fix: Gap-aware padding — each segment's expansion is now limited to at most half the silence gap to its neighbor. Mathematically guarantees no overlap between adjacent segments.
- Impact: Bilingual subtitles no longer have overlapping timestamps. Previously ~7.3% of subtitle segments (52/715 in a typical video) had slight overlaps.
Features
- AI Transcription: mimo-omni ASR engine for accurate speech recognition
- Intelligent Translation: LLM-powered bilingual subtitle generation (mimo v2.5-pro)
- Smart Segmentation: VAD-based silence detection with optimized subtitle splitting
- Desktop App: Native macOS application with modern web UI
- Multi-format Support: SRT, ASS, VTT subtitle export
- Audio Enhancement: DeepFilterNet3 noise reduction
Installation
macOS Desktop App
Download SubForge.dmg and drag to Applications.
CLI
pip install subforge-cliQuick Start
# Run desktop app
open SubForge.app
# CLI usage
subforge transcribe video.mp4
subforge translate subtitles.srt --target-lang zh
subforge optimize subtitles.srtHighlights
- Optimized VAD parameters for better silence detection
- Conservative ASR prompt to minimize hallucinations
- Position-based subtitle resegmentation
- Bilingual subtitle synthesis
- Real-time progress tracking