Skip to content

SubForge v1.0.0

Choose a tag to compare

@henry1786580051-lang henry1786580051-lang released this 30 May 19:11
· 17 commits to master since this release

SubForge v1.0.0

AI-powered video captioning tool with ASR, subtitle optimization, translation, and synthesis.

Updates (2026-05-31)

Fix: Eliminate VAD Padding-Induced Timestamp Overlaps

  • Root cause: Silero VAD's speech_pad_ms (300ms default) was blindly added to both sides of every speech segment. Adjacent segments separated by less than 600ms would overlap after padding, causing subtitle timestamp overlaps in bilingual output.
  • Fix: Gap-aware padding — each segment's expansion is now limited to at most half the silence gap to its neighbor. Mathematically guarantees no overlap between adjacent segments.
  • Impact: Bilingual subtitles no longer have overlapping timestamps. Previously ~7.3% of subtitle segments (52/715 in a typical video) had slight overlaps.

Features

  • AI Transcription: mimo-omni ASR engine for accurate speech recognition
  • Intelligent Translation: LLM-powered bilingual subtitle generation (mimo v2.5-pro)
  • Smart Segmentation: VAD-based silence detection with optimized subtitle splitting
  • Desktop App: Native macOS application with modern web UI
  • Multi-format Support: SRT, ASS, VTT subtitle export
  • Audio Enhancement: DeepFilterNet3 noise reduction

Installation

macOS Desktop App

Download SubForge.dmg and drag to Applications.

CLI

pip install subforge-cli

Quick Start

# Run desktop app
open SubForge.app

# CLI usage
subforge transcribe video.mp4
subforge translate subtitles.srt --target-lang zh
subforge optimize subtitles.srt

Highlights

  • Optimized VAD parameters for better silence detection
  • Conservative ASR prompt to minimize hallucinations
  • Position-based subtitle resegmentation
  • Bilingual subtitle synthesis
  • Real-time progress tracking