v0.7.0

Latest

vitormf released this 04 Jun 16:22

aff1670

What's new

New features

Per-segment cross-language scoring: each segment now detects its own audio language via Whisper, and cross-language scoring activates per segment instead of for the whole file at once, so dubbed or mixed-language files are handled correctly even when only some segments are cross-language
Per-segment audio language in output: --verbose now shows asr[lang] for each segment, and segment audio languages are also included in CSV and HTML reports
Language confidence gate: segments where Whisper reports low language-detection confidence are excluded from audio language voting, and detecting an unsupported language (Basque, Filipino) triggers an early bail-out to avoid wasting time on unreliable transcription
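
The gating and voting behaviour described above can be illustrated with a minimal sketch. It is not the project's actual code: the names SegmentDetection, MIN_LANG_CONFIDENCE, UNSUPPORTED_LANGS, vote_audio_language, is_cross_language and reference_lang are all hypothetical, the 0.5 threshold is an assumed value (the notes do not state the real one), and it assumes "cross-language" means a segment's detected audio language differs from the language of the text being compared.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Iterable, Optional

# Assumed values for illustration; the release notes give no threshold.
MIN_LANG_CONFIDENCE = 0.5
# Basque and Filipino, written as the codes Whisper uses ("eu", "tl").
UNSUPPORTED_LANGS = {"eu", "tl"}


@dataclass
class SegmentDetection:
    lang: str          # language code detected for this segment
    confidence: float  # detection probability reported for that language


def passes_gate(seg: SegmentDetection) -> bool:
    """A segment takes part in voting only if detection was confident."""
    return seg.confidence >= MIN_LANG_CONFIDENCE


def vote_audio_language(segments: Iterable[SegmentDetection]) -> Optional[str]:
    """Majority vote over gated segments; None if nothing passes the gate."""
    votes = Counter(s.lang for s in segments if passes_gate(s))
    if not votes:
        return None
    return votes.most_common(1)[0][0]


def should_bail_out(lang: Optional[str]) -> bool:
    """Stop early when the voted language is one the tool does not support."""
    return lang in UNSUPPORTED_LANGS


def is_cross_language(seg: SegmentDetection, reference_lang: str) -> bool:
    """Decide per segment, rather than once for the whole file."""
    return passes_gate(seg) and seg.lang != reference_lang
```

With this shape, a dubbed file whose segments alternate between two languages gets a separate cross-language decision for each segment, and a low-confidence segment neither votes nor switches scoring mode.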

Bug fixes

Fixed crash on video containers that omit duration at the format level (e.g., raw MPEG-TS recordings)
Fixed audio candidate positions exceeding the audio track duration, which caused ffmpeg errors on recordings padded with video after audio ends
Fixed audio language voting to only count segments that pass the quality gate
Fixed segment_langs padding in cache store when fewer segments were transcribed than expected
Fixed quality gate not applying to the --no-cache transcription voting path
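
Two of the fixes above concern bounds and list lengths, which a short sketch can make concrete. This is only an illustration under assumptions, not the actual patch: clamp_candidates, pad_segment_langs, the window parameter and the use of None as the padding value are hypothetical choices, and the notes do not say whether out-of-range positions are clamped or dropped.

```python
from typing import List, Optional, Sequence


def clamp_candidates(positions: Sequence[float], window: float,
                     audio_duration: float) -> List[float]:
    """Keep every extraction window inside the audio track.

    Each start position is clamped so that position + window does not
    run past the end of the audio; duplicates created by clamping are
    merged.
    """
    latest_start = max(0.0, audio_duration - window)
    return sorted({min(max(0.0, p), latest_start) for p in positions})


def pad_segment_langs(langs: Sequence[Optional[str]],
                      expected: int) -> List[Optional[str]]:
    """Return exactly `expected` entries, padding with None when fewer
    segments were transcribed than planned."""
    return (list(langs) + [None] * expected)[:expected]
```

For a recording with 100 seconds of audio followed by video-only padding, a candidate at 120 seconds with a 30 second window would be moved back to 70 seconds, so ffmpeg is never asked to read past the end of the audio track.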
