v0.7.0

@vitormf vitormf released this 04 Jun 16:22

What's new

New features

  • Per-segment cross-language scoring: each segment now detects its own audio language via Whisper, and cross-language scoring activates per segment rather than all-or-nothing. Dubbed or mixed-language files are handled correctly even when only some segments are cross-language
  • Per-segment audio language in output: --verbose now shows asr[lang] for each segment, and segment audio languages are also included in CSV and HTML reports
  • Language confidence gate: segments where Whisper reports low language confidence are excluded from audio language voting. When the detected language is unsupported (Basque, Filipino), processing bails out early to avoid wasting time on unreliable transcription
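
The voting and per-segment behaviour described above can be sketched roughly as follows. This is an illustration only, not the project's code: the `SegmentResult` type, the function names, the 0.5 threshold, and the use of ISO 639-1 codes are all assumptions made for the example.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Iterable, Optional

# Hypothetical values chosen for illustration; the real thresholds and
# language codes used by the tool are not stated in these notes.
MIN_LANG_CONFIDENCE = 0.5
UNSUPPORTED_LANGS = {"eu", "tl"}  # Basque, Filipino


@dataclass
class SegmentResult:
    lang: str          # language detected for this segment
    confidence: float  # detection confidence in [0, 1]


def vote_audio_language(
    segments: Iterable[SegmentResult],
    min_confidence: float = MIN_LANG_CONFIDENCE,
) -> Optional[str]:
    """Majority vote over segments that pass the confidence gate."""
    votes = Counter(s.lang for s in segments if s.confidence >= min_confidence)
    if not votes:
        return None  # no segment was confident enough to vote
    return votes.most_common(1)[0][0]


def is_unsupported(lang: Optional[str]) -> bool:
    """True when the voted language should trigger an early bail-out."""
    return lang in UNSUPPORTED_LANGS


def cross_language_flags(
    segments: Iterable[SegmentResult], subtitle_lang: str
) -> list[bool]:
    """Decide per segment, rather than per file, whether scoring is cross-language."""
    return [s.lang != subtitle_lang for s in segments]


segments = [
    SegmentResult("en", 0.9),
    SegmentResult("pt", 0.3),   # below the gate, does not vote
    SegmentResult("pt", 0.4),   # below the gate, does not vote
    SegmentResult("en", 0.8),
    SegmentResult("pt", 0.95),
]
voted = vote_audio_language(segments)
flags = cross_language_flags(segments, subtitle_lang="en")
```

In this sample, an ungated vote would pick "pt" (three segments against two), while the gated vote picks "en" because two of the "pt" detections fall below the threshold. The per-segment flags still mark the individual "pt" segments as cross-language.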

Bug fixes

  • Fixed crash on video containers that omit duration at the format level (e.g., raw MPEG-TS recordings)
  • Fixed audio candidate positions exceeding the audio track duration, which caused ffmpeg errors on recordings padded with video after audio ends
  • Fixed audio language voting to only count segments that pass the quality gate
  • Fixed segment_langs padding in cache store when fewer segments were transcribed than expected
  • Fixed quality gate not applying to the --no-cache transcription voting path
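
For the candidate-position fix, one plausible approach is to drop any start position whose segment would run past the end of the audio track. The sketch below assumes positions and durations are in seconds; `filter_candidates` and its parameters are hypothetical names, and the actual fix may clamp or regenerate positions instead.

```python
def filter_candidates(
    positions: list[float], segment_len: float, audio_duration: float
) -> list[float]:
    """Keep only start positions whose full segment fits inside the audio track."""
    latest_start = audio_duration - segment_len
    if latest_start < 0:
        return []  # the audio track is shorter than a single segment
    return [p for p in positions if 0 <= p <= latest_start]


# A 600 s audio track inside a longer video: positions near or past the
# end of the audio are discarded before they reach ffmpeg.
kept = filter_candidates([10.0, 300.0, 590.0, 620.0], segment_len=30.0, audio_duration=600.0)
```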