Planned Features

ThioJoe edited this page Jan 11, 2024 · 17 revisions

Priority Planned Features:

  • N/A

Other Planned Features:

  • Azure Translator support

Features Maybe Eventually:

  • Support for Amazon Polly

Looking Into It:

  • Find a way to do 'forced alignment' of a good transcript with the video to get word-level timestamps. This should make it easier to start and stop speech clips at more natural breakpoints, and it might also remove the need for pre-made subtitle files.
  • It seems Azure Translator may be able to provide "Word Alignment" information, which could be used to produce better translations. The whole transcript could be translated at once instead of as individual fragments, which sometimes cut sentences in half and result in incorrect translations. The 'word indexes' could then be used to match up where the translated words should go, based on where the pre-translated words are. I suspect this is how YouTube does the auto syncing of translated transcripts behind the scenes.
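
As a rough illustration of the word-alignment idea above, here is a minimal Python sketch. It makes no API calls. It assumes the alignment arrives as a projection string of space-separated `srcStart:srcEnd-tgtStart:tgtEnd` pairs with inclusive character indexes, which is my understanding of the format Azure Translator returns when alignment is requested. The function names (`parse_alignment`, `split_translation`) and the sample projection string are hypothetical and hand-written for illustration; nothing here is part of this project.

```python
from typing import List, Tuple

Span = Tuple[int, int]  # inclusive (start, end) character indexes


def parse_alignment(proj: str) -> List[Tuple[Span, Span]]:
    """Parse 'sStart:sEnd-tStart:tEnd ...' into (source span, target span) pairs.

    The string format is an assumption about what the translation service returns.
    """
    pairs = []
    for item in proj.split():
        src, tgt = item.split("-")
        s0, s1 = (int(x) for x in src.split(":"))
        t0, t1 = (int(x) for x in tgt.split(":"))
        pairs.append(((s0, s1), (t0, t1)))
    return pairs


def split_translation(fragments: List[str], translated: str, proj: str) -> List[str]:
    """Cut one whole-transcript translation back into per-fragment pieces.

    Assumes the source text sent for translation was " ".join(fragments).
    """
    # Character range each original fragment occupies in the joined source text.
    bounds = []
    pos = 0
    for frag in fragments:
        bounds.append((pos, pos + len(frag) - 1))
        pos += len(frag) + 1  # +1 for the joining space

    # Collect the target spans whose source word starts inside each fragment.
    buckets: List[List[Span]] = [[] for _ in fragments]
    for (s0, _s1), tgt_span in parse_alignment(proj):
        for i, (b0, b1) in enumerate(bounds):
            if b0 <= s0 <= b1:
                buckets[i].append(tgt_span)
                break

    # Take the contiguous slice of the translation covering each bucket.
    result = []
    for spans in buckets:
        if not spans:
            result.append("")
            continue
        start = min(t0 for t0, _ in spans)
        end = max(t1 for _, t1 in spans)
        result.append(translated[start:end + 1])
    return result


# Hand-written sample data (not real service output).
fragments = ["The answer lies", "in machine translation."]
translated = "La réponse se trouve dans la traduction automatique."
proj = "0:2-0:1 4:9-3:9 11:14-11:19 16:17-21:24 19:25-40:50 27:37-29:38 38:38-51:51"

pieces = split_translation(fragments, translated, proj)
print(pieces)
```

Taking one contiguous slice per fragment keeps unaligned words (such as the article "la" in the sample) with their neighbors, but slices can overlap when the target language reorders words across a fragment boundary, so a real implementation would likely need extra handling for that case. The index arithmetic also assumes the service's character indexes match Python string indexes, which may not hold for characters outside the Basic Multilingual Plane.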