Generating Transcript and Translation with one command #1132

skartekko · 2023-03-21T12:05:09Z

skartekko
Mar 21, 2023

Hello, is it possible to generate the transcript and the translation at the same time via the CLI of Whisper, so that the VTT/SRT subtitle files (transcript and translation) have the identical segments and timecodes?

Answered by jongwook

Apr 12, 2023

Not out-of-the-box, because the probability distribution of sampling timestamps will be different between the translation and transcription modes. It might be possible to hack a pipeline of transcribing with word timestamps and enforce the positions of the timestamp tokens in the translation mode (by editing decoding.py), but it may be easier to just use a separate translation API at that point..

View full answer

jongwook · 2023-04-12T00:05:17Z

jongwook
Apr 12, 2023
Maintainer

Not out-of-the-box, because the probability distribution of sampling timestamps will be different between the translation and transcription modes. It might be possible to hack a pipeline of transcribing with word timestamps and enforce the positions of the timestamp tokens in the translation mode (by editing decoding.py), but it may be easier to just use a separate translation API at that point..

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generating Transcript and Translation with one command #1132

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Generating Transcript and Translation with one command #1132

Uh oh!

skartekko Mar 21, 2023

Replies: 1 comment

Uh oh!

jongwook Apr 12, 2023 Maintainer

skartekko
Mar 21, 2023

jongwook
Apr 12, 2023
Maintainer