Limiting previous token #851

amnike · 2023-01-16T14:13:39Z

amnike
Jan 16, 2023

Every time a new result is created it captures the previous result and as this goes on it gets to the point it's capturing the previous 50 results then the processing time increases tremendously lol.

I get that it's done like this for accuracy but, do you have a way or can point me in the direction of reducing the previous tokens or results captured?

I'd really appreciate it

Answered by jongwook

Jan 17, 2023

You could truncate the prefix tokens by adding something like

decode_options["prompt"] = decode_options["prompt"][-10:]

after:

whisper/whisper/transcribe.py

Line 180 in 0f39c89

decode_options["prompt"] = all_tokens[prompt_reset_since:]

That said, prompt tokens affect the inference time only during the first forward pass through the decoder, which is usually not very significant compared to the total autoregressive decoding time which usually involves tens or hundreds of forward passes through the decoder.

View full answer

jongwook · 2023-01-17T08:07:43Z

jongwook
Jan 17, 2023
Maintainer

You could truncate the prefix tokens by adding something like

decode_options["prompt"] = decode_options["prompt"][-10:]

after:

whisper/whisper/transcribe.py

Line 180 in 0f39c89

decode_options["prompt"] = all_tokens[prompt_reset_since:]

That said, prompt tokens affect the inference time only during the first forward pass through the decoder, which is usually not very significant compared to the total autoregressive decoding time which usually involves tens or hundreds of forward passes through the decoder.

2 replies

amnike Jan 17, 2023
Author

Thanks a lot! Any tips on reducing that super lag 15 minutes into a "real-time" transcription?

jongwook Jan 17, 2023
Maintainer

In case you're referring to #608, I can't comment because I wasn't involved. This repo currently does not support real-time transcription, except an idea I suggested in #117 (comment). In any case, you'd need a fast GPU to run the transcription faster than the audio duration.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Limiting previous token #851

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment 2 replies

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Limiting previous token #851

Uh oh!

amnike Jan 16, 2023

Replies: 1 comment · 2 replies

Uh oh!

jongwook Jan 17, 2023 Maintainer

Uh oh!

amnike Jan 17, 2023 Author

Uh oh!

jongwook Jan 17, 2023 Maintainer

amnike
Jan 16, 2023

Replies: 1 comment 2 replies

jongwook
Jan 17, 2023
Maintainer

amnike Jan 17, 2023
Author

jongwook Jan 17, 2023
Maintainer