Inference on long files #44

Hello,
Thank you for this great library!
Is there any way to chunk the initial audio into shorter samples, say 50 seconds each, run inference on those chunks, and reconstruct a final transcript from the results? I came across this article and wonder whether the same approach could work here.
Any ideas on whether this is possible?
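(For reference, a minimal sketch of the chunking idea described above, assuming pydub for slicing; the chunk length and file names are placeholders, and, as the first reply below explains, this is not actually needed with this library.)

```python
# Hypothetical sketch of manual chunking (the reply below explains
# why this is unnecessary here). Assumes pydub; names are placeholders.
from pydub import AudioSegment

CHUNK_MS = 50 * 1000  # 50-second chunks, as proposed above

audio = AudioSegment.from_file("long_audio.mp3")
chunks = [audio[i:i + CHUNK_MS] for i in range(0, len(audio), CHUNK_MS)]

for n, chunk in enumerate(chunks):
    chunk.export(f"chunk_{n:03d}.wav", format="wav")
    # ...run inference on each exported chunk, then concatenate
    # the per-chunk transcripts into the final text.
```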
Hi, the Whisper transcription loop already handles long files using a sliding 30-second window while keeping the context, so you don't need to do anything special to transcribe long files.
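(A minimal sketch of what this looks like in practice; the model size and file name are placeholders.)

```python
from faster_whisper import WhisperModel

model = WhisperModel("large-v2", device="cuda")

# transcribe() applies the sliding 30-second window internally and
# returns a generator of segments plus transcription metadata.
segments, info = model.transcribe("long_audio.mp3")
for segment in segments:
    print(f"[{segment.start:.2f}s -> {segment.end:.2f}s] {segment.text}")
```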
Thank you. So is it normal that the transcription time is considerably long for long files?
Yes, the transcription time depends on the audio file duration. Long files will take longer.
Sorry, I closed and reopened the issue. I just have one last question about longer files.
What is your GPU, and what model size are you running?
It's an NVIDIA GeForce GTX 1070 Ti (8 GB). I was running the large-v2 model on an 18-minute file, but even with a 4-minute file I get OOM errors.
Try running the model with 8-bit quantization:

```python
model = WhisperModel(model_path, device="cuda", compute_type="int8")
```
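(Put together with the transcription loop above, a sketch of the quantized setup might look like the following; the file name is again a placeholder, and CTranslate2 also supports other compute types such as "float16", depending on the GPU.)

```python
from faster_whisper import WhisperModel

# compute_type="int8" quantizes the weights to 8 bits, cutting
# memory use enough to fit large-v2 on an 8 GB GPU.
model = WhisperModel("large-v2", device="cuda", compute_type="int8")

segments, info = model.transcribe("long_audio.mp3")
print("".join(segment.text for segment in segments))
```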
Wow, just like that! It's a lot faster, and no OOM!!!