How to get the progress bar while transcribing? #850

benjieperez · 2023-01-16T08:53:22Z

benjieperez
Jan 16, 2023

Here is my code.

import whisper

Load Whisper model Large

MODEL = whisper.load_model("model/medium.pt", in_memory=True)

Set options

translate_options = dict(task="translate", **dict(language=translate_language, beam_size=5, best_of=5))
result_transcribe = MODEL.transcribe(audio, **transcribe_options, fp16=False, verbose=False)

When i enable verbose=False, this is the terminal output while transcribing,

Detected language: Turkish
60%|███████████████████████████████████████████████████████████████████████████ | 4058/6058 [00:21<00:00, 277.58frames/s]
100%|███████████████████████████████████████████████████████████████████████████ | 6058/6058 [00:21<00:00, 277.58frames/s]

I want to get the progress bar while its transcribing is it possible?

Thanks,

mayeaux · 2023-01-16T11:58:35Z

mayeaux
Jan 16, 2023

This is how I do it in JavaScript: https://github.com/mayeaux/generate-subtitles/blob/master/helpers/formatStdErr.js#L9

16 replies

benjieperez Jan 19, 2023
Author

How to set PYTHONUNBUFFERED? No idea on this.

glangford Jan 19, 2023

@benjieperez

glangford Jan 19, 2023

@benjieperez ...something along these lines is worth a try

import subprocess, os
whisper_env = os.environ.copy()
whisper_env["PYTHONUNBUFFERED"] = "1"
subprocess.Popen( ... , env=whisper_env)

benjieperez Jan 23, 2023
Author

@benjieperez ...something along these lines is worth a try

import subprocess, os
whisper_env = os.environ.copy()
whisper_env["PYTHONUNBUFFERED"] = "1"
subprocess.Popen( ... , env=whisper_env)

Still no luck, the progress bar shows after the process not while processing the audio.

benjieperez Jan 23, 2023
Author

Date Run: 2023-01-23 10:44
Running Command: whisper files_uploaded/63c8cb48abc63045d80fd5d2.wav --model=medium --model_dir=model/ --verbose=False --fp16=False --task=transcribe --output_dir=output/63c8cb48abc63045d80fd5d2
STDOUT: Detected language: Turkish

STDOUT:
STERR:

None
(This is blank, after the process finished, this will be the result)
STERR: 0%| | 0/6060 [00:00<?, ?frames/s]

{'percentDone': '0%', 'timeElapsed': '00:00', 'speed': 0, 'percentDoneAsNumber': 0.0, 'timeRemaining': {'string': '00:00', 'hoursRemaining': 0, 'minutesRemaining': 0, 'secondsRemaining': 0}}
STERR: 36%|███▋ | 2200/6060 [00:10<00:17, 215.65frames/s]

{'percentDone': '36%', 'timeElapsed': '00:10', 'speed': '215.65', 'percentDoneAsNumber': 36.0, 'timeRemaining': {'string': '00:17', 'hoursRemaining': 0, 'minutesRemaining': 0, 'secondsRemaining': 17}}
STERR: 74%|███████▍ | 4500/6060 [00:18<00:06, 256.08frames/s]

{'percentDone': '74%', 'timeElapsed': '00:18', 'speed': '256.08', 'percentDoneAsNumber': 74.0, 'timeRemaining': {'string': '00:06', 'hoursRemaining': 0, 'minutesRemaining': 0, 'secondsRemaining': 6}}
STERR: 100%|██████████| 6060/6060 [00:21<00:00, 295.05frames/s]

{'percentDone': '100%', 'timeElapsed': '00:21', 'speed': '295.05', 'percentDoneAsNumber': 100.0, 'timeRemaining': {'string': '00:00', 'hoursRemaining': 0, 'minutesRemaining': 0, 'secondsRemaining': 0}}
STERR: 100%|██████████| 6060/6060 [00:21<00:00, 276.00frames/s]

{'percentDone': '100%', 'timeElapsed': '00:21', 'speed': '276.00', 'percentDoneAsNumber': 100.0, 'timeRemaining': {'string': '00:00', 'hoursRemaining': 0, 'minutesRemaining': 0, 'secondsRemaining': 0}}
STERR:
Total Time Processed: --- 43.48728823661804 seconds ---

zefr0x · 2023-01-19T11:31:29Z

zefr0x
Jan 19, 2023

I think it would be great to have a function like whisper.transcribe(), that yields results rather then waiting to the end then returning every thing at ones.

I think it's not complicated, since whisper.transcribe() can print result in realtime when we set verbose=True.

0 replies

aadnk · 2023-03-27T16:13:01Z

aadnk
Mar 27, 2023

Another option is to override the tqdm.tqdm progress bar in Whisper itself:

import os
import sys
import tqdm
import urllib.request 

class _CustomProgressBar(tqdm.tqdm):
    def __init__(self, *args, **kwargs):
        super().__init__(*args, **kwargs)
        self._current = self.n  # Set the initial value
        
    def update(self, n):
        super().update(n)
        self._current += n
        
        # Handle progress here
        print("Progress: " + str(self._current) + "/" + str(self.total))

# Inject into tqdm.tqdm of Whisper, so we can see progress
import whisper.transcribe 
transcribe_module = sys.modules['whisper.transcribe']
transcribe_module.tqdm.tqdm = _CustomProgressBar

import whisper
model = whisper.load_model("medium")

if not os.path.exists("sample1.wav"):
    urllib.request.urlretrieve("https://github.com/itsupera/audiobook_alignment/raw/main/samples/sample1.wav", "sample1.wav")

result = model.transcribe("sample1.wav", language="Japanese", fp16=False, verbose=None)
print(result['text'])

That way, you don't have launch Whisper in another process, or parse the output. I have a complete example of how this can be done here:

https://huggingface.co/spaces/aadnk/whisper-webui/blob/main/src/hooks/whisperProgressHook.py#L95

I use it to pass progress from Whisper into Gradio. But yeah, hopefully there will be a proper API soon so we don't have to rely on method hooks or parsing the console output.

1 reply

bees4ever Oct 18, 2023

love to see it as a standard callback of the transcribe function

jbeltran73-2 · 2023-11-13T12:06:17Z

jbeltran73-2
Nov 13, 2023

#!/usr/bin/env python3

import whisper

model = whisper.load_model("large-v3")
result = model.transcribe("scale.m4a")

with open("transcription.txt", "w") as f:
        f.write(result["text"])

I don't see the progress bar with this simple code. I've searched and transcribe.py has tqdm already.
What I could be doing wrong?

Thank you

2 replies

dkleptsov Dec 14, 2023

Many functions have "verbose" boolean parameter. It controls how much information function will print to terminal.

Transcribe also has this parameter:
result = model.transcribe("scale.m4a") - will print nothing
result = model.transcribe("scale.m4a", verbose=False) - will print only progress bar
result = model.transcribe("scale.m4a", verbose=True) - will print detailed information

jbeltran73-2 Dec 14, 2023

Thank you trying it.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to get the progress bar while transcribing? #850

{{title}}

Replies: 4 comments 19 replies

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

Select a reply

How to get the progress bar while transcribing? #850

Load Whisper model Large

Set options

Replies: 4 comments · 19 replies

benjieperez Jan 19, 2023 Author

benjieperez Jan 23, 2023 Author

benjieperez Jan 23, 2023 Author

Replies: 4 comments 19 replies

benjieperez Jan 19, 2023
Author

benjieperez Jan 23, 2023
Author

benjieperez Jan 23, 2023
Author