
Backend import for 'as-a-module' use #10

Merged: 4 commits into ufal:main on Nov 28, 2023

Conversation

Luca-Pozzi (Contributor):

With reference to issue #9, this PR implements an import_backend method in ASRBase. The method is then overridden in both WhisperTimestampedASR and FasterWhisperASR to import the libraries each backend requires.

To make usage more flexible, I have also exposed the output arg in OnlineASRProcessor. It receives a file-like object, so one could pass (see the sketch after this list):

  • sys.stderr (as before), which in most cases prints to the terminal
  • a file handle (open("/path/to/file.txt", "a")) to log the output to a text file
  • open(os.devnull, "w") to send the output to /dev/null, i.e. to discard it
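A minimal sketch of the pattern described above; the class and method names follow the PR, while the method bodies are illustrative assumptions, not the repository's exact code:

    import sys

    class ASRBase:
        def __init__(self, lan, modelsize=None, cache_dir=None, model_dir=None):
            self.transcribe_kargs = {}
            self.original_language = lan
            self.import_backend()  # each subclass imports only what it needs

        def import_backend(self):
            raise NotImplementedError("must be overridden in a backend subclass")

    class FasterWhisperASR(ASRBase):
        def import_backend(self):
            # declared global so the lazily imported module is visible module-wide
            global faster_whisper
            import faster_whisper

    class WhisperTimestampedASR(ASRBase):
        def import_backend(self):
            global whisper, whisper_timestamped
            import whisper
            import whisper_timestamped

    class OnlineASRProcessor:
        def __init__(self, asr, output=sys.stderr):
            self.asr = asr
            self.output = output  # any file-like object, as in the list above

        def log(self, *args):
            print(*args, file=self.output, flush=True)

This keeps heavy backend libraries out of the import path, so callers that use whisper_online as a module only need the one backend they actually select.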

@Gldkslfmsd (Collaborator) left a comment:

I forgot to "submit review".

 self.commited_in_buffer = []
 self.buffer = []
 self.new = []

 self.last_commited_time = 0
 self.last_commited_word = None

+self.output = output
@Gldkslfmsd (Collaborator):

Let's rename self.output to self.logfile. Otherwise I agree, good idea.

Btw., the logging module should be used, but I am too lazy for that. The Python code should stay simple.
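For reference, a minimal sketch of the logging-based alternative mentioned here (purely illustrative; the PR keeps the plain file-like logfile object instead):

    import logging
    import sys

    logger = logging.getLogger("whisper_online")
    logging.basicConfig(stream=sys.stderr, level=logging.DEBUG,
                        format="%(levelname)s %(message)s")

    # instead of: print(msg, file=self.logfile)
    logger.debug("incomplete segment discarded")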

@@ -465,11 +476,11 @@ def split(self, text):
 #asr = WhisperASR(lan=language, modelsize=size)

 if args.backend == "faster-whisper":
-    from faster_whisper import WhisperModel
+    #from faster_whisper import WhisperModel
@Gldkslfmsd (Collaborator):

ok

     asr_cls = FasterWhisperASR
 else:
-    import whisper
-    import whisper_timestamped
+    #import whisper
@Gldkslfmsd (Collaborator):

ok

-    import whisper
-    import whisper_timestamped
+    #import whisper
+    #import whisper_timestamped
@Gldkslfmsd (Collaborator):

ok

-# join transcribe words with this character (" " for whisper_timestamped, "" for faster-whisper because it emits the spaces when neeeded)
-sep = " "
+sep = " " # join transcribe words with this character (" " for whisper_timestamped,
+          # "" for faster-whisper because it emits the spaces when neeeded)
@Gldkslfmsd (Collaborator):

ok
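For context, a short illustration of why sep differs per backend (illustrative strings, not repository code):

    # whisper_timestamped emits bare words, so they are joined with a space:
    " ".join(["Hello", "world"])    # -> "Hello world"   (sep = " ")

    # faster-whisper already includes leading spaces where needed, so sep is empty:
    "".join(["Hello", " world"])    # -> "Hello world"   (sep = "")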


 def __init__(self, lan, modelsize=None, cache_dir=None, model_dir=None):
     self.transcribe_kargs = {}
     self.original_language = lan

+    self.import_backend()
@Gldkslfmsd (Collaborator):

Too complicated code. Let's just do the imports in load_model in the child classes that need it.
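A minimal sketch of this suggestion, building on the ASRBase sketch above: the import moves into load_model of the child class that needs it. The method body is an assumption for illustration; download_root is an existing faster_whisper parameter:

    class FasterWhisperASR(ASRBase):
        sep = ""  # faster-whisper emits spaces itself

        def load_model(self, modelsize=None, cache_dir=None, model_dir=None):
            # imported here, so the dependency is only required when this backend is chosen
            from faster_whisper import WhisperModel
            return WhisperModel(modelsize, download_root=cache_dir)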

@lrq3000 commented Oct 30, 2023:

@Luca-Pozzi Great work! Do you plan on finishing this PR? I would really like to use whisper_online as a module; being unable to do so greatly limits composability into more end-user-facing apps.

@Luca-Pozzi (Contributor, Author):

@lrq3000 It is really a minor contribution, but thank you! I plan to finish the PR this week.

@lrq3000 commented Oct 31, 2023:

@Luca-Pozzi Awesome, thank you very much! Please let me know if I can help; I am experienced in Python coding but not in deep learning modules.

@Luca-Pozzi (Contributor, Author):

@Gldkslfmsd I have edited the PR as per your suggestions! Thank you very much, and sorry it took so long on my side.

@Gldkslfmsd (Collaborator):

Thanks! No worries about the delay. I'm busy until 25.11.2023 at least; I hope I'll look at this later.

@Gldkslfmsd merged commit 39e06b5 into ufal:main on Nov 28, 2023, and added a commit that referenced this pull request the same day.
@lrq3000 commented Nov 28, 2023:

Thank you both for your great work on this! :D

@umaryasin33 commented Dec 8, 2023:

How can this be used to allow multiple clients to connect when hosting a server, or to create an API for live transcription?

@Gldkslfmsd (Collaborator):

> How can this be used to allow multiple clients to connect when hosting a server, or to create an API for live transcription?

I don't know; it's a topic that requires a separate issue. But first, there must be a Whisper backend that enables batching -- processing more inputs at once. If there is not, then use one GPU with one server per client.

@umaryasin33:

> I don't know; it's a topic that requires a separate issue. But first, there must be a Whisper backend that enables batching -- processing more inputs at once. If there is not, then use one GPU with one server per client.

Thank you. Using one GPU for each client is a tall ask for me, as there could be up to a dozen clients active at any given time in my use case. I think there are a few backends that do support batched processing, e.g. https://github.com/Blair-Johnson/batch-whisper. It would help if you have any references, or if you can point me to the parts where changes are needed to implement this. Or is it alright if I create a new issue for this?
