# Speech-to-Text Model Inference: Comparison of Results and Benchmarks

### Test audio: simulated patient-doctor conversation, 3 minutes 27 seconds long

### OpenAI Whisper large-v2 inference time per minute of audio: 26.8 s

### SYSTRAN Faster-Whisper large-v2 inference time per minute of audio: 1.3 s
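
The per-minute figures in the headings above come from dividing total inference time by clip length. A minimal sketch of that normalization follows, using the wall-clock times printed by the cells in this notebook. The helper names `seconds_per_audio_minute` and `real_time_factor` are illustrative and not part of either library.

```python
# Clip length stated above: 3 minutes 27 seconds.
AUDIO_SECONDS = 3 * 60 + 27


def seconds_per_audio_minute(inference_seconds, audio_seconds=AUDIO_SECONDS):
    """Return the inference time spent per 60 s of audio."""
    return inference_seconds * 60.0 / audio_seconds


def real_time_factor(inference_seconds, audio_seconds=AUDIO_SECONDS):
    """Return processing time divided by audio duration (lower is faster)."""
    return inference_seconds / audio_seconds


# Wall-clock times taken from the "Inference Time" outputs in this notebook.
whisper_seconds = 92.49082779884338
faster_whisper_seconds = 4.601823568344116

for name, seconds in [("whisper", whisper_seconds),
                      ("faster-whisper", faster_whisper_seconds)]:
    print(f"{name}: {seconds_per_audio_minute(seconds):.1f} s per audio minute, "
          f"RTF {real_time_factor(seconds):.3f}")
```

A real-time factor below 1 means the model transcribes faster than the audio plays.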

In [1]:
file_path = 'Patient_Doctor_First_Convo.mp3'

### Whisper Large-V2 Model

#### VRAM usage: 10563 MiB on a GeForce GTX 1080 Ti (approx. 11 GB of VRAM)

#### https://github.com/openai/whisper

In [4]:
import whisper
import torch
import time

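# Prefer CUDA, then Apple MPS, and fall back to CPU.
if torch.cuda.is_available():
    device = "cuda"
elif torch.backends.mps.is_available():
    device = "mps"
else:
    device = "cpu"

# Load the large-v2 checkpoint from a local file.
whisper_model = whisper.load_model("/data/rizwan_karim/rs_dev_restored/model/large-v2.pt", device=device)

start_time = time.time()

result = whisper_model.transcribe(file_path)

# Despite its name, end_time holds the elapsed wall-clock time in seconds.
end_time = time.time() - start_time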

In [5]:
result["text"]

" Hello, good morning Mark. How are you feeling today? Good afternoon doctor. I was doing fine until I came back from my holiday trip. Oh really? Where did you go for the holidays? I went to Karachi to go to the beach and spend some sea time. Oh that's great. Karachi is good this time of the year. Not so hot, not so cold. Plus, who doesn't like sea and beach? Anyways, how did you end up here? Coming to the real question. I've been experiencing some troubling symptoms lately. I've had persistent headaches, occasional dizziness, and I've been feeling unusually fatigued. When did these symptoms start? I first noticed the headaches about two weeks ago. The dizziness comes and goes and the fatigue has been lingering for about a month now. I thought it might be stress related, but it's been getting worse. Alright, let me just look it into here. Okay, so I see. Thank you for sharing the information. Stress can certainly impact your health, but we need to rule out other possibilities too. Have

In [6]:
print("Inference Time: ", end_time)

Inference Time:  92.49082779884338


### Faster-Whisper Large-V2

#### VRAM usage: 9319 MiB on a GeForce GTX 1080 Ti (approx. 11 GB of VRAM)

#### https://github.com/SYSTRAN/faster-whisper

In [1]:
from faster_whisper import WhisperModel
import time

model_size = "large-v2"
file_path = 'Patient_Doctor_First_Convo.mp3'  # same clip as in the Whisper run

# Run on GPU with FP16
# model = WhisperModel(model_size, device="cuda", compute_type="float16")

# or run on CPU with INT8
# model = WhisperModel(model_size, device="cpu", compute_type="int8")

# Run on GPU with the default compute type
model = WhisperModel(model_size, device="cuda")

start_time = time.time()

# beam_size=1 disables beam search (greedy decoding)
segments, info = model.transcribe(file_path, beam_size=1)

print("Detected language '%s' with probability %f" % (info.language, info.language_probability))

# Caution: `segments` is a lazy generator, so decoding runs inside the loop below.
# The timer stops here, before the loop, so end_time excludes that decoding work.
end_time = time.time() - start_time
result = ''

for segment in segments:
    # print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text))
    result += segment.text

Estimating duration from bitrate, this may be inaccurate

Detected language 'en' with probability 0.921098

In [2]:
result

" Hello, good morning Mark. How are you feeling today? Good afternoon doctor. I was doing fine until I came back from my holiday trip. Oh really? Where did you go for the holidays? I went to Karachi to go to the beach and spend some sea time. Oh that's great. Karachi is good this time of the year. Not so hot, not so cold. Plus, who doesn't like sea and beach? Anyways, how did you end up here? Coming to the real question. I've been experiencing some troubling symptoms lately. I've had persistent headaches, occasional dizziness, and I've been feeling unusually fatigued. When did these symptoms start? I first noticed the headaches about two weeks ago. The dizziness comes and goes and the fatigue has been lingering for about a month now. I thought it might be stress related, but it's been getting worse. Alright, let me just look it into here. Okay, so I see. Thank you for sharing the information. Stress can certainly impact your health, but we need to rule out other possibilities too. Have

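The two transcripts shown in this notebook can also be compared programmatically rather than by eye. The sketch below uses Python's standard `difflib` on whitespace-separated words. The helper names `word_similarity` and `word_differences` are illustrative, and the two strings are short excerpts of the outputs above, not the full transcripts.

```python
from difflib import SequenceMatcher


def word_similarity(text_a, text_b):
    """Return a 0..1 similarity ratio over lowercased, whitespace-split words."""
    words_a = text_a.lower().split()
    words_b = text_b.lower().split()
    return SequenceMatcher(None, words_a, words_b, autojunk=False).ratio()


def word_differences(text_a, text_b):
    """List (operation, words_from_a, words_from_b) for each non-matching span."""
    words_a, words_b = text_a.split(), text_b.split()
    matcher = SequenceMatcher(None, words_a, words_b, autojunk=False)
    return [
        (tag, " ".join(words_a[i1:i2]), " ".join(words_b[j1:j2]))
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]


# Excerpts copied from the Whisper and Faster-Whisper outputs in this notebook.
whisper_text = " Hello, good morning Mark. How are you feeling today?"
faster_whisper_text = " Hello, good morning Mark. How are you feeling today?"

print(word_similarity(whisper_text, faster_whisper_text))
print(word_differences(whisper_text, faster_whisper_text))
```

This is a rough agreement check between two model outputs. It is not a word error rate, which would require a human-verified reference transcript.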
In [3]:
print("Inference Time: ", end_time)

Inference Time:  4.601823568344116
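
The faster-whisper README warns that `segments` is a generator and that transcription only starts when it is iterated. A timer stopped before the loop therefore may not cover the decoding work. The sketch below shows the effect with a hypothetical stand-in generator, `fake_segments`, so that it runs without a GPU or the library installed. With the real library, the `segments` object returned by `model.transcribe(...)` would be passed to `timed_transcript` instead.

```python
import time
from types import SimpleNamespace


def fake_segments(texts, delay=0.05):
    """Hypothetical stand-in for a lazy segment generator: work happens on iteration."""
    for text in texts:
        time.sleep(delay)  # simulates the per-segment decoding cost
        yield SimpleNamespace(text=text)


def timed_transcript(segments):
    """Consume the generator fully and return (joined_text, elapsed_seconds)."""
    start = time.perf_counter()
    text = "".join(segment.text for segment in segments)
    return text, time.perf_counter() - start


start = time.perf_counter()
segments = fake_segments([" Hello,", " good morning Mark."])
creation_seconds = time.perf_counter() - start  # near zero: nothing decoded yet

text, decode_seconds = timed_transcript(segments)
print(f"creating the generator: {creation_seconds:.3f} s")
print(f"consuming the generator: {decode_seconds:.3f} s")
print(text)
```

For an end-to-end figure comparable to the Whisper measurement, the timer would need to start before the `transcribe` call and stop only after the loop over `segments` has finished.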
