How to detect video language without conversion? #733

schotek · 2022-12-21T11:00:16Z

schotek
Dec 21, 2022

Hey. Hey,

Whisper's good at detecting language if I don't give it one. But is it possible to only run this detection without it then starting to convert the video to that language?

Answered by glangford

Dec 21, 2022

The README shows how to run detection only. Scroll down to "Python usage" near the bottom, starting with

Below is an example usage of whisper.detect_language() and whisper.decode() which provide lower-level access to the model.

# load audio and pad/trim it to fit 30 seconds
audio = whisper.load_audio("audio.mp3")
audio = whisper.pad_or_trim(audio)

# make log-Mel spectrogram and move to the same device as the model
mel = whisper.log_mel_spectrogram(audio).to(model.device)

# detect the spoken language
_, probs = model.detect_language(mel)
print(f"Detected language: {max(probs, key=probs.get)}")

View full answer

glangford · 2022-12-21T16:57:53Z

glangford
Dec 21, 2022

The README shows how to run detection only. Scroll down to "Python usage" near the bottom, starting with

Below is an example usage of whisper.detect_language() and whisper.decode() which provide lower-level access to the model.

# load audio and pad/trim it to fit 30 seconds
audio = whisper.load_audio("audio.mp3")
audio = whisper.pad_or_trim(audio)

# make log-Mel spectrogram and move to the same device as the model
mel = whisper.log_mel_spectrogram(audio).to(model.device)

# detect the spoken language
_, probs = model.detect_language(mel)
print(f"Detected language: {max(probs, key=probs.get)}")

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to detect video language without conversion? #733

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Select a reply

Uh oh!

How to detect video language without conversion? #733

Uh oh!

schotek Dec 21, 2022

Replies: 1 comment

Uh oh!

Uh oh!

glangford Dec 21, 2022

schotek
Dec 21, 2022

glangford
Dec 21, 2022