-
|
Hi, I have tested whisper on the attached audio with the python script with lower-level access to the model (below) but the model doesn't transcribe a big chunk of the audio (almost half). It happened with few other audios. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
|
it's not transcribing most of it because you're trimming a 50 seconds down to 30 seconds, so 20 seconds of the audio got thrown out in this line: audio = whisper.pad_or_trim(audio)
|
Beta Was this translation helpful? Give feedback.
it's not transcribing most of it because you're trimming a 50 seconds down to 30 seconds, so 20 seconds of the audio got thrown out in this line:
pad_or_trim()by default trims the audio down to 480000 samples or 30 seconds, which what the model expects for input