ffmpeg error #114
Replies: 6 comments 8 replies
-
It worked fine using the Deepgram API endpoint. I used this request:
I got this response:
After converting the text from Unicode, this is the result:
-
I was getting the same error on Colab, and the fix was to specify exactly where the audio file is. I mounted my Drive, so it was found here:
Successful output of my program on Colab after changing the path:
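To illustrate the point about paths, here is a minimal sketch that resolves an audio path and fails early with a clear message if the file is not where the notebook expects it. The helper name `require_audio_file` is made up for this example and is not part of whisper or Colab.

```python
from pathlib import Path


def require_audio_file(path_str: str) -> Path:
    """Return the resolved path, or raise if no regular file exists there."""
    path = Path(path_str).expanduser().resolve()
    if not path.is_file():
        # Include the working directory, since relative paths depend on it.
        raise FileNotFoundError(f"No audio file at {path} (cwd: {Path.cwd()})")
    return path
```

Assuming Drive is mounted at the commonly used `/content/drive` (an assumption, not something stated in this thread), a call might look like `require_audio_file("/content/drive/MyDrive/Alfonso1_1h10m.mp3")`.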
-
Sorry, I'm not versed enough to follow your script. After uploading the audio file to my Google Drive and mounting it, I tried the following command. Processing got partway through, then failed with the same error message. It seems like that should work, though. Do you have another suggestion?
!whisper "drive/MyDrive/Alfonso1_1h10m.mp3" – model large --language es
(I'll sleep on this one and tackle it again tomorrow.) Thanks!
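One possible reading of the `–: No such file or directory` line in the original report: the dash before `model` in the command above appears to be a single en dash followed by a space rather than `--model`, in which case the CLI may treat `–` as one more input file. That is an inference from the text as posted, not something confirmed in this thread. A small sketch for spotting such characters (the helper name `find_suspect_dashes` is hypothetical):

```python
import shlex
from typing import List

# Unicode dashes that editors and chat tools often substitute for "-" or "--".
SUSPECT_DASHES = ("\u2013", "\u2014", "\u2212")


def find_suspect_dashes(command: str) -> List[str]:
    """Return the tokens of a shell-style command that start with a non-ASCII dash."""
    tokens = shlex.split(command)
    return [tok for tok in tokens if tok.startswith(SUSPECT_DASHES)]
```

Run on the command as posted (without the leading `!`), this returns the lone en dash token, while a command that uses only ASCII `--option` flags returns an empty list.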
-
@jremington I have published a whisper package with pure Python dependencies. It does not require setting up the ffmpeg CLI and gives exactly the same results as the original implementation: https://github.com/fcakyon/pywhisper Feel free to try it.
-
(Nice skull, by the way!) The new script has the same problem. It starts transcribing, then fails slightly after 9 minutes. The error message is not shown in the script output, though. I'm now fairly sure the problem is with the Colab file system.
Edit: I just noticed that the file had been truncated to 8 MB (from 55.4 MB), which accounts for the early stop. I uploaded the full file and tried again, and that time the script ran to completion! However, I am still confused about how data seem to vanish.
-
Yes, that works fine, and I appreciate the elegant script you provided. It solved the problem. However, I wanted to understand the cause, and after a bit more effort it became clear that uploads to the local Colab directory often fail, leaving behind a truncated file. The only warning is an open red circle next to the name of the file Colab is attempting to upload (see the screenshot, lower right corner). If you hover the mouse over the file name in the local directory, the reported file size is too small.
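Since the only visible warning is easy to miss, a scripted check may help. The following is a rough sketch, assuming you noted the file's size (and optionally its SHA-256) on your own machine before uploading; the function names are invented for this example.

```python
import hashlib
from typing import Tuple


def file_fingerprint(path: str, chunk_size: int = 1 << 20) -> Tuple[int, str]:
    """Return (size_in_bytes, sha256_hex) for a file, reading it in chunks."""
    digest = hashlib.sha256()
    size = 0
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large audio files are not loaded at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
            size += len(chunk)
    return size, digest.hexdigest()


def upload_is_complete(path: str, expected_size: int) -> bool:
    """True if the uploaded copy has exactly the size recorded before upload."""
    size, _ = file_fingerprint(path)
    return size == expected_size
```

Comparing sizes catches a truncation like the 8 MB versus 55.4 MB case described earlier in this thread; comparing the hash as well would also catch corruption that leaves the size unchanged.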
-
Hi, Folks:
I'm using Colaboratory with a GPU and the large model to transcribe and translate an interview I recorded in Spanish. It works well for part of the file, but then the program fails with an error message that I can't interpret, at various points in the input file (I tried both mp3 and flac formats with similar results, but with failures at different times). I've listened to the audio and can't hear anything unusual at the times where the error is raised (2 minutes (flac), 10 minutes (mp3), and 21 minutes (mp3) into the file).
I uploaded a zipped copy of the .mp3 version of the audio input file (attached at the bottom of this post) in case someone is willing to give it a spin. It fails 9'25" into the audio with the same message as below.
Suggestions for how to determine what this message means, and how to get around it, would be appreciated. Where can I find the "stderr output"?
SNIP of the runtime output from the FLAC audio format version:
[02:01.000 --> 02:20.000] Sé que hay taro baiji, hay taro hiba, taro neti, taro harajara hiba, ketuana mea y hay un taro que en este momento me falta los nombres, no me acuerdo muy bien.
[02:20.000 --> 02:25.000] Pero podemos completarlos con los datos.
Traceback (most recent call last):
  File "/usr/local/lib/python3.7/dist-packages/whisper/audio.py", line 44, in load_audio
    .run(cmd="ffmpeg", capture_stdout=True, capture_stderr=True)
  File "/usr/local/lib/python3.7/dist-packages/ffmpeg/_run.py", line 325, in run
    raise Error('ffmpeg', out, err)
ffmpeg._run.Error: ffmpeg error (see stderr output for detail)

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/usr/local/bin/whisper", line 8, in <module>
    sys.exit(cli())
  File "/usr/local/lib/python3.7/dist-packages/whisper/transcribe.py", line 275, in cli
    result = transcribe(model, audio_path, temperature=temperature, **args)
  File "/usr/local/lib/python3.7/dist-packages/whisper/transcribe.py", line 76, in transcribe
    mel = log_mel_spectrogram(audio)
  File "/usr/local/lib/python3.7/dist-packages/whisper/audio.py", line 111, in log_mel_spectrogram
    audio = load_audio(audio)
  File "/usr/local/lib/python3.7/dist-packages/whisper/audio.py", line 47, in load_audio
    raise RuntimeError(f"Failed to load audio: {e.stderr.decode()}") from e
RuntimeError: Failed to load audio: ffmpeg version 3.4.11-0ubuntu0.1 Copyright (c) 2000-2022 the FFmpeg developers
built with gcc 7 (Ubuntu 7.5.0-3ubuntu1~18.04)
configuration: --prefix=/usr --extra-version=0ubuntu0.1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --enable-gpl --disable-stripping --enable-avresample --enable-avisynth --enable-gnutls --enable-ladspa --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librubberband --enable-librsvg --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-omx --enable-openal --enable-opengl --enable-sdl2 --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libopencv --enable-libx264 --enable-shared
libavutil 55. 78.100 / 55. 78.100
libavcodec 57.107.100 / 57.107.100
libavformat 57. 83.100 / 57. 83.100
libavdevice 57. 10.100 / 57. 10.100
libavfilter 6.107.100 / 6.107.100
libavresample 3. 7. 0 / 3. 7. 0
libswscale 4. 8.100 / 4. 8.100
libswresample 2. 9.100 / 2. 9.100
libpostproc 54. 7.100 / 54. 7.100
–: No such file or directory
alfonso1_0_21.zip
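On the question of where the "stderr output" is: in the traceback above, it appears to be the text that follows `Failed to load audio:`. ffmpeg prints its version banner to stderr first, so the line that actually explains the failure is usually the last one (here, `–: No such file or directory`). A minimal sketch for pulling out that line, using only the standard library; the function names are made up for this example:

```python
import subprocess
from typing import List, Tuple


def last_stderr_line(stderr_text: str) -> str:
    """Return the last non-blank line of a tool's stderr text ('' if none)."""
    lines = [line.strip() for line in stderr_text.splitlines() if line.strip()]
    return lines[-1] if lines else ""


def run_and_report(cmd: List[str]) -> Tuple[int, str]:
    """Run a command, capture stderr, and return (exit code, last stderr line)."""
    completed = subprocess.run(cmd, capture_output=True, text=True)
    return completed.returncode, last_stderr_line(completed.stderr)
```

Assuming `ffmpeg` is on the PATH and `audio.mp3` stands in for your own file, something like `run_and_report(["ffmpeg", "-i", "audio.mp3", "-f", "null", "-"])` would decode the file without writing output and return the final stderr line along with the exit code.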