-
-
Notifications
You must be signed in to change notification settings - Fork 1.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Whisper V3 model can support long Audio input, can you add an API to support large Audio file as a whole? #1733
Comments
good point, we need to update whisper |
but I still can't send 17MByte MP3 audio file to the API :
I'm using ggml-whisper-largev3 model with localai:v2.11.0-cublas-cuda12-ffmpeg |
curl http://172.16.1.193/v1/audio/transcriptions -H "Content-Type: multipart/form-data" -F file="@/tmp/audio.mp3" -F model="whisper-3"
{"error":{"code":413,"message":"Request Entity Too Large","type":""}}% Yeah even a direct curl on a v3 model. I'm using the all in one cuda 12 image here. |
Even under that size,
|
Hey so its currently undocumented but found it in discord
|
set environment LOCALAI_UPLOAD_LIMIT right? Can I set it to 512? |
donno but I've managed to upload at least to 100 give it a shot |
just for future reference, yes it works
|
Is your feature request related to a problem? Please describe.
Current whisper api can only take around 20MB audio file per request. Now whisper V3 can work with large audio file. Can you support it too. Eliminate the limitation on audio file size for whisper V3.
Describe the solution you'd like
Allow upload a large audio file.
call whisper with V3 model, transcribe.
Stream the text back to the user.
Describe alternatives you've considered
Additional context
The text was updated successfully, but these errors were encountered: