Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Analyze uploaded PDF files through OpenAI API #1751

Open
zu0feng opened this issue Mar 9, 2025 · 2 comments
Open

Analyze uploaded PDF files through OpenAI API #1751

zu0feng opened this issue Mar 9, 2025 · 2 comments
Labels
support A request for help setting things up

Comments

@zu0feng
Copy link

zu0feng commented Mar 9, 2025

When I upload a PDF file and leverage it, I will get the base64 data. But I didn't find the code to process it in endpoints/openai, while it can handle the image base64 data. Besides, I failed to transfer it back to text. How can I analyze the file through OpenAI API?

Image

@zu0feng zu0feng added the support A request for help setting things up label Mar 9, 2025
@evalstate
Copy link
Contributor

What model are you using - I don't think the OpenAI models have native PDF Support do they? I added this for the Anthropic endpoint a couple of months ago.

@flozi00
Copy link
Contributor

flozi00 commented Mar 15, 2025

I would also like to add audio at the same run.
Since Gemini supports both and has an openai api this could make sense.
Especially when running a litellm proxy

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
support A request for help setting things up
Projects
None yet
Development

No branches or pull requests

3 participants