
Adding decoding of base64 image data for gemini pro 1.5 #3711

Merged
1 commit merged into BerriAI:main on May 20, 2024

Conversation

@hmcp22 commented on May 17, 2024

Adding decoding of base64 image data for gemini pro 1.5

Relevant issues

When passing a base64-encoded image to gemini-pro-1.5, the data URL is treated as a filesystem path, producing the following error:

Exception has occurred: APIConnectionError
[Errno 36] File name too long: '/home/hmcp22/hugo-repos/litellm/data:image/jpeg;base64,/9j/4AAQSkZJRgABAQEAS...
  File "/home/hmcp22/hugo-repos/litellm/litellm/main.py", line 1759, in completion
    model_response = gemini.completion(
                     ^^^^^^^^^^^^^^^^^^
  File "/home/hmcp22/hugo-repos/litellm/litellm/llms/gemini.py", line 147, in completion
    prompt = prompt_factory(
             ^^^^^^^^^^^^^^^
  File "/home/hmcp22/hugo-repos/litellm/litellm/llms/prompt_templates/factory.py", line 1505, in prompt_factory
    return _gemini_vision_convert_messages(messages=messages)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/hmcp22/hugo-repos/litellm/litellm/llms/prompt_templates/factory.py", line 1359, in _gemini_vision_convert_messages
    raise e
  File "/home/hmcp22/hugo-repos/litellm/litellm/llms/prompt_templates/factory.py", line 1354, in _gemini_vision_convert_messages
    image = Image.open(img)
            ^^^^^^^^^^^^^^^

Type

🆕 New Feature
🐛 Bug Fix

Changes

Added code to load the image from base64 data in _gemini_vision_convert_messages
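The change makes the Gemini message converter recognize base64 data URLs instead of passing them to Image.open as file paths. A minimal sketch of the decoding step, using a hypothetical helper name (the actual change lives inside litellm's _gemini_vision_convert_messages):

```python
import base64
import io

from PIL import Image


def load_image_from_data_url(data_url: str) -> Image.Image:
    """Decode a base64 data URL (e.g. "data:image/jpeg;base64,...") into a PIL Image.

    Illustrative helper only; litellm's converter inlines equivalent logic.
    """
    # Split off the "data:image/...;base64," prefix to get the raw payload.
    _, encoded = data_url.split(",", 1)
    image_bytes = base64.b64decode(encoded)
    # Image.open accepts any file-like object, so wrap the decoded bytes.
    return Image.open(io.BytesIO(image_bytes))
```

The key difference from the failing code path is that Image.open receives an in-memory BytesIO object rather than the multi-kilobyte data URL string, which the OS rejected as an over-long filename.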

Code to test:

Set GEMINI_API_KEY
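Since the snippet below calls load_dotenv(), one way to set the key is a local .env file ("your-api-key-here" is a placeholder):

```shell
# Store the key in a .env file in the working directory;
# python-dotenv's load_dotenv() picks it up automatically.
echo 'GEMINI_API_KEY=your-api-key-here' > .env
```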

import base64

import litellm
from dotenv import load_dotenv

# Load GEMINI_API_KEY from a local .env file.
load_dotenv()


def encode_image(image_path):
    with open(image_path, "rb") as image_file:
        return base64.b64encode(image_file.read()).decode("utf-8")


image_path = "landmark3.jpg"
base64_image = encode_image(image_path)

messages = [
    {
        "role": "user",
        "content": [
            {
                "type": "text",
                "text": "Describe the image in a few sentences.",
            },
            {
                "type": "image_url",
                "image_url": {"url": f"data:image/jpeg;base64,{base64_image}"},
            },
        ],
    }
]

response = litellm.completion(
    model="gemini/gemini-1.5-pro-latest",
    messages=messages,
)
content = response.get("choices", [{}])[0].get("message", {}).get("content")
print(content)

Screenshot of the result of running the above code: [screenshot attachment, 2024-05-17]


@ishaan-jaff ishaan-jaff merged commit 622e241 into BerriAI:main May 20, 2024
2 checks passed
@ishaan-jaff (Contributor) commented:

Hi @hmcp22, can we hop on a call sometime this week? I'd love to learn how we can improve litellm for you. What's the best email to send an invite to?

If it's easier here's a link to my cal https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
