Description
The read tool successfully locates and claims to read a JPEG image file. Instead of using the native vision model of the LLM, the read tool returns a base64 encoding of the image (not suitable for LLMs).
This causes the LLM to hallucinate image details.
OpenCode fails to recognize if a model has native vision capabilities.
Plugins
No response
OpenCode version
1.1.47
Steps to reproduce
- Place a JPEG image file, (e.g.
imgtest.jpg in the current directory, do not name it what the image has, as the LLM would guess correctly)
- Ask the model to describe the image, do not give it any hints
- The model fails to answer correctly, when asked what it received, it states it has received a bas64 string
Screenshot and/or share link
Operating System
Ubuntu 24.04.3 LTS
Terminal
OpenCode Desktop v1.1.45
Description
The
readtool successfully locates and claims to read a JPEG image file. Instead of using the native vision model of the LLM, thereadtool returns abase64encoding of the image (not suitable for LLMs).This causes the LLM to hallucinate image details.
OpenCode fails to recognize if a model has native vision capabilities.
Plugins
No response
OpenCode version
1.1.47
Steps to reproduce
imgtest.jpgin the current directory, do not name it what the image has, as the LLM would guess correctly)Screenshot and/or share link
Operating System
Ubuntu 24.04.3 LTS
Terminal
OpenCode Desktop v1.1.45