Skip to content

Read tool accesses JPEG file but vision model hallucinates incorrect description (no actual image content processed) #11437

@randomvibecoder

Description

@randomvibecoder

Description

The read tool successfully locates and claims to read a JPEG image file. Instead of using the native vision model of the LLM, the read tool returns a base64 encoding of the image (not suitable for LLMs).
This causes the LLM to hallucinate image details.
OpenCode fails to recognize if a model has native vision capabilities.

Plugins

No response

OpenCode version

1.1.47

Steps to reproduce

  1. Place a JPEG image file, (e.g. imgtest.jpg in the current directory, do not name it what the image has, as the LLM would guess correctly)
  2. Ask the model to describe the image, do not give it any hints
  3. The model fails to answer correctly, when asked what it received, it states it has received a bas64 string

Screenshot and/or share link

Image

Operating System

Ubuntu 24.04.3 LTS

Terminal

OpenCode Desktop v1.1.45

Metadata

Metadata

Assignees

Labels

bugSomething isn't working

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions