fix: add fallback vision detection for Ollama models with incomplete capabilities by roomote-v0[bot] · Pull Request #12214 · RooCodeInc/Roo-Code

roomote-v0 · 2026-04-28T12:10:47Z

Related GitHub Issue

Description

This PR attempts to address Issue #12211 where Ollama models (like gemma4 unsloth variants) report "Does not support images" even though they have vision capabilities. The root cause is that some third-party model quants strip the "vision" entry from the Ollama capabilities array.

How it works:

The parseOllamaModel function previously relied solely on capabilities.includes("vision"). This PR adds a detectVisionSupport() helper that checks three sources in order:

capabilities array (authoritative, preferred) -- unchanged behavior
details.families -- checks for known vision encoder family names (clip, siglip, mmproj, mllama)
model_info keys -- regex match for keys containing vision, clip, siglip, mmproj, or image_encoder

This makes Roo Code more resilient when the capabilities array is incomplete while still preferring the authoritative field when available.

Test Procedure

Added 8 new unit tests covering all fallback paths (families-based detection, model_info-based detection, case insensitivity, and negative cases)
All 20 tests in ollama.test.ts pass
Lint and type checks pass across the monorepo

Pre-Submission Checklist

Issue Linked: This PR is linked to an approved GitHub Issue (see "Related GitHub Issue" above).
Scope: My changes are focused on the linked issue (one major feature/fix per PR).
Self-Review: I have performed a thorough self-review of my code.
Testing: New and/or updated tests have been added to cover my changes.
Documentation Impact: No documentation updates required.
Contribution Guidelines: I have read and agree to the Contributor Guidelines.

Documentation Updates

No documentation updates are required.

Additional Notes

Feedback and guidance are welcome. The set of vision family names and model_info key patterns may need to be expanded as new multimodal architectures appear in Ollama.

Interactively review PR in Roo Code Cloud

…capabilities

fix: add fallback vision detection for Ollama models with incomplete …

14c254d

…capabilities

github-project-automation Bot added this to Roo Code Roadmap Apr 28, 2026

github-project-automation Bot moved this to New in Roo Code Roadmap Apr 28, 2026

roomote-v0 Bot mentioned this pull request Apr 28, 2026

[BUG] GEMMA 4 has Vision but ROO CODE says Does not support images #12211

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: add fallback vision detection for Ollama models with incomplete capabilities#12214

fix: add fallback vision detection for Ollama models with incomplete capabilities#12214
roomote-v0[bot] wants to merge 1 commit intomainfrom
fix/ollama-vision-fallback-detection

roomote-v0 Bot commented Apr 28, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

roomote-v0 Bot commented Apr 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Related GitHub Issue

Description

Test Procedure

Pre-Submission Checklist

Documentation Updates

Additional Notes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

roomote-v0 Bot commented Apr 28, 2026 •

edited

Loading