[Question]: How to use img2text with Ollama? #6183

@andrealesani

Description

Self Checks

  • I have searched for existing issues, including closed ones.
  • I confirm that I am using English to submit this report (Language Policy).
  • Non-English title submissions will be closed directly (Language Policy).
  • Please do not modify this template :) and fill in all the required fields.

Describe your problem

I am trying to use vision models on Ollama, such as llava or llama3.2-vision, but I cannot get them working. I have a knowledge base with a single PDF document that contains 3 images. Parsing always returns a single, empty chunk.

This is what I have tried to set:

  • /knowledge/dataset?id=... > Configuration > Layout recognition & OCR > LLava
  • /knowledge/dataset?id=... > Action (on the specific document) > Chunk method > Layout recognition & OCR > LLava
  • /user-setting/model > System model settings > Img2text model > LLava

Am I missing something?
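
For reference, here is a minimal sketch I used to rule out the Ollama side, assuming Ollama is running on its default localhost port and the model has already been pulled (`ollama pull llava`); `test_image.png` is a placeholder for one of the images extracted from the PDF:

```python
import base64
import json
import urllib.request

# Assumption: Ollama is running locally on its default port (11434).
OLLAMA_URL = "http://localhost:11434/api/generate"

# Base64-encode a test image, as Ollama's /api/generate endpoint
# expects for vision models.
with open("test_image.png", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode("utf-8")

payload = {
    "model": "llava",
    "prompt": "Describe this image in detail.",
    "images": [image_b64],
    "stream": False,
}

req = urllib.request.Request(
    OLLAMA_URL,
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)

with urllib.request.urlopen(req) as resp:
    result = json.load(resp)

# If this prints a sensible description, the model side works and the
# problem is more likely in the knowledge-base configuration.
print(result["response"])
```

In my case this returns a reasonable description of the image, so the model itself seems to respond correctly outside the knowledge base.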
