
Support multimodal models such as LLaVA for image input #1568

Open
cebtenzzre opened this issue Oct 24, 2023 · 4 comments
Labels
enhancement New feature or request

Comments

@cebtenzzre
Member

Feature request

We can make use of the upstream work at ggerganov/llama.cpp#3436 to support image input to LLMs.

@AndriyMulyar What was the name of the model that you wanted to consider as an alternative to LLaVA?

Motivation

Real-time image recognition on resource-constrained hardware would be very useful in applications such as robotics. This feature would open the door to broader use cases for GPT4All than simple text completion.

Your contribution

I may submit a pull request implementing this functionality.

@cebtenzzre cebtenzzre added the enhancement New feature or request label Oct 24, 2023
@AndriyMulyar
Contributor

Fuyu 8B is interesting because it's decoder-only.

I think LLaVA style is a fine choice, though, for an initial multimodal implementation.

@manyoso
Collaborator

manyoso commented Oct 24, 2023

This will require extensive changes to the GUI as well. It has been agreed that the GUI changes will come first, to provide a UI for the current multimodal upstream work.

@PedzacyKapec

+1
