You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We can make use of the upstream work at ggerganov/llama.cpp#3436 to support image input to LLMs.
@AndriyMulyar What was the name of the model that you wanted to consider as an alternative to LLaVA?
Motivation
Real-time image recognition on resource-constrained hardware would be very useful in applications such as robotics. This feature would open the door to broader use cases for GPT4All than simple text completion.
Your contribution
I may submit a pull request implementing this functionality.
The text was updated successfully, but these errors were encountered:
This will require extensive changes to the GUI as well. It has been agreed that the GUI changes will come first to provide a UI for the current multimodel upstream.
Feature request
We can make use of the upstream work at ggerganov/llama.cpp#3436 to support image input to LLMs.
@AndriyMulyar What was the name of the model that you wanted to consider as an alternative to LLaVA?
Motivation
Real-time image recognition on resource-constrained hardware would be very useful in applications such as robotics. This feature would open the door to broader use cases for GPT4All than simple text completion.
Your contribution
I may submit a pull request implementing this functionality.
The text was updated successfully, but these errors were encountered: