
LLAVA Configuration #737

Closed
hswlab opened this issue May 13, 2024 · 4 comments
hswlab (Contributor) commented May 13, 2024

Description

I'm having difficulty figuring out how to correctly configure the LLava example.

First, I initialized the backend with the paths to libllama.dll and llava_shared.dll:
NativeLibraryConfig.Instance.WithLibrary(llamaPath, llavaPath);

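For context, the initialization (shown in a screenshot in the original post) can be sketched like this. The two paths are placeholders for wherever your native binaries live; `NativeLibraryConfig.Instance.WithLibrary` is the call quoted above, and in LLamaSharp it must run before any other call that loads the backend:

```csharp
using LLama.Native;

// Placeholder paths: point these at your own copies of the native libraries.
string llamaPath = @"runtimes\win-x64\native\libllama.dll";
string llavaPath = @"runtimes\win-x64\native\llava_shared.dll";

// Must be configured before the native backend is loaded;
// calling it after the first model load has no effect.
NativeLibraryConfig.Instance.WithLibrary(llamaPath, llavaPath);
```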

Then I tried to implement something like this example shows. I don't understand where to find the suitable models I need for:

string multiModalProj = UserSettings.GetMMProjPath();
string modelPath = UserSettings.GetModelPath();


modelPath, I believe, is the model I can download here. But what is a clipModel, and where can I get it?
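For reference, the sample in question looks roughly like the sketch below. It is modeled on the LLamaSharp LLava example (exact constructor and method signatures may differ between versions); the "clipModel" is the multimodal projector file (mmproj-*.gguf) loaded alongside the quantized language model:

```csharp
using LLama;
using LLama.Common;

// From the sample: two files, not one.
string multiModalProj = UserSettings.GetMMProjPath(); // e.g. mmproj-model-f16.gguf
string modelPath      = UserSettings.GetModelPath();  // e.g. llava-v1.6-mistral-7b.Q3_K_XS.gguf

var parameters = new ModelParams(modelPath);
using var model     = LLamaWeights.LoadFromFile(parameters);      // language model
using var clipModel = LLavaWeights.LoadFromFile(multiModalProj);  // CLIP + projector weights
using var context   = model.CreateContext(parameters);

// The executor takes both: the text model (via the context)
// and the clip/projector weights for image input.
var executor = new InteractiveExecutor(context, clipModel);
```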

SignalRT (Collaborator) commented May 13, 2024

You can see the URLs of the models used in the example and unit tests in LLama.Unittest.csproj:

https://huggingface.co/cjpais/llava-1.6-mistral-7b-gguf/resolve/main/llava-v1.6-mistral-7b.Q3_K_XS.gguf
https://huggingface.co/cjpais/llava-1.6-mistral-7b-gguf/resolve/main/mmproj-model-f16.gguf

You will find both files in any vision model. Example:

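The two URLs above can be fetched directly. A minimal console sketch (assumes .NET 6+ top-level statements; the URLs are the ones linked above, everything else is illustrative):

```csharp
using System;
using System.IO;
using System.Net.Http;

// Both halves of the vision model: the quantized LLM and the mmproj file.
string[] urls =
{
    "https://huggingface.co/cjpais/llava-1.6-mistral-7b-gguf/resolve/main/llava-v1.6-mistral-7b.Q3_K_XS.gguf",
    "https://huggingface.co/cjpais/llava-1.6-mistral-7b-gguf/resolve/main/mmproj-model-f16.gguf",
};

using var http = new HttpClient();
foreach (var url in urls)
{
    var fileName = Path.GetFileName(new Uri(url).AbsolutePath);
    await using var download = await http.GetStreamAsync(url);
    await using var file = File.Create(fileName);
    await download.CopyToAsync(file);
    Console.WriteLine($"Downloaded {fileName}");
}
```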

SignalRT self-assigned this May 13, 2024
hswlab (Contributor, Author) commented May 13, 2024

Ah, thank you. So both models can be found on Hugging Face. That's completely new to me; usually I'm using just a single model ^^'

SignalRT (Collaborator) commented
Yes, you should download both files for the model you choose to use. Normally there are several quantized variants of the language model and one projection model.

LLaVA uses a CLIP vision encoder with a multimodal projection (mmproj).

You can find the details in this paper:

https://arxiv.org/pdf/2310.03744
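To show how the two files come together at inference time, here is a hedged sketch of image-prompted generation, continuing the executor from earlier in the thread. It follows the shape of the LLamaSharp LLava sample; the `Images` property and prompt format are assumptions that may differ between library versions:

```csharp
using System;
using System.IO;
using LLama.Common;

// Assumes `executor` was built from both the model and the mmproj weights,
// as in the earlier sketch. "demo.jpg" is a placeholder image path.
byte[] image = await File.ReadAllBytesAsync("demo.jpg");
executor.Images.Add(image); // queue the image for the next inference call

// The <image> tag marks where the projected image embeddings are spliced
// into the prompt (prompt format varies by model).
var prompt = "<image>\nUSER: What is in this picture?\nASSISTANT:";
await foreach (var token in executor.InferAsync(prompt, new InferenceParams { MaxTokens = 256 }))
    Console.Write(token);
```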

AsakusaRinne (Collaborator) commented
Maybe some documentation is necessary. :D

hswlab closed this as completed May 14, 2024