Is your feature request related to a problem? Please describe.
Ollama is a good solution for local evaluation in projects that already use it. If a project uses pure llama.cpp instead, it seems redundant to run both (one for generation, one for eval).
Describe the solution you'd like
llama.cpp ships a web server (llama-server) that exposes an OpenAI-compatible API, so it should be usable with litellm.
Describe alternatives you've considered
As mentioned, Ollama works, but I don't want to download two copies of each model when I could share a single set of model files by using llama.cpp for everything.
Additional context
Add any other context or screenshots about the feature request here.
Thank you for your feature request - we love adding them!