🚀 The feature
llama.cpp is one of the most widely used LLM inference tools, with native support for more than 95% of GGUF models.
Although llama-server exposes an OpenAI-style API, it is hard to run a main LLM and an embedding model at the same time.
The Python library llama-cpp-python allows running both models simultaneously. It would be really useful if mem0 had native support for llama.cpp via the llama-cpp-python library.
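For illustration, here is a minimal sketch of running a chat model and an embedding model side by side with llama-cpp-python; the model paths are placeholders for whatever local GGUF files you have:

```python
from llama_cpp import Llama

# Placeholder paths: swap in any local chat + embedding GGUF models.
llm = Llama(model_path="models/llama-3-8b-instruct.Q4_K_M.gguf")
embedder = Llama(
    model_path="models/nomic-embed-text-v1.5.Q4_K_M.gguf",
    embedding=True,  # load in embedding mode instead of generation
)

# Chat completion with the main model.
reply = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Summarize what mem0 does."}]
)
print(reply["choices"][0]["message"]["content"])

# Embeddings from the second model, in the same Python process.
vec = embedder.create_embedding("memory is all you need")["data"][0]["embedding"]
print(len(vec))
```

Native mem0 support could wrap this pattern so both the LLM and embedder backends point at local llama.cpp models.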
Motivation, pitch
llama.cpp also supports embedding models, which makes it well suited for local and on-device (edge) AI. Extending support to llama.cpp will encourage local and edge AI use cases to adopt mem0's capabilities.