
feature: embedding support #70

Closed
1 of 3 tasks
mudler opened this issue Apr 23, 2023 · 6 comments · Fixed by #222
Comments

@mudler
Owner

mudler commented Apr 23, 2023

Add support to embeddings to the API and the llama backend: https://github.com/ggerganov/llama.cpp/blob/e4422e299c10c7e84c8e987770ef40d31905a76b/llama.cpp#L2160

  • go-llama.cpp
  • go-gpt4all-j.cpp
  • go-gpt2.cpp
@limcheekin

limcheekin commented May 3, 2023

Just curious: what is the use/purpose of the embeddings above?

For the following use case of Retrieval Augmented Data QA:
https://blog.langchain.dev/tutorial-chatgpt-over-your-data/

Can't we use the following embedding models? I plan to use gpt4all-j with one of the following embedding models.

Please advise. Thank you.
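For context on what embeddings are for in that tutorial's Retrieval Augmented QA flow: documents and the user's question are mapped to vectors, and the most similar document chunks are retrieved and passed to the LLM as context. A minimal sketch in plain Python, using hypothetical hand-written vectors in place of real model output:

```python
import math

def cosine_similarity(a, b):
    # Cosine similarity: dot(a, b) / (|a| * |b|)
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Hypothetical embedding vectors; in practice these come from the
# embedding model exposed by the API.
doc_vectors = {
    "doc_about_cats": [0.9, 0.1, 0.0],
    "doc_about_llms": [0.1, 0.8, 0.3],
}
query_vector = [0.2, 0.7, 0.4]  # embedding of the user's question

# Retrieve the document most similar to the query; this chunk would
# then be injected into the LLM prompt as context.
best = max(doc_vectors, key=lambda d: cosine_similarity(doc_vectors[d], query_vector))
print(best)  # doc_about_llms
```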

@mudler
Owner Author

mudler commented May 5, 2023

Embeddings support has been merged to master. It is experimental and currently available only for llama.cpp-based models, so any feedback is more than welcome!

To enable it, set `embeddings: true` in the model's YAML config file.
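For reference, a minimal model config with the flag enabled might look like the sketch below; only the `embeddings: true` flag comes from the comment above, and the model name and file name are placeholders:

```yaml
# Hypothetical model YAML config; names are placeholders.
name: my-embedding-model       # name the API will expose for this model
parameters:
  model: ggml-model.bin        # placeholder model file
embeddings: true               # enable embeddings for this model
```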

@mudler
Owner Author

mudler commented May 6, 2023

I've published a sample using embeddings over here: https://github.com/go-skynet/LocalAI/tree/master/examples/query_data
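Since LocalAI exposes an OpenAI-compatible API, the example above ultimately issues requests against the embeddings endpoint. A sketch of the request shape follows; the host, port, and model name are placeholders, and the snippet only builds the JSON payload rather than calling a live server:

```python
import json

# LocalAI's OpenAI-compatible embeddings endpoint (placeholder host/port).
endpoint = "http://localhost:8080/v1/embeddings"

# Request body in the OpenAI embeddings format; the model name must
# match whatever the YAML config exposes (placeholder here).
payload = {
    "model": "my-embedding-model",
    "input": "What is LocalAI?",
}
body = json.dumps(payload)
print(body)

# Equivalent invocation from the shell:
#   curl http://localhost:8080/v1/embeddings \
#     -H "Content-Type: application/json" \
#     -d '{"model": "my-embedding-model", "input": "What is LocalAI?"}'
```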

@mudler
Owner Author

mudler commented May 10, 2023

Further optimizations landed in #222: embeddings can now be used with bert alongside any model, and there is also a huge performance improvement!
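If I understand #222 correctly, this means a dedicated bert model can serve the embeddings endpoint independently of the chat model. A guessed config sketch, assuming a `bert-embeddings` backend name (treat both the backend name and file name as unverified placeholders):

```yaml
# Hypothetical config for a dedicated bert embeddings model.
name: bert-embeddings          # placeholder name exposed by the API
backend: bert-embeddings      # assumed backend identifier from #222
parameters:
  model: bert.bin              # placeholder model file
embeddings: true
```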

@v4rm3t

v4rm3t commented May 24, 2023

Hello! I am trying to run a gpt4all-j model to build a local chatbot. How can I use BERT embeddings and wire them into the chat completions endpoint?

Currently I am running it on a Mac Mini i7 with 32 GB of RAM, and I plan to upgrade to a cloud server with more resources (vRAM) in the future. Is it possible to build a fast chatbot API using my own document embeddings?

@michelec1000

https://github.com/go-skynet/LocalAI/tree/master/examples/query_data

Thank you for the example! But can't it be included in the API? Currently I think you run those commands inside the container, right? Is there already a way for a request to a certain path to execute the query over the documents?
