fix embedding by adding fixes from llama.cpp upstream #4399

deadbeef84 · 2024-05-13T11:13:07Z

Embedding appears broken since v0.1.32

See #3777 #4207 for details.

This PR applies fixes based on ggerganov/llama.cpp@1b67731#diff-87355a1a297a9f0fdc86af5e2a59cae153290f58d68822cd10c30fee4f7f7076.

I've tested it and embedding vectors looks correct after applying this patch.

fredrik-smedberg · 2024-05-14T16:27:56Z

I can confirm this PR indeed fixes very obvious issues I had when doing embedding and queries with mxbai-embed-large and nomic-embed-text models.

multiduplikator · 2024-05-16T14:55:30Z

Desperately waiting for this fix to be integrated into next version. Having to stay on 0.1.31 is kind of a pain ...

fredrik-smedberg · 2024-05-16T15:08:21Z

@multiduplikator are you on a Mac? If so, it's very easy to download, compile and run this branch on your computer.

Install Brew if you haven't already (https://brew.sh)
Check out this branch,

git clone https://github.com/ollama/ollama.git
cd ollama
git fetch origin pull/4399/head:pr-4399
git checkout pr-4399

Follow the instructions here https://github.com/ollama/ollama?tab=readme-ov-file#building
Then just run it, like in the examples here, https://github.com/ollama/ollama?tab=readme-ov-file#running-local-builds

multiduplikator · 2024-05-21T11:48:28Z

@fredrik-smedberg Not on a Mac. I still have time to wait ... would like to avoid getting into custom building. But thanks for the input. Will come in handy when the time comes :)

0ssamaak0 · 2024-05-28T04:44:29Z

GJ @deadbeef84 🚀🚀
I just found out this is not working as expected after creating a vector db for 50k examples 😅😅 gonna use HF untill this get merged

0ssamaak0 · 2024-05-28T04:46:14Z

@jmorganca please merge this! Thanks 😁😁

youkefan18 · 2024-05-31T03:28:34Z

@jmorganca Hi, I can confirm this is important fix for any application running in prod.
The change in embedding results after upgrading ollama ruins all RAG related feature.

fix embedding by adding fixes from llama.cpp upstream

8aea5f1

deadbeef84 mentioned this pull request May 13, 2024

mxbai-embed-large embedding not consistent with original paper #4207

Open

monotykamary added a commit to monotykamary/ollama that referenced this pull request May 18, 2024

merge: fix embedding by adding fixes from llama.cpp upstream ollama#4399

bb7c592

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix embedding by adding fixes from llama.cpp upstream #4399

fix embedding by adding fixes from llama.cpp upstream #4399

deadbeef84 commented May 13, 2024

fredrik-smedberg commented May 14, 2024

multiduplikator commented May 16, 2024

fredrik-smedberg commented May 16, 2024

multiduplikator commented May 21, 2024

0ssamaak0 commented May 28, 2024

0ssamaak0 commented May 28, 2024

youkefan18 commented May 31, 2024

fix embedding by adding fixes from llama.cpp upstream #4399

Are you sure you want to change the base?

fix embedding by adding fixes from llama.cpp upstream #4399

Conversation

deadbeef84 commented May 13, 2024

fredrik-smedberg commented May 14, 2024

multiduplikator commented May 16, 2024

fredrik-smedberg commented May 16, 2024

multiduplikator commented May 21, 2024

0ssamaak0 commented May 28, 2024

0ssamaak0 commented May 28, 2024

youkefan18 commented May 31, 2024