
Embedding results have changed in v0.1.32 #3777

Closed
SunmeetOberoi opened this issue Apr 20, 2024 · 5 comments · Fixed by #4941
Labels
bug Something isn't working

Comments

@SunmeetOberoi

What is the issue?

The values of the embeddings have changed in the 0.1.32 release.

I used an older version of ollama to complete a POC for categorizing some data, and it all went fine. When I later tried to implement the solution, the search results were way off: almost all categories had a 50-60% similarity with every input value. After trying to fix my script for hours, I downgraded ollama and that fixed it.

This only happens in 0.1.32. I tested the same code on 0.1.29, 0.1.30, and 0.1.31, and it works consistently and accurately on all of them.

Attaching a sample Python script to reproduce the observation:

import subprocess

import numpy as np
import ollama
import pandas as pd

models = [
    'nomic-embed-text',
    'mxbai-embed-large',
    'snowflake-arctic-embed'
]


def calculate_similarity(product_embedding):
    # Cosine similarity between each stored embedding and the query embedding.
    df['similarity'] = df['embeddings'].apply(
        lambda x: np.dot(x, product_embedding) / (np.linalg.norm(x) * np.linalg.norm(product_embedding)))
    df_sorted = df.sort_values(by='similarity', ascending=False)
    return df_sorted[['Name', 'similarity']].head()


print(subprocess.check_output(["ollama", "--version"]).decode())

for model in models:
    df = pd.DataFrame({
        'Name': ['Fruits', 'Vegetables', 'Yellow', 'Brown']
    })
    df["embeddings"] = df.apply(
        lambda row: np.array(ollama.embeddings(model=model, prompt=row['Name']).get('embedding')), axis=1)

    arg = "Veggies"
    embedding = np.array(ollama.embeddings(model=model, prompt=arg).get('embedding'))
    result = calculate_similarity(embedding)
    print(f"========={model}==============")
    print(result)
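For reference, the similarity metric the script computes is plain cosine similarity. A standalone helper (equivalent to the lambda above, not part of the original report) can be sanity-checked without a running ollama server:

```python
import numpy as np

def cosine_similarity(a, b):
    """Cosine similarity between two 1-D vectors, in [-1, 1]."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Identical directions score 1.0; orthogonal directions score 0.0.
print(cosine_similarity([1.0, 0.0], [2.0, 0.0]))  # 1.0
print(cosine_similarity([1.0, 0.0], [0.0, 3.0]))  # 0.0
```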

Results:

Output for v0.1.29
ollama version is 0.1.29

=========nomic-embed-text==============
         Name  similarity
1  Vegetables    0.663238
0      Fruits    0.449560
2      Yellow    0.307696
3       Brown    0.281496
=========mxbai-embed-large==============
         Name  similarity
1  Vegetables    0.558774
2      Yellow    0.551288
3       Brown    0.549997
0      Fruits    0.548751
=========snowflake-arctic-embed==============
         Name  similarity
0      Fruits    0.825177
2      Yellow    0.825156
3       Brown    0.824968
1  Vegetables    0.824860
Output for v0.1.30
ollama version is 0.1.30

=========nomic-embed-text==============
         Name  similarity
1  Vegetables    0.663238
0      Fruits    0.449560
2      Yellow    0.307696
3       Brown    0.281496
=========mxbai-embed-large==============
         Name  similarity
1  Vegetables    0.558774
2      Yellow    0.551288
3       Brown    0.549997
0      Fruits    0.548751
=========snowflake-arctic-embed==============
         Name  similarity
0      Fruits    0.825177
2      Yellow    0.825156
3       Brown    0.824968
1  Vegetables    0.824860
Output for v0.1.31
ollama version is 0.1.31

=========nomic-embed-text==============
         Name  similarity
1  Vegetables    0.663238
0      Fruits    0.449560
2      Yellow    0.307696
3       Brown    0.281496
=========mxbai-embed-large==============
         Name  similarity
1  Vegetables    0.558774
2      Yellow    0.551288
3       Brown    0.549997
0      Fruits    0.548751
=========snowflake-arctic-embed==============
         Name  similarity
0      Fruits    0.825177
2      Yellow    0.825156
3       Brown    0.824968
1  Vegetables    0.824860
Output for v0.1.32
ollama version is 0.1.32

=========nomic-embed-text==============
         Name  similarity
2      Yellow    0.670598
3       Brown    0.636525
1  Vegetables    0.629145
0      Fruits    0.607114
=========mxbai-embed-large==============
         Name  similarity
1  Vegetables    0.619664
3       Brown    0.573989
2      Yellow    0.549525
0      Fruits    0.480530
=========snowflake-arctic-embed==============
         Name  similarity
0      Fruits    0.787907
3       Brown    0.787868
2      Yellow    0.787841
1  Vegetables    0.787761

  • The difference is clearly visible in the output. While the similarity for the correct answer does not change much in this sample, the impact on my full dataset is far bigger.
  • Although I was only using nomic and mxbai, I kept snowflake in the sample as well, because on my full dataset snowflake performed a little better than the other two on 0.1.32.
  • I understand I should use a vector database for such a problem, but this is just a lightweight sample.
  • I ran this test on WSL2, but it's the same on Windows as well.
  • While gathering this data, I also noticed that 0.1.32 takes significantly longer to run the script than the older versions.
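As a side note on the snowflake-arctic-embed numbers above: the four similarities differ only in the fourth decimal place, and when scores are that close, even tiny embedding changes can reorder the ranking. A minimal numpy sketch (illustrative values only, borrowed from the v0.1.29 output):

```python
import numpy as np

# Near-tied scores, like the snowflake-arctic-embed output above.
names = np.array(['Fruits', 'Yellow', 'Brown', 'Vegetables'])
scores = np.array([0.825177, 0.825156, 0.824968, 0.824860])

# A perturbation far smaller than the scores themselves flips the ranking.
perturbed = scores + np.array([0.0, 0.0005, 0.0, 0.0])
print(names[np.argsort(scores)[::-1]])     # 'Fruits' ranks first
print(names[np.argsort(perturbed)[::-1]])  # 'Yellow' now ranks first
```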

Python library versions

  • ollama (Python client): 0.1.8
  • pandas: 2.2.2
  • numpy: 1.26.4

OS

Windows, WSL2

GPU

Nvidia

CPU

Intel

Ollama version

0.1.32

@SunmeetOberoi SunmeetOberoi added the bug Something isn't working label Apr 20, 2024
@jmorganca jmorganca self-assigned this Apr 20, 2024

Kanishk-Kumar commented Apr 22, 2024

I also have this issue. In addition (even on 0.1.30), it never loads the full n_ctx = 8192 and logs this warning:

Apr 18 11:21:22 xyz-MS-7D91 ollama[38865]: time=2024-04-18T11:21:22.071+05:30 level=WARN source=server.go:51 msg="requested context length is greater than model max context length" requested=8192 model=2048
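For anyone scanning their logs for this, here is a quick way (just a convenience sketch, not from the thread) to pull the requested vs. actual context length out of that WARN line:

```python
import re

line = ('time=2024-04-18T11:21:22.071+05:30 level=WARN source=server.go:51 '
        'msg="requested context length is greater than model max context length" '
        'requested=8192 model=2048')

m = re.search(r'requested=(\d+) model=(\d+)', line)
if m:
    requested, actual = int(m.group(1)), int(m.group(2))
    print(requested, actual)  # 8192 2048
```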

@SunmeetOberoi can you please also cross-check the Ollama logs using sudo journalctl -xeu ollama.service -f and check whether n_ctx is accurate?

#3727 (comment)

Contributor

jimscard commented Apr 22, 2024

@jmorganca It looks like 0.1.32 is using the wrong model config parameter to determine the max context length. Details here: #3727 (comment)

Also see Readme.md in the nomic GGUF file repository for this model.

@SunmeetOberoi (Author)

Hi @Kanishk-Kumar, I tried it out, and yes, I am also seeing that log message, with n_ctx as 2048:

  • v0.1.31 - WSL2
    time=2024-04-22T23:43:22.474+05:30 level=WARN source=llm.go:44 msg="requested context length is greater than model's max context length (8192 > 2048), using 2048 instead"

  • v0.1.32 - Windows
    time=2024-04-22T23:51:48.208+05:30 level=WARN source=server.go:51 msg="requested context length is greater than model max context length" requested=8192 model=2048

Also, the embeddings are not the same across these versions.

As @jimscard correctly pointed out, the nomic GGUF readme does mention something related to this, which might help:

llama.cpp will default to 2048 tokens of context with these files. To use the full 8192 tokens that Nomic Embed is benchmarked on, you will have to choose a context extension method. The original model uses Dynamic NTK-Aware RoPE scaling, but that is not currently available in llama.cpp. A combination of YaRN and linear scaling is an acceptable substitute.
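If the goal is just to request the larger context window, Ollama's Modelfile syntax allows overriding num_ctx; a minimal sketch (whether the underlying RoPE scaling is then correct for nomic is a separate question, per the quote above):

```
FROM nomic-embed-text
PARAMETER num_ctx 8192
```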

Since this context-length issue exists in v0.1.31 as well and is model-specific, I think it is not related to the different-embedding-values problem reported here and can be tracked separately in #3727.

@youkefan18

Guys, any luck here? I just bumped to v0.1.33 and this issue still exists.
Vectors embedded with 'mxbai-embed-large' on v0.1.26 are very different from those on v0.1.33.

@deadbeef84
Contributor

I've pinpointed the issue to commit 5ec12ce.
