Fix error querying PineconeVectorStore using sparse query mode #12967

Javtor · 2024-04-19T23:01:01Z

Description

Pinecone always expects a vector (or technically also a record id) when querying. When using sparse query mode, we were setting the vector to None and making the request with that, resulting in an error. Changed to make the request with a vector filled with zeroes instead.

Version Bump?

Yes

Type of Change

Bug fix (non-breaking change which fixes an issue)

logan-markewich · 2024-04-19T23:08:07Z

.../vector_stores/llama-index-vector-stores-pinecone/llama_index/vector_stores/pinecone/base.py

@@ -433,7 +433,7 @@ def query(self, query: VectorStoreQuery, **kwargs: Any) -> VectorStoreQueryResul
                    "values": [v * (1 - query.alpha) for v in sparse_vector["values"]],
                }

-        query_embedding = None
+        query_embedding = [0.0] * len(query.query_embedding)


This is specifically to fix the case where query_mode is sparse right?

Won't providing a vector of zeros still kind of influence the search result? If we are in sparse mode, should we just query without the vector kwarg? Or should we set the alpha to completely ignore the Dense zero vector?

yup

The issue is that if we query without the vector kwarg, we get an error when making the request if we don't pass a record id instead. In their hybrid search tutorial, pinecone says we should scale the dense vector by alpha (in this case zero) before querying, so using a vector of zeros in this case should be fine (it's overwritten later for other query modes anyways) https://www.pinecone.io/learn/hybrid-search-intro/

Ah great, that works for me!

Ok wait, one more worry haha -- we should ensure that the query embedding is not none before doing this

true, added a check

…e-search

…lama#12967) * fix sparse query pinecone * none checking

fix sparse query pinecone

bbed047

Javtor requested a review from logan-markewich April 19, 2024 23:01

dosubot bot added the size:XS This PR changes 0-9 lines, ignoring generated files. label Apr 19, 2024

logan-markewich reviewed Apr 19, 2024

View reviewed changes

logan-markewich approved these changes Apr 20, 2024

View reviewed changes

dosubot bot added the lgtm This PR has been approved by a maintainer label Apr 20, 2024

none checking

eae1eb3

dosubot bot added size:S This PR changes 10-29 lines, ignoring generated files. and removed size:XS This PR changes 0-9 lines, ignoring generated files. labels Apr 22, 2024

Merge remote-tracking branch 'origin/main' into javier/pinecone-spars…

a34a4e0

…e-search

Javtor merged commit 521178f into main Apr 22, 2024
8 checks passed

Javtor deleted the javier/pinecone-sparse-search branch April 22, 2024 16:36

chrisalexiuk-nvidia pushed a commit to chrisalexiuk-nvidia/llama_index that referenced this pull request Apr 25, 2024

Fix error querying PineconeVectorStore using sparse query mode (run-l…

5b1b219

…lama#12967) * fix sparse query pinecone * none checking

mattf pushed a commit to mattf/llama_index that referenced this pull request Apr 25, 2024

Fix error querying PineconeVectorStore using sparse query mode (run-l…

3028f7d

…lama#12967) * fix sparse query pinecone * none checking

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix error querying PineconeVectorStore using sparse query mode #12967

Fix error querying PineconeVectorStore using sparse query mode #12967

Javtor commented Apr 19, 2024

logan-markewich Apr 19, 2024

Javtor Apr 19, 2024 •

edited

logan-markewich Apr 20, 2024

logan-markewich Apr 20, 2024

Javtor Apr 22, 2024

Fix error querying PineconeVectorStore using sparse query mode #12967

Fix error querying PineconeVectorStore using sparse query mode #12967

Conversation

Javtor commented Apr 19, 2024

Description

Version Bump?

Type of Change

logan-markewich Apr 19, 2024

Choose a reason for hiding this comment

Javtor Apr 19, 2024 • edited

Choose a reason for hiding this comment

logan-markewich Apr 20, 2024

Choose a reason for hiding this comment

logan-markewich Apr 20, 2024

Choose a reason for hiding this comment

Javtor Apr 22, 2024

Choose a reason for hiding this comment

Javtor Apr 19, 2024 •

edited