Support named vectors in Qdrant #6871

kacperlukawski · 2023-06-28T13:51:35Z

Description

This PR makes it possible to use named vectors from Qdrant in Langchain. That was requested multiple times, as people want to reuse externally created collections in Langchain. It doesn't change anything for the existing applications. The changes were covered with some integration tests and included in the docs.

Example

Qdrant.from_documents(
    docs,
    embeddings,
    location=":memory:",
    collection_name="my_documents",
    vector_name="custom_vector",
)

Issue: #2594

Tagging @rlancemartin & @eyurtsev. I'd appreciate your review.

vercel · 2023-06-28T13:51:38Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

1 Ignored Deployment

Name	Status	Preview	Comments	Updated (UTC)
langchain	⬜️ Ignored (Inspect)			Jun 29, 2023 10:19am

mahmoudajawad · 2023-06-28T17:24:36Z

Your approach over mine allows the following:

Add new documents to QDrant collection that uses a single vector key vector_name.
Query QDrant collection that uses any number of vector keys, using one pre-specified vector_name.

My approach in my second PR #5975 allows both scenarios but expands first to allow adding documents to collections that define multiple vector keys.

Now, my use case is already fulfilled with your work, because my use of langchain doesn't involve getting it to add new documents, but only for querying them. However, I would like you to understand the difference between both before you finalise your approach.

dev2049 · 2023-06-29T01:43:19Z

langchain/vectorstores/qdrant.py

            with_payload=True,
            with_vectors=True,
            limit=fetch_k,
        )
        embeddings = [result.vector for result in results]
        mmr_selected = maximal_marginal_relevance(
-            np.array(embedding), embeddings, k=k, lambda_mult=lambda_mult
+            np.array(query_vector), embeddings, k=k, lambda_mult=lambda_mult


we probably don't want to pass in vector_name to this, right?

@dev2049 Thanks for pointing this out! This was not covered well in the tests, so I extended them and fixed the issues. I would appreciate another look!

kacperlukawski · 2023-06-29T12:00:39Z

Your approach over mine allows the following:
* Add new documents to QDrant collection that uses a single vector key `vector_name`.

* Query QDrant collection that uses any number of vector keys, using one pre-specified `vector_name`.
My approach in my second PR #5975 allows both scenarios but expands first to allow adding documents to collections that define multiple vector keys.

Now, my use case is already fulfilled with your work, because my use of langchain doesn't involve getting it to add new documents, but only for querying them. However, I would like you to understand the difference between both before you finalise your approach.

We should not expose the vector configuration to the Langchain users. If anyone wants to use a custom configuration for their collection, it should be created directly with QdrantClient and then passed to Qdrant while being instantiated.

rlancemartin · 2023-06-29T22:14:18Z

lgtm

@rlancemartin

# Description This PR makes it possible to use named vectors from Qdrant in Langchain. That was requested multiple times, as people want to reuse externally created collections in Langchain. It doesn't change anything for the existing applications. The changes were covered with some integration tests and included in the docs. ## Example ```python Qdrant.from_documents( docs, embeddings, location=":memory:", collection_name="my_documents", vector_name="custom_vector", ) ``` ### Issue: #2594 Tagging @rlancemartin & @eyurtsev. I'd appreciate your review.

@rlancemartin

# Description This PR makes it possible to use named vectors from Qdrant in Langchain. That was requested multiple times, as people want to reuse externally created collections in Langchain. It doesn't change anything for the existing applications. The changes were covered with some integration tests and included in the docs. ## Example ```python Qdrant.from_documents( docs, embeddings, location=":memory:", collection_name="my_documents", vector_name="custom_vector", ) ``` ### Issue: langchain-ai#2594 Tagging @rlancemartin & @eyurtsev. I'd appreciate your review.

kacperlukawski added 2 commits June 28, 2023 15:24

Add support of named vectors in Qdrant

b664518

Add named vectors info to Qdrant notebook

3a60d6c

kacperlukawski added 2 commits June 28, 2023 16:05

Run linter

3c792cd

Run linter again

fe60de3

kacperlukawski mentioned this pull request Jun 28, 2023

Allow use of non-default QDrant vector key #5975

Closed

dev2049 reviewed Jun 29, 2023

View reviewed changes

Fix max_marginal_relevance_search with named vectors

128a33e

kacperlukawski requested a review from dev2049 June 29, 2023 10:10

rlancemartin self-assigned this Jun 29, 2023

rlancemartin approved these changes Jun 29, 2023

View reviewed changes

rlancemartin merged commit 140ba68 into langchain-ai:master Jun 29, 2023
14 checks passed

mahmoudajawad mentioned this pull request Sep 1, 2023

[Q] How to re-use QDrant collection data that are created separatly with non-default vector name? #2594

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support named vectors in Qdrant #6871

Support named vectors in Qdrant #6871

kacperlukawski commented Jun 28, 2023

vercel bot commented Jun 28, 2023 •

edited

mahmoudajawad commented Jun 28, 2023

dev2049 Jun 29, 2023

kacperlukawski Jun 29, 2023 •

edited

kacperlukawski commented Jun 29, 2023

rlancemartin commented Jun 29, 2023

Support named vectors in Qdrant #6871

Support named vectors in Qdrant #6871

Conversation

kacperlukawski commented Jun 28, 2023

Description

Example

Issue: #2594

vercel bot commented Jun 28, 2023 • edited

mahmoudajawad commented Jun 28, 2023

dev2049 Jun 29, 2023

Choose a reason for hiding this comment

kacperlukawski Jun 29, 2023 • edited

Choose a reason for hiding this comment

kacperlukawski commented Jun 29, 2023

rlancemartin commented Jun 29, 2023

vercel bot commented Jun 28, 2023 •

edited

kacperlukawski Jun 29, 2023 •

edited