-
Notifications
You must be signed in to change notification settings - Fork 2.4k
refactor: QdrantRM #979
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
refactor: QdrantRM #979
Conversation
|
Hi @Anush008 |
|
Not this one. If we can have a sparse vectors providers interface in DSPy, in |
|
I was thinking in these lines. Please correct me if i am missing something. qdrant_client.search expects same parameter for sparse embedding. and for dense embedding. So if we can generalize the |
|
If it can support sparse vectors, we can have them here. |
|
Hey @arnavsinghvi11. Just bumping this PR. Please take a look when possible. |
|
LGTM. thanks @Anush008 ! |
Description
This refactor allows users to query Qdrant with any implementation for
BaseVectorizer, the default being the newFastEmbedVectorizer.The field containing the document content in the Qdrant payload can be specified. It doesn't necessarily have to be
"document"anymore.Qdrant supports multiple named vectors and this update allows specifying one for retrieval. Defaults to the first found vector.
Currently, the implementation relies on
qdrant_client'squery_batch()abstraction, which uses FastEmbed internally.Breaking?
The default values to the new params ensure backward compatibility.