Hybrid Search #509

Closed
davidmezzetti opened this issue Jul 21, 2023 · 0 comments

The recent sparse vector additions/improvements to txtai (#507, #508) open up the possibility of hybrid vector search (sparse and dense). This change will repurpose the scoring index already on an Embeddings instance for sparse vector search.

An overview of the hybrid search changes is below.

  • Modify how scoring works with an embeddings instance.
    • If terms is disabled, scoring works as it does today: term weighting for word vector based indexes. This use case isn't the most popular, but it is useful for low-resource languages.
    • If terms is enabled, use scoring as a sparse vector index.
  • Merge sparse and dense query results using one of the following:
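
The merge options themselves are not listed in the text above. As a rough sketch of what merging sparse and dense query results can look like, the example below uses one common approach: min-max normalize each result list, then take a weighted average per id. This is an assumption for illustration, not necessarily the method txtai uses. The function names (`normalize`, `merge`), the `weight` parameter and the sample scores are all hypothetical.

```python
def normalize(results):
    """Min-max scale (id, score) pairs to [0, 1] so sparse and dense scores are comparable."""
    if not results:
        return {}

    scores = [score for _, score in results]
    low, high = min(scores), max(scores)
    span = high - low

    # When all scores are equal, treat every result as a full match
    return {uid: (score - low) / span if span else 1.0 for uid, score in results}


def merge(sparse, dense, weight=0.5, limit=3):
    """
    Hypothetical merge: weighted average of normalized scores.
    weight=1.0 uses only dense scores, weight=0.0 uses only sparse scores.
    An id missing from one result list contributes 0 for that list.
    """
    sparse, dense = normalize(sparse), normalize(dense)

    combined = {}
    for uid in set(sparse) | set(dense):
        combined[uid] = weight * dense.get(uid, 0.0) + (1 - weight) * sparse.get(uid, 0.0)

    # Highest score first, ties broken by id for a stable order
    return sorted(combined.items(), key=lambda x: (-x[1], x[0]))[:limit]


# Made-up scores: sparse (e.g. BM25-style) and dense (e.g. cosine similarity)
sparse_results = [("a", 12.0), ("b", 8.0), ("c", 4.0)]
dense_results = [("b", 0.9), ("d", 0.7), ("a", 0.5)]

for uid, score in merge(sparse_results, dense_results):
    print(uid, round(score, 2))
```

Normalizing first matters because sparse scores are typically unbounded while dense similarity scores fall in a fixed range; without it, one side would dominate the weighted average.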