WIP: Add PoC for LanceDB integration #6

Open: wants to merge 2 commits into base: main

transitive-bullshit
Collaborator

DO NOT MERGE

This PR explores adding LanceDB as an embedded dependency. The goal is to embed files and individual functions in a vector DB, which is useful for enforcing many kinds of rules. The most obvious use is detecting near-duplicates for DRY-type rules.
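For illustration only, here is a rough sketch of what near-duplicate detection over embeddings could look like without any vector DB. It is a brute-force cosine-similarity scan, not LanceDB's API and not code from this PR; the `EmbeddedUnit` shape, the function names, and the `0.95` threshold are all assumptions.

```typescript
// Hypothetical shape for an embedded code unit (a file or a single function).
interface EmbeddedUnit {
  id: string
  vector: number[]
}

// Cosine similarity between two equal-length vectors; returns 0 if either is all zeros.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error('vectors must have the same length')
  }
  let dot = 0
  let normA = 0
  let normB = 0
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i]
    normA += a[i] * a[i]
    normB += b[i] * b[i]
  }
  if (normA === 0 || normB === 0) return 0
  return dot / (Math.sqrt(normA) * Math.sqrt(normB))
}

// Brute-force O(n^2) scan over all pairs; the threshold is an assumed default,
// not a tuned value.
function findNearDuplicates(
  units: EmbeddedUnit[],
  threshold = 0.95
): Array<[string, string, number]> {
  const pairs: Array<[string, string, number]> = []
  for (let i = 0; i < units.length; i++) {
    for (let j = i + 1; j < units.length; j++) {
      const score = cosineSimilarity(units[i].vector, units[j].vector)
      if (score >= threshold) {
        pairs.push([units[i].id, units[j].id, score])
      }
    }
  }
  return pairs
}
```

The pairwise scan is fine for small repos, but it grows quadratically with the number of units, which is part of why an indexed vector search is attractive for larger codebases.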

We don't really need a vector database for this, but 1) it would be nice for the embeddings to be cached between runs and 2) we'd like to support larger codebases and more experimentation without worrying about the embeddings / vector search, so I'd prefer to use a production-grade solution.
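To make point 1 concrete, a minimal sketch of caching embeddings between runs might key each vector by a hash of the source text and persist the map as JSON. This is an assumption about one possible approach, not what this PR implements; `EmbedFn`, `EmbeddingCache`, and the helper names are hypothetical, and the embedding function is passed in rather than tied to any provider.

```typescript
import { createHash } from 'node:crypto'
import { existsSync, readFileSync, writeFileSync } from 'node:fs'

// Hypothetical types: any async function that maps text to a vector will do.
type EmbedFn = (text: string) => Promise<number[]>
type EmbeddingCache = Record<string, number[]>

// Key the cache by content hash so unchanged files/functions are never re-embedded.
function contentKey(text: string): string {
  return createHash('sha256').update(text).digest('hex')
}

// Load a previously saved cache, or start empty if the file does not exist.
function loadCache(path: string): EmbeddingCache {
  if (!existsSync(path)) return {}
  return JSON.parse(readFileSync(path, 'utf8')) as EmbeddingCache
}

// Persist the cache as plain JSON so the next run can reuse it.
function saveCache(path: string, cache: EmbeddingCache): void {
  writeFileSync(path, JSON.stringify(cache))
}

// Return the cached vector when present; otherwise embed and remember the result.
async function embedWithCache(
  text: string,
  cache: EmbeddingCache,
  embed: EmbedFn
): Promise<number[]> {
  const key = contentKey(text)
  const cached = cache[key]
  if (cached) return cached
  const vector = await embed(text)
  cache[key] = vector
  return vector
}
```

A flat JSON file like this stops scaling well as the codebase grows, which is where an embedded store starts to look more appealing.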

Overall, I love that LanceDB can be embedded locally, self-hosted, or hosted in their cloud. BUT their JS/TS SDK is a complete mess, and I'm really not sure it's worth the trouble.

I'm also considering Chroma and Qdrant, both of which are OSS and self-hostable. If we go with one of these non-embedded solutions, then I'd prefer to make it an optional dependency.

DO NOT MERGE

vercel bot commented Apr 10, 2024

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name Status Preview Updated (UTC)
gptlint ✅ Ready (Inspect) Visit Preview Apr 24, 2024 4:31am

Collaborator Author

transitive-bullshit commented Apr 12, 2024

Qdrant supports a local mode, but this may only be available in the Python package.
