Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

R&D: Structured Data #51

Open
gmpetrov opened this issue May 26, 2023 · 2 comments
Open

R&D: Structured Data #51

gmpetrov opened this issue May 26, 2023 · 2 comments

Comments

@gmpetrov
Copy link
Owner

Semantic search does not perform well with structured data (e.g: CSV file)

I think it does not make sense to store this kind of data in a vector database.

Should we create an SQL table on the fly?

How to make it work seamlessly for the end user? Can it work in combination with Datastores or should we treat it as another tool?

Solution requirements

  • At runtime, the full CSV or SQL table should be loaded in memory

Ressources:

@gmpetrov gmpetrov added this to the Structured Data Capabilities milestone May 26, 2023
@pax-k
Copy link

pax-k commented Jun 11, 2023

If you want to answer questions like "how many sales did I have in May", then SqlDatabaseChain should work as you suggested https://js.langchain.com/docs/use_cases/tabular/

But if you don't care about querying your data and just want to do semantic search, eg "do you know something about X", I'm thinking that CSVLoader shouldn't be discarded. Did you have bad experience with it?

@BenoitGdb
Copy link
Collaborator

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants