Multi-user #38

Closed
jeffchuber opened this issue Oct 27, 2022 · 2 comments

@jeffchuber
Contributor

jeffchuber commented Oct 27, 2022

One concern about using DuckDB and Parquet is maintaining correctness when many requests per second may be coming in to add new embeddings to the production data space.

The other concern is multiple users in the org querying or pulling data from a service at the same time.

Will this work? Will there be collisions?

@jeffchuber jeffchuber changed the title Multi-user [CHR-26] Multi-user Oct 27, 2022
@jeffchuber jeffchuber changed the title [CHR-26] Multi-user Multi-user Oct 27, 2022
@levand
Contributor

levand commented Oct 29, 2022

Yes, there would be. Follow-up question: would there be concurrent requests to the same dataset?

I also want to dig into the notion of "correctness". Depending on how you design the API, update the in-memory model, and flush to disk, it should be possible to avoid conflicts or corruption, but you'd still be subject to data loss if the process crashed after receiving a message but before persisting it.

I'll read up on DuckDB's support for incremental writes; that makes a difference here too.
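
To make the trade-off concrete, here is a minimal sketch of the "update an in-memory model, then flush to disk" pattern. It is not DuckDB-specific and not a proposed design: `BufferedEmbeddingStore`, `load_persisted`, and the JSON-lines file format are hypothetical names and choices made up for illustration, and it assumes a single server process. A lock serializes writers so concurrent adds cannot interleave, but anything still sitting in the buffer is lost if the process dies before `flush()` runs.

```python
import json
import os
import threading


class BufferedEmbeddingStore:
    """Hypothetical single-process store: adds go to memory, flush() persists them."""

    def __init__(self, path):
        self.path = path
        self._lock = threading.Lock()  # serializes writers and flushes
        self._pending = []             # received but not yet persisted

    def add(self, dataset, embedding):
        # Acknowledged here, but only durable after the next flush().
        with self._lock:
            self._pending.append({"dataset": dataset, "embedding": embedding})

    def pending_count(self):
        with self._lock:
            return len(self._pending)

    def flush(self):
        """Append all pending records to disk and return how many were written."""
        with self._lock:
            if not self._pending:
                return 0
            with open(self.path, "a", encoding="utf-8") as f:
                for record in self._pending:
                    f.write(json.dumps(record) + "\n")
                f.flush()
                os.fsync(f.fileno())  # ask the OS to push the data to stable storage
            written = len(self._pending)
            self._pending.clear()
            return written


def load_persisted(path):
    """Read back every record that has actually reached disk."""
    if not os.path.exists(path):
        return []
    with open(path, encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]
```

Between `add()` and `flush()`, `load_persisted()` sees none of the new records, which is exactly the crash window described above. Shrinking that window (flushing on every request, or writing a log before acknowledging) trades throughput for durability.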

@jeffchuber
Contributor Author

Agree this is under-defined. Let's discuss tomorrow! I am also not at all wed to DuckDB! In some ways, it might just not be the right tool for the job. We will see.
