Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for batch patch (partial update) #2124

Open
ju-bezdek opened this issue Aug 22, 2022 · 7 comments
Open

Support for batch patch (partial update) #2124

ju-bezdek opened this issue Aug 22, 2022 · 7 comments

Comments

@ju-bezdek
Copy link

I need (quite often) to do updates on some of the columns/properties of my class...

But there is no batch update functionality... batch update one-by-one take forever...

For example update of 400k records takes about 20h... which is crazy long... (I can do the same in pandas in about a second..)

I propose 2 functionalities...

  1. Batch update (same as the batch import) ...

  2. Update by query >> for example property of thousands of records by the same value (similar to update table set col1='foo' where x=true )

The first is higher priority for me personally right now

@ju-bezdek
Copy link
Author

Probably duplicate to older issue: Batch update functionality #1980

@ju-bezdek
Copy link
Author

Eech... I've closed it accidentally... reopening

@ju-bezdek ju-bezdek reopened this Sep 28, 2022
@evenfrost
Copy link

It would be great to see this implemented as currently when using a 3rd-party embedding model (e.g., text-embedding-ada-002), you often face a rate limit, and OpenAI suggests performing batch operations in this case.

@mjsteele12
Copy link

+1 would really like this feature, am doing some object updating with LLMs and was surprised that the bulk updates are actually the bottleneck

@cckalen
Copy link

cckalen commented Nov 9, 2023

Surprised this still doesn't exist. the current absence of this feature required the integration of an alternate product on my end... Initially thought about using the containsAny option in the Weaviate where filter but I don't think it's possible?

@etiennedi
Copy link
Member

etiennedi commented Jan 9, 2024

We will lay the groundwork for an efficient batch patch in v1.24 with those features focused on high-frequency updates. Then the next logical step is to deliver Batch Patch in v1.25 v1.26.

EDIT: changed target version.

@etiennedi
Copy link
Member

Created an explicit ticket for update by query which is also mentioned in the OP, so we can track that separately from "Batch Patch": #4760

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

5 participants