You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When trying to use Batch Update operation in a loop to update fields in the payload using SetPayload operations, the requests(ALL) start timing out with high consistent IOWAIT of 45%, disk writes as high as 250 MB/s and disk read of 150 MB/s
The 2 main fields in the payload we are interacting with, are course_name and doc_groups. The collection has 1675570 points for a certain course_name and I am trying to update doc_groups for all of these in a loop with a max batch size of 1500 operations. After some initial updates the all queries to the database start timing out.
Some logs:
update_operations for qdrant: 4
update_operations for qdrant: 4
update_operations for qdrant: 466
Failed to fetch/update documents for page 1 and doc_group group_1, group_2 due to: timed out
update_operations for qdrant: 1499
Failed to fetch/update documents for page 1 and doc_group group_3, group_4 due to: timed out
update_operations for qdrant: 1499
Failed to fetch/update documents for page 1 and doc_group group_5 due to: timed out
update_operations for qdrant: 1
update_operations for qdrant: 10
Failed to fetch/update documents for page 1 and doc_group group_6 due to: timed out
Is there a better way to implement such an operation?
The text was updated successfully, but these errors were encountered:
I notice that you're using filters to achieve this. If possible, it'll be faster to use point IDs directly for updating payloads. If you don't know the associated point IDs, please make sure you have a payload index configured (with the correct type) on both fields you're filtering.
Are you running these batch updates concurrently? If you do, I'd recommend to limit parallelism.
You mention high disk IO. That could be if using memory mapping when it constantly needs to swap memory pages in and out. But, sharing your collection configuration would help us get some insight into that.
Current Behavior
When trying to use Batch Update operation in a loop to update fields in the payload using SetPayload operations, the requests(ALL) start timing out with high consistent IOWAIT of 45%, disk writes as high as 250 MB/s and disk read of 150 MB/s
Steps to Reproduce
Context (Environment)
The 2 main fields in the payload we are interacting with, are
course_name
anddoc_groups
. The collection has 1675570 points for a certaincourse_name
and I am trying to updatedoc_groups
for all of these in a loop with a max batch size of 1500 operations. After some initial updates the all queries to the database start timing out.Some logs:
Is there a better way to implement such an operation?
The text was updated successfully, but these errors were encountered: