-
Notifications
You must be signed in to change notification settings - Fork 2.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
enhance: Avoid merging insert data when buffering insert msgs #33562
enhance: Avoid merging insert data when buffering insert msgs #33562
Conversation
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: congqixia The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
@congqixia E2e jenkins job failed, comment |
1 similar comment
@congqixia E2e jenkins job failed, comment |
/run-cpu-e2e |
@congqixia E2e jenkins job failed, comment |
return ib.currentBuffer().buffer | ||
} | ||
// no error assumed, buffer created before | ||
result, _ := storage.NewInsertDataWithCap(ib.collSchema, int(ib.rows)) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
maybe we don't really need to return a real *storage.InsertData ?
can we instead return a insertDataInterator? so we can avoid one extra copy?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good point. Working on it
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this shall be a major refactory in storage. we shall implement iterator in next PR
See also milvus-io#33561 This PR: - Adds a new param item for insert buffer chunk size - Pre-allocate for each insert buffer preventing frequent `growslice` Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
b48248e
to
ca7bae6
Compare
@congqixia E2e jenkins job failed, comment |
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## master #33562 +/- ##
==========================================
+ Coverage 80.88% 80.90% +0.01%
==========================================
Files 1051 1058 +7
Lines 134863 135201 +338
==========================================
+ Hits 109088 109387 +299
- Misses 21590 21621 +31
- Partials 4185 4193 +8
|
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
@congqixia ut workflow job failed, comment |
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
/lgtm |
See also milvus-io#33561 milvus-io#33562 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also milvus-io#33561 milvus-io#33562 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
…-io#33562) See also milvus-io#33561 This PR: - Use zero copy when buffering insert messages - Make `storage.InsertCodec` support serialize multiple insert data chunk into same batch binlog files Signed-off-by: Congqi Xia <congqi.xia@zilliz.com> --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
…io#33817) See also milvus-io#33561 milvus-io#33562 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
…-io#33562) See also milvus-io#33561 This PR: - Use zero copy when buffering insert messages - Make `storage.InsertCodec` support serialize multiple insert data chunk into same batch binlog files Signed-off-by: Congqi Xia <congqi.xia@zilliz.com> --------- Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
…io#33817) See also milvus-io#33561 milvus-io#33562 Signed-off-by: Congqi Xia <congqi.xia@zilliz.com>
See also #33561
This PR:
storage.InsertCodec
support serialize multiple insert data chunk into same batch binlog filesSigned-off-by: Congqi Xia congqi.xia@zilliz.com