[Enhancement]: query optimization: reduce io when retrieve output fields #31822
sre-ci-robot pushed a commit that referenced this issue on Apr 25, 2024: issue: #31822 --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
czs007 pushed a commit that referenced this issue on May 10, 2024: ) issue: #31822 Signed-off-by: longjiquan <jiquan.long@zilliz.com>
UnyieldingOrca pushed a commit to UnyieldingOrca/milvus that referenced this issue on May 10, 2024: …vus-io#32927) issue: milvus-io#31822 Signed-off-by: longjiquan <jiquan.long@zilliz.com>
sre-ci-robot pushed a commit that referenced this issue on May 15, 2024: issue: #31822 --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>
longjiquan added a commit to longjiquan/milvus that referenced this issue on May 23, 2024: issue: milvus-io#31822 --------- Signed-off-by: longjiquan <jiquan.long@zilliz.com>
sre-ci-robot pushed a commit that referenced this issue on May 23, 2024
Is there an existing issue for this?
What would you like to be added?
The figure above illustrates how Milvus handles a query request.
Steps 1 and 2 are already optimized by pushing the limit operator down to the segment, which reduces the sorting work in step 2.
However, step 1 can still be very inefficient. With mmap enabled, retrieving the output fields can cost far more I/O than reading raw data from memory. Yet many candidates in the segment results are discarded during the reduce phase, so the I/O spent prefetching their output fields is wasted. The same applies to the proxy's reduce.
So I want to optimize the query workflow by changing how the required output fields are retrieved.
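To make the wasted I/O concrete, here is a minimal sketch of one possible reading of this proposal: reduce on primary keys first, then fetch output fields only for the rows that survive. It is not Milvus code. `Segment`, `match_pks`, `fetch`, `eager_query`, and `deferred_query` are all hypothetical names, the `reads` counter is only a stand-in for mmap I/O, and primary keys are assumed to be unique across segments.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """Hypothetical stand-in for a segment; not a Milvus type."""
    rows: dict      # pk -> output-field payload (imagine an mmap'd file)
    reads: int = 0  # counts payload fetches, a rough proxy for I/O

    def match_pks(self, limit):
        # Assumption: primary keys are cheap to scan, so the pushed-down
        # limit can be applied without touching the output fields.
        return sorted(self.rows)[:limit]

    def fetch(self, pk):
        # Each call models one read of the output fields for a row.
        self.reads += 1
        return self.rows[pk]


def eager_query(segments, limit):
    # Every segment fetches output fields for all of its candidates,
    # and the reduce then throws most of them away.
    candidates = []
    for seg in segments:
        for pk in seg.match_pks(limit):
            candidates.append((pk, seg.fetch(pk)))
    candidates.sort(key=lambda c: c[0])
    return candidates[:limit]


def deferred_query(segments, limit):
    # Reduce on (pk, segment index) first, then fetch output fields
    # only for the rows that survive the reduce.
    candidates = []
    for idx, seg in enumerate(segments):
        candidates.extend((pk, idx) for pk in seg.match_pks(limit))
    survivors = sorted(candidates)[:limit]
    return [(pk, segments[idx].fetch(pk)) for pk, idx in survivors]
```

In this toy model, a query with limit 5 over three segments makes the eager path fetch 15 payloads while the deferred path fetches 5. Both return the same rows.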
Why is this needed?
Reduce the I/O spent retrieving output fields and thus increase QPS.
Anything else?
No response