-
Notifications
You must be signed in to change notification settings - Fork 3.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[improve](partition_topn) Add partition threshold check in hash table to control partition nums #39057
[improve](partition_topn) Add partition threshold check in hash table to control partition nums #39057
Conversation
Thank you for your contribution to Apache Doris. Since 2024-03-18, the Document has been moved to doris-website. |
run buildall |
clang-tidy review says "All clean, LGTM! 👍" |
run buildall |
clang-tidy review says "All clean, LGTM! 👍" |
647345b
to
2b39316
Compare
run buildall |
clang-tidy review says "All clean, LGTM! 👍" |
TPC-H: Total hot run time: 40163 ms
|
TPC-H: Total hot run time: 39465 ms
|
run buildall |
clang-tidy review says "All clean, LGTM! 👍" |
9daad35
to
6a24f85
Compare
run buildall |
clang-tidy review says "All clean, LGTM! 👍" |
TPC-H: Total hot run time: 40128 ms
|
TPC-DS: Total hot run time: 189905 ms
|
ClickBench: Total hot run time: 29.96 s
|
345ebcd
to
8259137
Compare
clang-tidy review says "All clean, LGTM! 👍" |
8259137
to
ad301a8
Compare
clang-tidy review says "All clean, LGTM! 👍" |
1 similar comment
clang-tidy review says "All clean, LGTM! 👍" |
clang-tidy review says "All clean, LGTM! 👍" |
5b9de17
e1ae016
to
5b9de17
Compare
run buildall |
TeamCity be ut coverage result: |
TPC-H: Total hot run time: 41771 ms
|
TPC-DS: Total hot run time: 198489 ms
|
ClickBench: Total hot run time: 33.93 s
|
PR approved by at least one committer and no changes requested. |
PR approved by anyone and no changes requested. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
… to control partition nums (#39057) ## Proposed changes 1. Add a session variable to control partition_topn partition threshold 2. move the partition threshold check at emplace data to hash table to control partition nums, so get check every rows. this could improve some bad case about 50%+ performance improvement, and some better case before, after move the check in hash table, maybe have performance degradation almost 10%, I think this is within the acceptable result。 <!--Describe your changes.-->
… to control partition nums (apache#39057) 1. Add a session variable to control partition_topn partition threshold 2. move the partition threshold check at emplace data to hash table to control partition nums, so get check every rows. this could improve some bad case about 50%+ performance improvement, and some better case before, after move the check in hash table, maybe have performance degradation almost 10%, I think this is within the acceptable result。 <!--Describe your changes.-->
Proposed changes
this could improve some bad case about 50%+ performance improvement, and some better case before, after move the check in hash table, maybe have performance degradation almost 10%, I think this is within the acceptable result。