feat: support specify bloom index columns #12048
What's the real usage of this PR? I did not get it.
Bloom filter creation takes a significant amount of CPU resources. For some "wide" tables with lots of columns, creating bloom indexes for all the columns is wasteful.
I agree with that, so let's disable this creation by default?
I am not sure about this. Maybe for tables that are not heavily appended (where ingestion speed is not that critical), and where users are not sure which columns might be involved in point queries, enabling bloom filters by default is not a bad idea. I do not know whether those tables are the common case, though.
I think this PR is only for some special cases.
Fixed, @BohuTANG. Dropping a column removes the dropped column from the bloom index columns. Renaming a column replaces the old name in the bloom index columns with the new column name.
LGTM.
Added some comments to the test scripts, hope it helps
* feat: support specify bloom index columns
* Update tests/sqllogictests/suites/mode/standalone/explain/bloom_filter.test
  Co-authored-by: dantengsky <dantengsky@gmail.com>
* Update tests/sqllogictests/suites/mode/standalone/explain/bloom_filter.test
  Co-authored-by: dantengsky <dantengsky@gmail.com>
---------
Co-authored-by: dantengsky <dantengsky@gmail.com>
I hereby agree to the terms of the CLA available at: https://databend.rs/dev/policies/cla/
Summary
Add a bloom_index_columns option for the fuse engine, which lets users specify the columns that get a bloom index.
To create a table with a bloom index:
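A minimal sketch of such a statement, assuming the option is written after the column definitions and takes a comma-separated list of column names. The table name t and the columns a, b, and c are hypothetical:

```sql
-- Hypothetical table: only columns a and b are listed in bloom_index_columns,
-- so under the assumption above no bloom filter would be built for c.
CREATE TABLE t (a INT, b STRING, c STRING) bloom_index_columns='a,b';
```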
To create or modify the bloom index for an existing table:
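A sketch under the assumption that the option is changed through ALTER TABLE ... SET OPTIONS. The table t and column a are hypothetical:

```sql
-- Hypothetical: restrict the bloom index of table t to column a.
-- Per the PR summary, this replaces any previous bloom_index_columns value
-- and does not build bloom filters for data that is already stored.
ALTER TABLE t SET OPTIONS (bloom_index_columns='a');
```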
The new options replace the existing bloom index options. Changing them does not create bloom filters for existing data.
To disable the bloom index:
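A sketch assuming that an empty column list is what turns the bloom index off. Both that convention and the table name t are assumptions, not confirmed by the text above:

```sql
-- Hypothetical: an empty list is assumed to mean "no bloom index columns".
ALTER TABLE t SET OPTIONS (bloom_index_columns='');
```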