[CARBONDATA-2800][Doc] Add useful tips about bloomfilter datamap #2581
add useful tips about bloomfilter datamap
Force-pushed from f8f1536 to 67b5024.
Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/7620/
Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/6355/
Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/7649/
Build Failed with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/6375/
docs/useful-tips-on-carbondata.md
Outdated
@@ -125,6 +125,10 @@
TBLPROPERTIES ('SORT_COLUMNS'='Dime_1, HOST, MSISDN')
```

**NOTE:**
+ BloomFilter can be created to enhance performance for queries with precise equal/in conditions. You can find more information about it in BloomFilter datamap document.
Please add a link to the BloomFilter datamap document.
OK
Notice that bigger `BLOOM_SIZE` will increase the size of index file
and smaller `BLOOM_FPP` will increase runtime calculation while performing query.
+ '0' skipped blocklets of BloomFilter datamap in explain output indicates that
BloomFilter datamap does not prune better than Main datamap.
"BloomFilter datamap does not prune better than Main datamap." Can you provide more detail about this point?
Added an example scenario
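On the `BLOOM_SIZE` / `BLOOM_FPP` note in the diff above: the trade-off can be illustrated with the textbook Bloom filter sizing formulas. This is only a rough sketch, not CarbonData's implementation. `bloom_parameters` is a hypothetical helper, and it assumes `BLOOM_SIZE` plays the role of the expected number of inserted values and `BLOOM_FPP` the target false-positive probability.

```python
import math


def bloom_parameters(expected_items, fpp):
    """Textbook optimal Bloom filter sizing (illustrative only).

    expected_items: assumed analogue of BLOOM_SIZE (values to be inserted).
    fpp: assumed analogue of BLOOM_FPP (target false-positive probability).
    Returns (number_of_bits, number_of_hash_functions).
    """
    if expected_items <= 0:
        raise ValueError("expected_items must be positive")
    if not 0.0 < fpp < 1.0:
        raise ValueError("fpp must be strictly between 0 and 1")

    # m = -n * ln(p) / (ln 2)^2 : bits needed for n items at false-positive rate p
    bits = math.ceil(-expected_items * math.log(fpp) / (math.log(2) ** 2))
    # k = (m / n) * ln 2 : hash functions evaluated per insert and per lookup
    hashes = max(1, round(bits / expected_items * math.log(2)))
    return bits, hashes


if __name__ == "__main__":
    # Compare a loose and a tight false-positive target for the same item count.
    for fpp in (0.01, 0.001):
        bits, hashes = bloom_parameters(1000, fpp)
        print(f"fpp={fpp}: bits={bits}, hashes={hashes}")
```

Under these formulas, raising the expected item count or lowering the false-positive target both increase the number of bits (a larger index file), and a lower false-positive target also increases the number of hash evaluations per lookup (more work at query time).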
Force-pushed from 1698120 to 1b9f344.
SDV Build Fail, Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/6075/
Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/7668/
Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/6400/
retest sdv please
Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/7701/
Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/6427/
SDV Build Fail, Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/6094/
LGTM
add useful tips about bloomfilter datamap This closes apache#2581
add useful tips about bloomfilter datamap
Be sure to do all of the following checklist to help us incorporate
your contribution quickly and easily:
- Any interfaces changed? NO
- Any backward compatibility impacted? NO
- Document update required? Yes, updated the document
- Testing done. Please provide details on:
  - Whether new unit test cases have been added or why no new tests are required? NA
  - How it is tested? Please attach test report. NA
  - Is it a performance related change? Please attach the performance test report. NA
  - Any additional information to help reviewers in testing this change. NA
For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.