Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

zone map question #16

Closed
sam1988 opened this issue Apr 18, 2013 · 1 comment
Closed

zone map question #16

sam1988 opened this issue Apr 18, 2013 · 1 comment

Comments

@sam1988
Copy link

sam1988 commented Apr 18, 2013

when doing filter ,zone map is used to filter some block that is not between the number of [min,max] ,it has a great help in a sorted column,but seems has little affect on column that not sorted and the number is random?
and also when doing filter, it seems transfer the needed compressed column to device and also to decompress the data to do fiter operation, will it cost some time,i mean whether alenka filter operation has increase speed over same cpu filter operation ?

@antonmks
Copy link
Owner

Yes, this is correct. Zone maps are a bit like indexes. Alenka checks if a segments mix and max column values fall within a filter expression. This way we can skip entire segments. And yes, often it requires sorting data based on filter columns.
About the speed of CPU and GPU operations - I believe that many operations are faster on a GPU even when taking into consideration the cost of data transfers and decompression.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants