New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add GCD codec #53149
Add GCD codec #53149
Conversation
This is an automated comment for commit d416aaf with description of existing statuses. It's updated for the latest CI running
|
Let's add tests. |
0 rows in set. Elapsed: 11.981 sec. Processed 200.25 million rows, 1.60 GB (16.71 million rows/s., 133.72 MB/s.)
0 rows in set. Elapsed: 9.799 sec. Processed 200.25 million rows, 1.60 GB (20.44 million rows/s., 163.50 MB/s.) |
@nickitat review plz :) |
Consolidate GCD codec tests (Follow up to #53149)
Fixes: #48349
Changelog category (leave one):
Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):
Added "GCD" aka. "greatest common denominator" as a new data compression codec. The codec computes the GCD of all column values, and then divides each value by the GCD. The GCD codec is a data preparation codec (similar to Delta and DoubleDelta) and cannot be used stand-alone. It works with data integer, decimal and date/time type. A viable use case for the GCD codec are column values that change (increase/decrease) in multiples of the GCD, e.g. 24 - 28 - 16 - 24 - 8 - 24 (assuming GCD = 4).