Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Small data compression performance #51

Merged
merged 5 commits into from Jul 24, 2022
Merged

Small data compression performance #51

merged 5 commits into from Jul 24, 2022

Conversation

mwlon
Copy link
Owner

@mwlon mwlon commented Jul 24, 2022

  • Reduced number of prefixes used for small data. Default compression ratio should balance compression time vs compressed size for all n (defaulting to 256 was especially slow in the 100 < n < 16k range). E.g. in the 2^10 <= n < 2^12 range this uses up to 64 prefixes by default.
  • Improved prefix optimization speed by about 30%.
  • Added a new dataset of decimal-valued floats

@mwlon mwlon merged commit 9bb9fb2 into main Jul 24, 2022
@mwlon mwlon deleted the small-data-opt branch September 4, 2022 22:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant