Is your feature request related to a problem or challenge? Please describe what you are trying to do.
Currently we only have a vectorised kernel for unpacking 32-bit integers. Smaller or larger destination types are supported by truncating or extending its output.
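To illustrate the approach described above, here is a minimal scalar sketch, not the project's actual code. The function names, the batch size of 32 values, and the LSB-first bit order are assumptions made for the example. It unpacks into `u32` first and then truncates to `u8`, which is the extra pass a dedicated kernel would avoid.

```rust
/// Hypothetical scalar stand-in for the existing 32-bit kernel: reads 32
/// values of `num_bits` bits each (LSB-first, an assumption) from `input`.
fn unpack32_scalar(input: &[u8], output: &mut [u32; 32], num_bits: usize) {
    assert!(num_bits <= 32);
    assert!(input.len() * 8 >= 32 * num_bits);
    for (i, out) in output.iter_mut().enumerate() {
        let mut value = 0u32;
        for bit in 0..num_bits {
            // Absolute position of this bit within the packed stream.
            let pos = i * num_bits + bit;
            let b = (input[pos / 8] >> (pos % 8)) & 1;
            value |= (b as u32) << bit;
        }
        *out = value;
    }
}

/// Narrowing wrapper (hypothetical): unpack to u32, then truncate to u8.
fn unpack_to_u8_via_u32(input: &[u8], num_bits: usize) -> [u8; 32] {
    assert!(num_bits <= 8);
    let mut wide = [0u32; 32];
    unpack32_scalar(input, &mut wide, num_bits);
    let mut narrow = [0u8; 32];
    for (n, w) in narrow.iter_mut().zip(wide.iter()) {
        // Second pass over the data: this is the overhead in question.
        *n = *w as u8;
    }
    narrow
}
```

The intermediate `[u32; 32]` buffer and the second loop are pure overhead when the destination is narrower than 32 bits.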
Describe the solution you'd like
I would like optimised kernels for all four common sizes of packed representation: 8, 16, 32 and 64 bits.
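As a rough sketch of what a dedicated 8-bit kernel could look like, the function below writes directly into `u8` outputs with no wider intermediate. The signature, the batch size of 8 values, and the LSB-first bit order are assumptions for illustration. It uses a const generic for the bit width so the compiler can specialise each width, which is a substitute technique and may differ from what the project uses.

```rust
/// Hypothetical dedicated 8-bit kernel: unpacks 8 values of `NUM_BITS` bits
/// each (LSB-first, an assumption) from `input` directly into `u8` outputs.
fn unpack8<const NUM_BITS: usize>(input: &[u8], output: &mut [u8; 8]) {
    assert!(NUM_BITS <= 8);
    // 8 values * NUM_BITS bits occupy exactly NUM_BITS bytes.
    assert!(input.len() >= NUM_BITS);
    if NUM_BITS == 0 {
        *output = [0; 8];
        return;
    }
    let mask: u16 = (1u16 << NUM_BITS) - 1;
    for i in 0..8 {
        let start_bit = i * NUM_BITS;
        let byte = start_bit / 8;
        let offset = start_bit % 8;
        // Load a second byte only when the value straddles a byte boundary.
        let lo = input[byte] as u16;
        let hi = if offset + NUM_BITS > 8 {
            input[byte + 1] as u16
        } else {
            0
        };
        output[i] = (((lo | (hi << 8)) >> offset) & mask) as u8;
    }
}
```

For example, with `NUM_BITS = 3` the three bytes `[0x88, 0xC6, 0xFA]` unpack to the values `0` through `7`:

```rust
let mut out = [0u8; 8];
unpack8::<3>(&[0x88, 0xC6, 0xFA], &mut out);
assert_eq!(out, [0, 1, 2, 3, 4, 5, 6, 7]);
```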
Describe alternatives you've considered
Add unpack8, unpack16, unpack64 (apache#2276)
Add unpack8, unpack16, unpack64 (#2276) ~10-50% faster (#2278)
* Add unpack8, unpack16, unpack64 (#2276)
* Add zero-extend fallback
* Fix copy-elision
* Switch to using seq_macro
* Remove unused function
* Update docs