ARROW-11839: [C++] Use xsimd for generation of accelerated bit-unpacking #9614

pitrou · 2021-03-02T16:28:39Z

The custom per-ISA code generation scripts (AVX2, AVX512) are replaced with a single code generation script that outputs xsimd code for any SIMD bit-width, in an ISA-agnostic way.

Also add a Neon optimized version of bit-unpacking that leverages the generated code for 128-bit SIMD.

pitrou · 2021-03-02T16:29:34Z

Merging this is currently blocked by xsimd issues xtensor-stack/xsimd#421 and xtensor-stack/xsimd#422.

pitrou · 2021-03-02T16:34:19Z

@jianxind @cyb70289 Feel free to take a look. It would be interesting to get performance numbers on Neon (one affected benchmark should be parquet-encoding-benchmark, perhaps also parquet-column-io-benchmark and parquet-arrow-reader-writer-benchmark).

github-actions · 2021-03-02T19:53:10Z

https://issues.apache.org/jira/browse/ARROW-11839

cyb70289 · 2021-03-03T07:23:42Z

I tested Neon performance of parquet-encoding-benchmark, parquet-column-io-benchmark and parquet-arrow-reader-writer-benchmark. No obvious difference found against master branch.

Compiler does good optimization for bpacking code. Disassembler shows shift and and operations are vectorized. https://godbolt.org/z/41G8Ta

pitrou · 2021-03-03T14:03:31Z

Interesting, thank you.

nealrichardson · 2021-04-12T23:49:13Z

Is there anything left to do here (other than perhaps rebase and hope CI is fixed)?

The custom per-ISA code generation scripts (AVX2, AVX512) are replaced with a single code generation script that outputs xsimd code for any SIMD bit-width, in an ISA-agnostic way. Also add a Neon optimized version of bit-unpacking that leverages the generated code for 128-bit SIMD.

pitrou · 2021-04-13T11:22:23Z

@nealrichardson We must make sure that compilation works on all C++ builds.

pitrou · 2021-04-13T11:35:07Z

Crossbow builds: https://github.com/ursacomputing/crossbow/branches/all?query=build-217

pitrou · 2021-04-13T16:14:09Z

Will merge now.

The custom per-ISA code generation scripts (AVX2, AVX512) are replaced with a single code generation script that outputs xsimd code for any SIMD bit-width, in an ISA-agnostic way. Also add a Neon optimized version of bit-unpacking that leverages the generated code for 128-bit SIMD. Closes apache#9614 from pitrou/ARROW-11839-xsimd-bpacking Authored-by: Antoine Pitrou <antoine@python.org> Signed-off-by: Antoine Pitrou <antoine@python.org>

pitrou force-pushed the ARROW-11839-xsimd-bpacking branch from 3ec021f to ad11400 Compare March 2, 2021 16:42

github-actions bot added the Component: C++ label Mar 2, 2021

pitrou force-pushed the ARROW-11839-xsimd-bpacking branch 2 times, most recently from c442ff6 to 74b577f Compare April 6, 2021 16:27

pitrou added 2 commits April 13, 2021 13:20

Bump xsimd version

57e0857

pitrou force-pushed the ARROW-11839-xsimd-bpacking branch from 74b577f to 57e0857 Compare April 13, 2021 11:21

pitrou closed this in d7558bf Apr 13, 2021

pitrou deleted the ARROW-11839-xsimd-bpacking branch April 13, 2021 16:14

asfimport mentioned this pull request Apr 13, 2021

[C++] Rewrite bit-unpacking optimizations using xsimd #27686

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ARROW-11839: [C++] Use xsimd for generation of accelerated bit-unpacking #9614

ARROW-11839: [C++] Use xsimd for generation of accelerated bit-unpacking #9614

pitrou commented Mar 2, 2021

pitrou commented Mar 2, 2021

pitrou commented Mar 2, 2021

github-actions bot commented Mar 2, 2021

cyb70289 commented Mar 3, 2021

pitrou commented Mar 3, 2021

nealrichardson commented Apr 12, 2021

pitrou commented Apr 13, 2021

pitrou commented Apr 13, 2021

pitrou commented Apr 13, 2021

ARROW-11839: [C++] Use xsimd for generation of accelerated bit-unpacking #9614

ARROW-11839: [C++] Use xsimd for generation of accelerated bit-unpacking #9614

Conversation

pitrou commented Mar 2, 2021

pitrou commented Mar 2, 2021

pitrou commented Mar 2, 2021

github-actions bot commented Mar 2, 2021

cyb70289 commented Mar 3, 2021

pitrou commented Mar 3, 2021

nealrichardson commented Apr 12, 2021

pitrou commented Apr 13, 2021

pitrou commented Apr 13, 2021

pitrou commented Apr 13, 2021