[C++][Parquet] Remove AVX512 variants of BYTE_STREAM_SPLIT encoding #40095

pitrou · 2024-02-15T17:41:07Z

Describe the enhancement requested

According to previous observations, it seems the AVX512 accelerations of BYTE_STREAM_SPLIT perform equal to or worse than their AVX2 counterparts.

Besides, the SSE2 and AVX2 accelerations already run at 5-10 GB/s or more, which is amply fast.

We could therefore simply remove the AVX512 accelerations.

Component(s)

Benchmarking, C++, Parquet

pitrou · 2024-02-15T17:41:26Z

@cyb70289 @wgtmac @mapleFU @felipecrv Opinions on this?

felipecrv · 2024-02-15T22:23:18Z

> According to previous observations, it seems the AVX512 accelerations of BYTE_STREAM_SPLIT perform equal to or worse than their AVX2 counterparts.

Isn't that due to the effects of down-clocking when the CPU is executing too many AVX-512 instructions? It might perform better on future CPUs, or for users who have a way to avoid that down-clocking.

pitrou · 2024-02-16T08:54:41Z

I don't think all CPUs have down-clocking, do they? But regardless, if it doesn't provide a significant benefit, it doesn't make much sense to keep those codepaths, IMHO.

cyb70289 · 2024-02-16T09:00:30Z

As I remember, I normalized the benchmarks by CPU frequency, and AVX512 was still worse than AVX2 (and maybe SSE4) on Cascade Lake.
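For readers unfamiliar with the normalization mentioned above: one common way to factor out down-clocking is to divide measured throughput by the clock frequency the core ran at, giving bytes processed per cycle. The sketch below is only an illustration of that arithmetic; `BytesPerCycle` is a hypothetical helper, not something from Arrow or from the original benchmarks.

```cpp
// Hypothetical helper (not part of Arrow): convert a measured throughput in
// GB/s and the core clock in GHz into bytes processed per clock cycle.
// (GB/s) / (Gcycles/s) = bytes per cycle.
double BytesPerCycle(double gigabytes_per_second, double clock_ghz) {
  return gigabytes_per_second / clock_ghz;
}
```

With made-up inputs, a variant measured at 9.0 GB/s on a 3.0 GHz core does 3.0 bytes per cycle, while one measured at 7.0 GB/s on a core that dropped to 2.5 GHz does 2.8. If the second figure is still lower after this normalization, down-clocking alone does not explain the gap.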

felipecrv · 2024-02-16T15:47:14Z

> I don't think all CPUs have down-clocking, do they? But regardless, if it doesn't provide a significant benefit, it doesn't make much sense to keep those codepaths, IMHO.

Alright, then it might be a case of memory bandwidth being the bottleneck and not CPU uops per second. Let's remove it. ✂️

mapleFU · 2024-02-17T08:11:10Z

I think BYTE_STREAM_SPLIT encoding/decoding might be memory-bound operations. We can remove the AVX512 implementation for now. If anyone wants to improve it or needs AVX512, we can always bring it back.
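To make the memory-bound hypothesis concrete, here is a minimal scalar sketch of what BYTE_STREAM_SPLIT does for 4-byte values: byte k of every value is scattered into stream k, and the streams are stored back to back. The function names are hypothetical and this is not Arrow's implementation (which uses SIMD shuffles); it only shows that each byte is read once and written once with no arithmetic in between.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical scalar reference, assuming 4-byte float values.
// Byte k of value i lands at position i of stream k; streams are concatenated.
std::vector<uint8_t> ByteStreamSplitEncode(const std::vector<float>& values) {
  constexpr size_t kWidth = sizeof(float);
  const size_t n = values.size();
  std::vector<uint8_t> out(n * kWidth);
  const uint8_t* raw = reinterpret_cast<const uint8_t*>(values.data());
  for (size_t i = 0; i < n; ++i) {
    for (size_t k = 0; k < kWidth; ++k) {
      out[k * n + i] = raw[i * kWidth + k];
    }
  }
  return out;
}

// Inverse transform: gather byte k of value i back from stream k.
std::vector<float> ByteStreamSplitDecode(const std::vector<uint8_t>& encoded) {
  constexpr size_t kWidth = sizeof(float);
  const size_t n = encoded.size() / kWidth;
  std::vector<float> values(n);
  uint8_t* raw = reinterpret_cast<uint8_t*>(values.data());
  for (size_t i = 0; i < n; ++i) {
    for (size_t k = 0; k < kWidth; ++k) {
      raw[i * kWidth + k] = encoded[k * n + i];
    }
  }
  return values;
}
```

Since the transform is a pure byte permutation, wider vector registers mostly reduce instruction count. If loads and stores are already the limiting factor, that would be consistent with AVX512 showing no gain over AVX2.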

wgtmac · 2024-02-17T15:41:55Z

This removal looks reasonable. I have consulted some people at Intel on this but didn't get any useful answer.

…encoding (#40127)

Two reasons:
* the SSE2 and AVX2 variants are already fast enough (on the order of 10 GB/s)
* the AVX512 variants do not seem faster, and can even be slower, on tested Intel machines

* Closes: #40095

Authored-by: Antoine Pitrou <antoine@python.org>
Signed-off-by: Antoine Pitrou <antoine@python.org>

pitrou added the Type: enhancement label Feb 15, 2024

github-actions bot added Component: Parquet Component: C++ Component: Benchmarking labels Feb 15, 2024

github-actions bot mentioned this issue Feb 19, 2024

GH-40095: [C++][Parquet] Remove AVX512 variants of BYTE_STREAM_SPLIT encoding #40127

Merged

github-actions bot assigned pitrou Feb 19, 2024

pitrou closed this as completed in #40127 Feb 19, 2024

pitrou added this to the 16.0.0 milestone Feb 19, 2024
