ARROW-8843: [C++] Compare bitmaps in words #7285

cyb70289 · 2020-05-27T02:16:57Z

Unaligned bitmap comparisons are currently processed bit by bit.
Comparing word by word in uint64 improves performance significantly.
A benchmark (comparing two identical bitmaps with 64K bits) jumps from
86M/s to 5.7G/s on my test machine.

NOTE: This patch may hurt performance if two bitmaps differ in the very
first bits, as it always loads and compares 64 bits if possible. A
bit-by-bit comparison would notice the difference and return false earlier.
This should not be a problem in practice.
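
To make the description above concrete, here is a minimal sketch of the two strategies. It is not the code from this patch: the names GetBitSketch, LoadWordAt, BitmapEqualsBitwise and BitmapEqualsWordwise are hypothetical, and the word load assembles bytes by hand instead of using Arrow's helpers. It only assumes the usual LSB-first bit numbering within each byte.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not Arrow's API): read bit i of an LSB-first bitmap.
inline bool GetBitSketch(const uint8_t* bits, int64_t i) {
  return ((bits[i / 8] >> (i % 8)) & 1) != 0;
}

// Baseline: compare one bit at a time.
inline bool BitmapEqualsBitwise(const uint8_t* left, int64_t left_offset,
                                const uint8_t* right, int64_t right_offset,
                                int64_t length) {
  for (int64_t i = 0; i < length; ++i) {
    if (GetBitSketch(left, left_offset + i) != GetBitSketch(right, right_offset + i)) {
      return false;
    }
  }
  return true;
}

// Gather the 64 bits starting at bit_pos into one word. The caller must
// guarantee that all 64 bits lie inside the bitmap; the ninth byte is only
// touched when the position is not byte-aligned, in which case it holds
// some of those 64 bits.
inline uint64_t LoadWordAt(const uint8_t* bits, int64_t bit_pos) {
  const uint8_t* p = bits + bit_pos / 8;
  const int shift = static_cast<int>(bit_pos % 8);
  uint64_t word = 0;
  for (int k = 0; k < 8; ++k) {
    word |= static_cast<uint64_t>(p[k]) << (8 * k);  // endian-independent load
  }
  if (shift == 0) return word;  // avoid an undefined shift by 64 below
  return (word >> shift) | (static_cast<uint64_t>(p[8]) << (64 - shift));
}

// Word-wise variant: 64 bits per iteration, then a bit-by-bit tail.
inline bool BitmapEqualsWordwise(const uint8_t* left, int64_t left_offset,
                                 const uint8_t* right, int64_t right_offset,
                                 int64_t length) {
  assert(left_offset >= 0 && right_offset >= 0 && length >= 0);
  int64_t i = 0;
  for (; i + 64 <= length; i += 64) {
    if (LoadWordAt(left, left_offset + i) != LoadWordAt(right, right_offset + i)) {
      return false;
    }
  }
  for (; i < length; ++i) {
    if (GetBitSketch(left, left_offset + i) != GetBitSketch(right, right_offset + i)) {
      return false;
    }
  }
  return true;
}
```

The trade-off mentioned in the NOTE is visible here: the word-wise loop always assembles a full 64-bit word before it can report a mismatch, whereas the bit-wise loop can return after the first differing bit.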

github-actions · 2020-05-27T02:31:37Z

https://issues.apache.org/jira/browse/ARROW-8843

cyb70289 · 2020-05-27T09:55:12Z

Before (apply only benchmark code)

BitmapEqualsWithoutOffset/8192                195 ns          195 ns      3618366 bytes_per_second=39.1055G/s
BitmapEqualsWithOffset/8192                 91272 ns        91235 ns         7673 bytes_per_second=85.6307M/s

After

BitmapEqualsWithoutOffset/8192                193 ns          193 ns      3635971 bytes_per_second=39.6261G/s
BitmapEqualsWithOffset/8192                  1332 ns         1331 ns       510343 bytes_per_second=5.73058G/s

fsaintjacques · 2020-05-29T12:48:08Z

NOTE: This patch may hurt performance if two bitmaps differ in the very
first bits, as it always loads and compares 64 bits if possible. A
bit-by-bit comparison would notice the difference and return false earlier.
This should not be a problem in practice.

This should be negligible in practice. The processor still loads data in units of words.

fsaintjacques

I think the code can be improved a bit for readability.

fsaintjacques · 2020-05-29T14:32:44Z

cpp/src/arrow/util/bit_util.cc

+  right_offset %= 8;
+
+  // process in 64 bits
+  int64_t nwords = bit_length / 64;


Use snake_case for variables, e.g. n_words.

Thank you. Will change.
A quick question: is there a formal doc for the Arrow C++ coding style?
I see class names are camel case.
Most variables are snake case, with some exceptions.
Most function names are camel case, with some exceptions (should simple getters be snake case?).

https://google.github.io/styleguide/cppguide.html

We indeed use snake_case for member accessors (this used to be the guidance in the Google C++ style guide but they might have made some changes).

FWIW I think nwords is fine, we use nbytes in plenty of places.

fsaintjacques · 2020-05-29T14:35:01Z

cpp/src/arrow/util/bit_util.cc

+  if (nwords > 1) {
+    bit_length -= (nwords - 1) * 64;
+
+    uint64_t left_word0 = BitUtil::ToLittleEndian(util::SafeLoadAs<uint64_t>(left));


It would be more readable if you wrap load into a lambda. The compiler will inline it.

auto load_word = [](const uint8_t* bytes) {
  return BitUtil::ToLittleEndian(util::SafeLoadAs<uint64_t>(bytes));
};
auto left_word0 = load_word(left);
auto right_word0 = load_word(right);
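
For readers without the Arrow sources at hand, the suggestion above can be tried in isolation. In the sketch below, SafeLoadU64Sketch and ToLittleEndianSketch are hypothetical stand-ins for util::SafeLoadAs<uint64_t> and BitUtil::ToLittleEndian; their bodies are assumptions about what such helpers typically do, not Arrow's implementation. FirstWordsEqual is likewise an illustrative name.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Stand-in for an unaligned-safe load: memcpy avoids the undefined
// behaviour of casting a misaligned uint8_t* to uint64_t*.
inline uint64_t SafeLoadU64Sketch(const uint8_t* bytes) {
  assert(bytes != nullptr);
  uint64_t value;
  std::memcpy(&value, bytes, sizeof(value));
  return value;
}

// Stand-in for an endianness fix-up: returns the value unchanged on a
// little-endian host and byte-swapped on a big-endian one.
inline uint64_t ToLittleEndianSketch(uint64_t value) {
  const uint16_t probe = 1;
  uint8_t first_byte;
  std::memcpy(&first_byte, &probe, 1);
  if (first_byte == 1) return value;  // little-endian host
  uint64_t swapped = 0;
  for (int k = 0; k < 8; ++k) {
    swapped = (swapped << 8) | (value & 0xff);
    value >>= 8;
  }
  return swapped;
}

// Compare the first 64 bits of two byte-aligned bitmaps through one
// shared lambda, so the load expression is written only once.
inline bool FirstWordsEqual(const uint8_t* left, const uint8_t* right) {
  auto load_word = [](const uint8_t* bytes) {
    return ToLittleEndianSketch(SafeLoadU64Sketch(bytes));
  };
  return load_word(left) == load_word(right);
}
```

The lambda captures nothing, so a compiler is free to inline it at both call sites.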

fsaintjacques · 2020-05-29T15:07:31Z

cpp/src/arrow/util/bit_util.cc

+    uint64_t left_word0 = BitUtil::ToLittleEndian(util::SafeLoadAs<uint64_t>(left));
+    uint64_t right_word0 = BitUtil::ToLittleEndian(util::SafeLoadAs<uint64_t>(right));
+
+    do {


Make a shift lambda:

auto shift = [](uint64_t word, uint64_t next, uint8_t shift) -> uint64_t {
  if (shift == 0) return word;
  return (word >> shift) | (next << (64 - shift));
};
auto next = load_word(left);
auto left_word = shift(left_word0, next, left_offset);
left_word0 = next;

I also think it would be worth transforming the loop into a fixed for-loop.

Done. Much cleaner code after refinement. Thanks.
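
The shift-and-carry idea from this thread, combined with a fixed for-loop, might look roughly like the sketch below. It is an illustration under stated assumptions, not the merged code: WordsEqualShifted is a hypothetical name, both offsets are assumed to be already reduced to the range 0..7, and both buffers are assumed to have at least n_words * 8 readable bytes. Because each output word needs bits from the following input word, n_words loaded words yield (n_words - 1) * 64 compared bits.

```cpp
#include <cassert>
#include <cstdint>

// Compare (n_words - 1) * 64 bits of two bitmaps whose data starts at bit
// left_offset / right_offset (each in [0, 8)) of the first byte.
// Reads n_words * 8 bytes from each buffer.
inline bool WordsEqualShifted(const uint8_t* left, int left_offset,
                              const uint8_t* right, int right_offset,
                              int64_t n_words) {
  assert(left_offset >= 0 && left_offset < 8);
  assert(right_offset >= 0 && right_offset < 8);

  // Endian-independent 64-bit load, assembled byte by byte.
  auto load_word = [](const uint8_t* bytes) -> uint64_t {
    uint64_t word = 0;
    for (int k = 0; k < 8; ++k) {
      word |= static_cast<uint64_t>(bytes[k]) << (8 * k);
    }
    return word;
  };

  // Drop `shift` low bits of `word` and refill the top from `next`.
  // The zero case is handled separately: shifting by 64 is undefined.
  auto shift_word = [](uint64_t word, uint64_t next, int shift) -> uint64_t {
    if (shift == 0) return word;
    return (word >> shift) | (next << (64 - shift));
  };

  if (n_words < 2) return true;  // no complete shifted word to compare

  uint64_t left_cur = load_word(left);
  uint64_t right_cur = load_word(right);
  for (int64_t w = 1; w < n_words; ++w) {
    const uint64_t left_next = load_word(left + w * 8);
    const uint64_t right_next = load_word(right + w * 8);
    if (shift_word(left_cur, left_next, left_offset) !=
        shift_word(right_cur, right_next, right_offset)) {
      return false;
    }
    left_cur = left_next;  // carry the loaded word into the next iteration
    right_cur = right_next;
  }
  return true;
}
```

Each input word is loaded once and reused as the carry for the next iteration, which is what keeps the loop body small. Any trailing bits beyond the compared range would still need a separate bit-by-bit pass.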

fsaintjacques · 2020-06-01T18:38:58Z

@cyb70289 Thanks for this. For future reference, if you add a new benchmark, do it first as a separate commit, and then add the improvement in a second, chronologically later commit. You'll be able to check the difference with ursabot, e.g. @ursabot benchmark [filter] <sha_of_first_commit>.

cyb70289 force-pushed the bitmap-equals branch 4 times, most recently from b6b716b to 9c6b82c Compare May 27, 2020 06:32

cyb70289 force-pushed the bitmap-equals branch from 9c6b82c to 0e63ebc Compare May 27, 2020 06:58

fsaintjacques self-requested a review May 27, 2020 16:17

wesm requested a review from pitrou May 29, 2020 16:33

fsaintjacques reviewed May 29, 2020

refine with lambda

fd5a843

cyb70289 force-pushed the bitmap-equals branch from cd03e60 to fd5a843 Compare June 1, 2020 04:13

fsaintjacques approved these changes Jun 1, 2020

fsaintjacques closed this in d25ccf4 Jun 1, 2020

cyb70289 deleted the bitmap-equals branch June 2, 2020 01:44

asfimport mentioned this pull request Jun 1, 2020

[C++] Optimize BitmapEquals unaligned case #17188

Closed

