New bit manipulation functions and 128-bit value library #7338

pdillinger · 2020-09-01T22:27:18Z

Summary: These new functions and 128-bit value bit operations are
expected to be used in a forthcoming Bloom filter alternative.

No functional changes to production code, just new code only called by
unit tests, cosmetic changes to existing headers, and fix an existing
function for a yet-unused template instantiation (BitsSetToOne on
something signed and smaller than 32 bits).

Test Plan: Unit tests included. Works with and without
TEST_UINT128_COMPAT=1 to check compatibility with and without
__uint128_t. Also added that parameter to the CircleCI build
build-linux-shared_lib-alt_namespace-status_checked.

Summary: These new functions and 128-bit value bit operations are expected to be used in a forthcoming Bloom filter alternative. No functional changes to production code, just new code only called by unit tests, cosmetic changes to existing headers, and fix an existing function for a yet-unused template instantiation (BitsSetToOne on something signed and smaller than 32 bits). Test Plan: Unit tests included. Works with and without TEST_UINT128_COMPAT=1 to check compatibility with and without __uint128_t. Also added that parameter to the CircleCI build build-linux-shared_lib-alt_namespace-status_checked.

jay-zhuang

LGTM. just a few comments and questions.

jay-zhuang · 2020-09-02T16:16:23Z

util/math.h

+  } else if (sizeof(T) <= sizeof(unsigned long)) {
+    return __builtin_ctzl(static_cast<unsigned long>(v));
+  } else {
+    return __builtin_ctzll(static_cast<unsigned long>(v));


should it be static_cast<unsigned long long>?

jay-zhuang · 2020-09-02T17:00:00Z

util/math.h

+  if (sizeof(T) <= sizeof(unsigned int)) {
+    int lz = __builtin_clz(static_cast<unsigned int>(v));
+    return int{sizeof(unsigned int)} * 8 - 1 - lz;
+  } else {
+    int lz = __builtin_clzll(static_cast<unsigned long long>(v));
+    return int{sizeof(unsigned long long)} * 8 - 1 - lz;
+  }


do we need to have __buildin_clzl() like other functions do? like:

else if (sizeof(T) <= sizeof(unsigned long)) { return lz lz = __buildin_clzl(static_cast<unsigned long>(v)); } else {

jay-zhuang · 2020-09-02T17:06:22Z

util/math.h

  }
 #else
  static_assert(sizeof(T) <= sizeof(unsigned long long), "type too big");
  if (sizeof(T) > sizeof(unsigned long)) {
    return __builtin_popcountll(static_cast<unsigned long long>(v));
  } else if (sizeof(T) > sizeof(unsigned int)) {
    return __builtin_popcountl(static_cast<unsigned long>(v));
-  } else {
+  } else if (sizeof(T) == sizeof(unsigned int)) {
    return __builtin_popcount(static_cast<unsigned int>(v));


nit: maybe better to have a consistent ordering of checks for all functions. like from type size from small to big:

if (size < sizeof(unsigned int)) { } else if (size == sizeof(unsigned int)) { } else if (size <= sizeof(unsigned long)) { } else { }

Makefile

jay-zhuang · 2020-09-02T18:51:06Z

util/math128.h

+  tmp += uint64_t{b & 0xffffFFFF} * uint64_t{a >> 32};
+  // Avoid overflow: first add lower 32 of tmp2, and later upper 32
+  uint64_t tmp2 = uint64_t{b >> 32} * uint64_t{a & 0xffffFFFF};
+  tmp += static_cast<uint32_t>(tmp2);


nit: tmp += tmp2 & 0xffffFFFF; to be consistent with others?

facebook-github-bot

@pdillinger has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

jay-zhuang · 2020-09-03T15:52:52Z

util/math128.h

+  uint64_t tmp = uint64_t{b & 0xffffFFFF} * uint64_t{a & 0xffffFFFF};
+  uint64_t lower = tmp & 0xffffFFFF;
+  tmp >>= 32;
+  tmp += uint64_t{b & 0xffffFFFF} * uint64_t{a >> 32};


It took me a while to figure out that it won't overflow 🤦
UINT_MAX * UINT_MAX + UINT_MAX + UINT_MAX = ULLONG_MAX

facebook-github-bot · 2020-09-03T16:35:59Z

@pdillinger merged this pull request in c4d8838.

Summary: These new functions and 128-bit value bit operations are expected to be used in a forthcoming Bloom filter alternative. No functional changes to production code, just new code only called by unit tests, cosmetic changes to existing headers, and fix an existing function for a yet-unused template instantiation (BitsSetToOne on something signed and smaller than 32 bits). Pull Request resolved: facebook#7338 Test Plan: Unit tests included. Works with and without TEST_UINT128_COMPAT=1 to check compatibility with and without __uint128_t. Also added that parameter to the CircleCI build build-linux-shared_lib-alt_namespace-status_checked. Reviewed By: jay-zhuang Differential Revision: D23494945 Pulled By: pdillinger fbshipit-source-id: 5c0dc419100d9df5d4d9abb153b2855d5aea39e8

pdillinger requested a review from jay-zhuang September 1, 2020 22:27

facebook-github-bot added the CLA Signed label Sep 1, 2020

pdillinger force-pushed the more_math branch from 6767c18 to f010b5c Compare September 1, 2020 22:56

pdillinger force-pushed the more_math branch from f010b5c to 7c27667 Compare September 1, 2020 23:00

Fix technical UB on shifting

6d685c0

pdillinger mentioned this pull request Sep 2, 2020

Add options for forcing AVX and AVX2 instructions #7334

Closed

jay-zhuang reviewed Sep 2, 2020

View reviewed changes

jay-zhuang approved these changes Sep 2, 2020

View reviewed changes

Polish / fixes from feedback

5da4350

facebook-github-bot reviewed Sep 3, 2020

View reviewed changes

jay-zhuang reviewed Sep 3, 2020

View reviewed changes

facebook-github-bot closed this in c4d8838 Sep 3, 2020

facebook-github-bot added the Merged label Sep 3, 2020

This was referenced Aug 30, 2022

Unsupport type '__uint128_t' when I used arm-linux-gnueabihf rust-rocksdb/rust-rocksdb#681

Closed

Fix int128 compatibility check(Issue #681) rust-rocksdb/rust-rocksdb#682

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New bit manipulation functions and 128-bit value library #7338

New bit manipulation functions and 128-bit value library #7338

pdillinger commented Sep 1, 2020

jay-zhuang left a comment

jay-zhuang Sep 2, 2020

jay-zhuang Sep 2, 2020

jay-zhuang Sep 2, 2020 •

edited

Loading

jay-zhuang Sep 2, 2020

facebook-github-bot left a comment

jay-zhuang Sep 3, 2020

facebook-github-bot commented Sep 3, 2020

New bit manipulation functions and 128-bit value library #7338

New bit manipulation functions and 128-bit value library #7338

Conversation

pdillinger commented Sep 1, 2020

jay-zhuang left a comment

Choose a reason for hiding this comment

jay-zhuang Sep 2, 2020

Choose a reason for hiding this comment

jay-zhuang Sep 2, 2020

Choose a reason for hiding this comment

jay-zhuang Sep 2, 2020 • edited Loading

Choose a reason for hiding this comment

jay-zhuang Sep 2, 2020

Choose a reason for hiding this comment

facebook-github-bot left a comment

Choose a reason for hiding this comment

jay-zhuang Sep 3, 2020

Choose a reason for hiding this comment

facebook-github-bot commented Sep 3, 2020

jay-zhuang Sep 2, 2020 •

edited

Loading