Fix `euclidean_dist` in IVF-Flat search #1122

Nyrio · 2023-01-05T17:00:57Z

This is a tricky bug so the fix deserves some explanation. The previous implementation of euclidean_dist was the following in vectorized cases, where x and y are int32 vectors of 4 int8 each and acc is a single int32 number to accumulate the distance in:

// Compute vectorized absolute differences independently.
const auto diff = static_cast<int32_t>(__vabsdiffs4(x, y));
// Square, reduce, and add to the accumulator.
acc = dp4a(diff, diff, acc);

Now consider the following case:

x = 0x80; // -128, 0, 0, 0
y = 0x7f; //  127, 0, 0, 0

The difference between -128 and 127 is 255, represented as FF (__vabsdiffs4 is smart enough not to compute abs(a-b) which would result in 01). However, if we call the signed version of dp4a, FF is cast from int8 to int32 as FFFFFFFF (or -1). The square of -1 is 1, which is added to acc (instead of 65025).

As the output of __vabsdiffs4 is correct when considered as an unsigned number, and as addition is the same for signed and unsigned in 2's complement (and acc is positive anyway), the easiest fix is to use the unsigned version of dp4a, which will cast overflowed differences properly to 32 bits. The previous code simply becomes:

const auto diff = __vabsdiffs4(x, y);
acc = dp4a(diff, diff, static_cast<uint32_t>(acc));

Additionally, to avoid underflows in the non-vectorized unsigned case, I replaced the subtraction with __usad (absolute difference of unsigned numbers). Note that using the subtraction was correct anyway, because the addition/subtraction is the same for unsigned and signed integers, as well as the least significant half of the multiplication (which is the part that is stored), and the square of a number is also the square of its opposite. Consider:

uint32_t a = 10;
uint32_t b = 20;
uint32_t c = a - b; // fffffff6, i.e -10 or 4294967286
uint32_t d = c * c; // (ffffffec)00000064, i.e 100

achirkin

Wow, that must have been a tough investigation, good job! You've explained the problem very nice here, but maybe it's worth adding a short gist of it (or a link) in the code?

cpp/include/raft/spatial/knn/detail/ivf_flat_search.cuh

…underflowing unsigned subtraction with __usad for the absolute difference

…istance

tfeher

Thanks Louis for fixing this problem, the PR looks good to me!

Could you update the second part of the PR description? You say

added comment in the unsigned, non-vectorized implementation

But that has changed during the review.

Nyrio · 2023-01-09T20:18:59Z

Could you update the second part of the PR description?

Ah, yes, done.

cjnolet

Changes LGTM. Thanks @Nyrio!

achirkin

LGTM!

cjnolet · 2023-01-10T13:18:56Z

/merge

Nyrio added 3 commits January 5, 2023 16:22

Fix euclidean_dist in ivf_flat_search

7e1bcd9

Offsetting is unnecessary

4b31ddf

Use dp4a wrapper instead of instrinsic __dp4a

615d3f6

Nyrio requested review from a team as code owners January 5, 2023 17:00

github-actions bot added cpp python labels Jan 5, 2023

Nyrio added bug Something isn't working 3 - Ready for Review non-breaking Non-breaking change labels Jan 5, 2023

Nyrio requested review from tfeher and achirkin January 5, 2023 17:03

Nyrio self-assigned this Jan 5, 2023

achirkin requested changes Jan 6, 2023

View reviewed changes

cpp/include/raft/spatial/knn/detail/ivf_flat_search.cuh Show resolved Hide resolved

cpp/include/raft/spatial/knn/detail/ivf_flat_search.cuh Outdated Show resolved Hide resolved

Nyrio added 2 commits January 9, 2023 12:34

Add comment to clarify why we use unsigned version of dp4a + replace …

7033213

…underflowing unsigned subtraction with __usad for the absolute difference

Merge remote-tracking branch 'origin/branch-23.02' into bug-ivfflat-d…

a6666ae

…istance

Nyrio requested a review from achirkin January 9, 2023 11:35

tfeher approved these changes Jan 9, 2023

View reviewed changes

cjnolet approved these changes Jan 9, 2023

View reviewed changes

achirkin approved these changes Jan 10, 2023

View reviewed changes

rapids-bot bot merged commit b5c2b39 into rapidsai:branch-23.02 Jan 10, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix `euclidean_dist` in IVF-Flat search #1122

Fix `euclidean_dist` in IVF-Flat search #1122

Nyrio commented Jan 5, 2023 •

edited

Loading

achirkin left a comment

tfeher left a comment

Nyrio commented Jan 9, 2023

cjnolet left a comment

achirkin left a comment

cjnolet commented Jan 10, 2023

Fix euclidean_dist in IVF-Flat search #1122

Fix euclidean_dist in IVF-Flat search #1122

Conversation

Nyrio commented Jan 5, 2023 • edited Loading

achirkin left a comment

Choose a reason for hiding this comment

tfeher left a comment

Choose a reason for hiding this comment

Nyrio commented Jan 9, 2023

cjnolet left a comment

Choose a reason for hiding this comment

achirkin left a comment

Choose a reason for hiding this comment

cjnolet commented Jan 10, 2023

Fix `euclidean_dist` in IVF-Flat search #1122

Fix `euclidean_dist` in IVF-Flat search #1122

Nyrio commented Jan 5, 2023 •

edited

Loading