Fix TestHnswByteVectorGraph.testSortedAndUnsortedIndicesReturnSameResults #13361

timgrein · 2024-05-12T09:49:27Z

Closes #13210

Description

The following test failed as it produced two different lists of ids for a sorted and unsorted HNSW byte vector graph as one graph didn't find a higher scoring doc the other one found:
gradlew test --tests TestHnswByteVectorGraph.testSortedAndUnsortedIndicesReturnSameResults -Dtests.seed=B41BEC5619361A16 -Dtests.locale=hi-IN -Dtests.timezone=Atlantic/Stanley -Dtests.asserts=true -Dtests.file.encoding=UTF-8

Considering that the graphs of 2 indices are organized differently we need to explore a lot of candidates to ensure that both searchers find the same docs. Increasing beamWidth (number of nearest neighbor candidates to track while searching the graph for each newly inserted node) from 5 to 10 fixes the test.

…dAndUnsortedIndicesReturnSameResults.

benwtrent · 2024-05-14T11:40:46Z

@timgrein could you determine if the scores the same or not? I wonder if we are getting tripped up by doc IDs being the tie breaker for equal scores.

timgrein · 2024-05-14T14:23:36Z

@benwtrent

Without increasing k we'll get the following for the failing test instance:

TOP 1 docs:
Document<stored<id:23>> 9.601536E-5
Document<stored<id:119>> 7.3713694E-5
Document<stored<id:163>> 7.087675E-5
Document<stored<id:148>> 7.051192E-5
Document<stored<id:51>> 6.879472E-5
  
TOP 2 docs:
Document<stored<id:23>> 9.601536E-5
Document<stored<id:193>> 8.53898E-5
Document<stored<id:119>> 7.3713694E-5
Document<stored<id:163>> 7.087675E-5
Document<stored<id:148>> 7.051192E-5

(So it seems like the first/unsorted index doesn't find document 193, when k is too small (60); I guess due to the different index structure?)

Increasing k to 80 leads to the following results for the previously failing test instance:

TOP 1 docs:
Document<stored<id:23>> 9.601536E-5
Document<stored<id:193>> 8.53898E-5
Document<stored<id:119>> 7.3713694E-5
Document<stored<id:163>> 7.087675E-5
Document<stored<id:148>> 7.051192E-5

TOP 2 docs:
Document<stored<id:23>> 9.601536E-5
Document<stored<id:193>> 8.53898E-5
Document<stored<id:119>> 7.3713694E-5
Document<stored<id:163>> 7.087675E-5
Document<stored<id:148>> 7.051192E-5

benwtrent · 2024-05-14T15:02:45Z

@timgrein what is the beamwidth set to in the failing case?

We may want to increase the beamWidth size to just make the test more consistent.

int beamWidth = random().nextInt(10) + 10; // from the previous base of 5

timgrein · 2024-05-14T15:14:37Z

@benwtrent The beam width for the failing test case was the smallest value possible 5. Increased the minimum to 10 according to your suggestion (which also fixes the test without increasing k). Do we still want to keep the increased k? Increasing it didn't seem to affect execution time too much at least for this test instance and it should probably reduce the likelihood of a flaky fail even further.

benwtrent · 2024-05-14T19:10:13Z

Do we still want to keep the increased k?

I would rather not, we keep bumping it up, eventually we are going to stop searching in the graph altogether and just brute force, which ruins the reason for the test.

timgrein · 2024-05-15T07:43:01Z

eventually we are going to stop searching in the graph altogether and just brute force, which ruins the reason for the test

Makes sense, decreased k again to 60 👍

ChrisHegarty

LGTM

…ults (apache#13361) Considering that the graphs of 2 indices are organized differently we need to explore a lot of candidates to ensure that both searchers find the same docs. Increasing beamWidth (number of nearest neighbor candidates to track while searching the graph for each newly inserted node) from 5 to 10 fixes the test.

Increase number of explored candidates in HnswGraphTestCase.testSorte…

a6ce30b

…dAndUnsortedIndicesReturnSameResults.

Increase beam width

2af088e

Decrease k to 60

39f31a0

timgrein mentioned this pull request May 17, 2024

Reproducible failure TestHnswByteVectorGraph.testSortedAndUnsortedIndicesReturnSameResults #13380

Closed

ChrisHegarty approved these changes May 17, 2024

View reviewed changes

ChrisHegarty merged commit 24fd426 into apache:main May 17, 2024
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix TestHnswByteVectorGraph.testSortedAndUnsortedIndicesReturnSameResults #13361

Fix TestHnswByteVectorGraph.testSortedAndUnsortedIndicesReturnSameResults #13361

timgrein commented May 12, 2024 •

edited

Loading

benwtrent commented May 14, 2024

timgrein commented May 14, 2024 •

edited

Loading

benwtrent commented May 14, 2024

timgrein commented May 14, 2024 •

edited

Loading

benwtrent commented May 14, 2024

timgrein commented May 15, 2024

ChrisHegarty left a comment

Fix TestHnswByteVectorGraph.testSortedAndUnsortedIndicesReturnSameResults #13361

Fix TestHnswByteVectorGraph.testSortedAndUnsortedIndicesReturnSameResults #13361

Conversation

timgrein commented May 12, 2024 • edited Loading

Closes #13210

Description

benwtrent commented May 14, 2024

timgrein commented May 14, 2024 • edited Loading

benwtrent commented May 14, 2024

timgrein commented May 14, 2024 • edited Loading

benwtrent commented May 14, 2024

timgrein commented May 15, 2024

ChrisHegarty left a comment

Choose a reason for hiding this comment

timgrein commented May 12, 2024 •

edited

Loading

timgrein commented May 14, 2024 •

edited

Loading

timgrein commented May 14, 2024 •

edited

Loading