Conversation

@maedhroz
Contributor

No description provided.

SimpleCondition continuePreviewRepair = new SimpleCondition();
// this pauses the validation request sent from node1 to node2 until the inc repair below has run
cluster.filters()
.outbound()
Contributor Author

@maedhroz maedhroz Oct 13, 2020


Note: If we don't use outbound() explicitly here, we'll deadlock, with the single-threaded anti-entropy stage blocked waiting for the incremental repair propose message, which itself has to be processed on the same thread.
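The hazard described here is the generic single-threaded-stage deadlock: a task occupying a stage's only thread blocks on work that must run on that same stage. A minimal standalone sketch with plain `java.util.concurrent` (not the in-jvm dtest API; class and message names are illustrative) shows why the inner work can never complete:

```java
import java.util.concurrent.*;

public class SingleThreadStageDeadlock {
    // Returns true if the nested submit deadlocks (detected via a timeout).
    static boolean deadlocks() {
        // A single-threaded "stage", standing in for the anti-entropy stage.
        ExecutorService stage = Executors.newSingleThreadExecutor();
        try {
            Future<String> outer = stage.submit(() -> {
                // While holding the stage's only thread, wait for a message
                // that must itself be processed on this same stage.
                Future<String> inner = stage.submit(() -> "inc repair propose");
                return inner.get(); // blocks forever: no thread left to run inner
            });
            outer.get(1, TimeUnit.SECONDS);
            return false;
        } catch (TimeoutException e) {
            return true; // the inner task could never be scheduled
        } catch (InterruptedException | ExecutionException e) {
            return false;
        } finally {
            stage.shutdownNow();
        }
    }

    public static void main(String[] args) {
        System.out.println(deadlocks()
                ? "deadlocked: inner task can never run on the occupied stage"
                : "no deadlock");
    }
}
```

The timeout here stands in for the hang; the point is only that any wait taken while holding a single-threaded stage must not depend on work queued behind it.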

adelapena pushed a commit to adelapena/cassandra that referenced this pull request Oct 13, 2023
* Revert "Skip similarity score caching when using brute force (apache#769)"

This reverts commit ca7eea6.

* Revert " Compute similarity to query only once per indexed vector (per replica) (apache#723)"

This reverts commit 692491f.

* Add test that exposed score caching impl flaw

* Do not revert fix to VectorTester

Similarity score caching does not currently work when there are updates to vectors. The added test shows the issue. Conceptually, the problem materializes when we observe an earlier instance of a row with a close score and do not observe the row's later instance with a low score. The result is that some rows are ranked higher than they should be.

The problem stems from updates to vectors. Suppose we are searching for vector `v`, a row in sstable `a` has a vector close to `v`, and an update to that row in sstable `b` has a vector far from `v`. The graph search will only find the version of the row in `a`, so the score cache observes only `a`'s close score, and in `VectorTopKProcessor` we assume that cached score applies to the row we read from storage, even though it is out of date.

As far as I understand, we don't have a way to know which vector we scored against, which means we can't verify that the vector in `VectorTopKProcessor` (the one read from storage) is the same as the one from the index.
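The mis-ranking above can be reproduced in a toy Java sketch (plain maps and cosine similarity; the row names, vectors, and sstable labels are hypothetical, not SAI internals). The "cache" holds the score of the stale indexed vector, while the row read from storage carries a newer, farther vector:

```java
import java.util.*;

public class StaleScoreSketch {
    static double cosine(double[] x, double[] y) {
        double dot = 0, nx = 0, ny = 0;
        for (int i = 0; i < x.length; i++) {
            dot += x[i] * y[i];
            nx += x[i] * x[i];
            ny += y[i] * y[i];
        }
        return dot / Math.sqrt(nx * ny);
    }

    public static void main(String[] args) {
        double[] query = {1, 0};

        // Graph index built from sstable "a": row1's old vector is close to the query.
        Map<String, double[]> indexA = Map.of(
                "row1", new double[]{1, 0.1},
                "row2", new double[]{0.7, 0.7});
        // Latest data after the sstable "b" update: row1's current vector is far away.
        Map<String, double[]> storage = Map.of(
                "row1", new double[]{0, 1},
                "row2", new double[]{0.7, 0.7});

        for (String row : indexA.keySet()) {
            double cached = cosine(query, indexA.get(row));   // score the index observed
            double actual = cosine(query, storage.get(row));  // score of the row read from storage
            System.out.printf("%s cached=%.3f actual=%.3f%n", row, cached, actual);
        }
        // row1's cached score (~0.995) ranks it above row2 (~0.707), but its
        // actual score is 0: trusting the cache ranks the stale row too high.
    }
}
```

Without recording which vector produced each cached score, the consumer cannot detect that `row1`'s cached score and its stored vector disagree, which is exactly the verification gap described above.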
ekaterinadimitrova2 pushed a commit to ekaterinadimitrova2/cassandra that referenced this pull request Jun 3, 2024
michaelsembwever pushed a commit to thelastpickle/cassandra that referenced this pull request Jan 7, 2026