Skip to content

Conversation

@smiklosovic
Copy link
Contributor

No description provided.

jbellis pushed a commit to jbellis/cassandra that referenced this pull request Jun 26, 2024
…apache#1163)

* "all the keys between smallest and largest" is a terrible approximation of the keys in the search set that are actually present in the memtable; compute this actual subset (which is inexpensive to do) before either brute forcing or searching the index
* brute force (reading vectors from the sstable and sorting by distance, instead of going through ann index) is observed to be 3x-5x more expensive per row than index scan.  This incorporates that into the decision whether to use brute force, and makes the multiplier configurable; default is a more conservative 3x.
michaelsembwever pushed a commit to thelastpickle/cassandra that referenced this pull request Sep 25, 2024
…apache#1163)

* "all the keys between smallest and largest" is a terrible approximation of the keys in the search set that are actually present in the memtable; compute this actual subset (which is inexpensive to do) before either brute forcing or searching the index
* brute force (reading vectors from the sstable and sorting by distance, instead of going through ann index) is observed to be 3x-5x more expensive per row than index scan.  This incorporates that into the decision whether to use brute force, and makes the multiplier configurable; default is a more conservative 3x.
michaelsembwever pushed a commit to thelastpickle/cassandra that referenced this pull request Jan 7, 2026
…apache#1163)

* "all the keys between smallest and largest" is a terrible approximation of the keys in the search set that are actually present in the memtable; compute this actual subset (which is inexpensive to do) before either brute forcing or searching the index
* brute force (reading vectors from the sstable and sorting by distance, instead of going through ann index) is observed to be 3x-5x more expensive per row than index scan.  This incorporates that into the decision whether to use brute force, and makes the multiplier configurable; default is a more conservative 3x.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant