Skip to content

vb-17: fix cach eviction discrepancies#19

Open
nvzm123 wants to merge 9 commits into
SearchScale:benchmark-with-latest-cuvsfrom
nvzm123:vb-17__zackm
Open

vb-17: fix cach eviction discrepancies#19
nvzm123 wants to merge 9 commits into
SearchScale:benchmark-with-latest-cuvsfrom
nvzm123:vb-17__zackm

Conversation

@nvzm123
Copy link
Copy Markdown

@nvzm123 nvzm123 commented May 12, 2026

We weren't prewarming the data before running search which led to huge search-time discrepancies. This PR adds prewarming after we built our index and potentially evicted from our cache, we reread vectors from disk to prewarm cache.

E.g. with an index size of 10M 1536d 32fp (~58 GB) and a dynamic Java Heap that spikes above 70GB, depending on our Host RAM size, we were having to always spend the price to reload data from disk that was evicted when running g5.8x (128 GB Host RAM) and frequently also seeing the same on r8gd.8x (256 GB RAM). With this fix, those discrepancies disappear and we see the results we would have expected given the same benchmarks at 1M (where the Java Heap never spiked so large as to evict cached data).

@nvzm123
Copy link
Copy Markdown
Author

nvzm123 commented May 12, 2026

@cjnolet
Copy link
Copy Markdown

cjnolet commented May 13, 2026

@nvzm123 i would really like us to put effort into the cuVS-lucene backend for cuVS-bench, rather than continue to improve these scripts. Is that something you could take on? It's possible Claude might be able to assist with this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants