[FEA] Optimize use of main memory for ANN indexes #1708

cjnolet · 2023-08-01T19:48:10Z

We need to find a good approach to efficiently performing queries on architectures like G&H. Ideally, we would be able to train and keep very large indices in memory but have smart ways of caching only what we need on the GPU such that we can increase cached hits as much as possible and reduce the number of transfers to/from the GPU.

I'm creating this issue so we can track discussions and progress on this feature.

cjnolet added feature request New feature or request Vector Search labels Aug 1, 2023

cjnolet assigned benfred Aug 1, 2023

cjnolet changed the title ~~[FEA] Optimize use of main memory for ANN queries~~ [FEA] Optimize use of main memory for ANN indexes Aug 2, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] Optimize use of main memory for ANN indexes #1708

[FEA] Optimize use of main memory for ANN indexes #1708

cjnolet commented Aug 1, 2023

[FEA] Optimize use of main memory for ANN indexes #1708

[FEA] Optimize use of main memory for ANN indexes #1708

Comments

cjnolet commented Aug 1, 2023