Question/Feature: Does "view" allow adding new items directly to disk? #97

leoplusx · 2023-06-09T15:37:26Z

Describe what you are looking for

AFAIK, "view" lets us memory-map an index file from disk. This way, we can use an index that doesn't fit into RAM.

I was just wondering if that will also work for adding items to an index.

If so, what is the process?

1. Instantiate the index
2. index.save()
3. index.view() from file
4. index.add()

If that does work, is it necessary to call index.save() again at any point, or will each index.add() operation directly write to disk?

If memory mapping does not work for adding items, then we will always need a machine with enough RAM to hold the entire index at least for the creation of that index, or for any adding operation. Is that correct?

Thanks.

Can you contribute to the implementation?

I can contribute

Is your feature request specific to a certain interface?

Python bindings

Contact Details

No response

Is there an existing issue for this?

I have searched the existing issues

Code of Conduct

I agree to follow this project's Code of Conduct

ashvardanian · 2023-06-09T17:07:45Z

For now, it's not supported, but it's two minor releases away. It won't be done through add; it will instead use the upcoming merge feature (#84).
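For context, here is a minimal sketch of how a read-only memory mapping behaves in general. It uses Python's standard mmap module, not USearch itself; whether view maps the file in exactly this way is an assumption, and the file name index.bin is hypothetical. It only shows that a read-only mapping serves reads straight from the file and rejects writes.

```python
import mmap
import os
import tempfile

# Hypothetical stand-in for a saved index file: 16 zero bytes on disk.
path = os.path.join(tempfile.mkdtemp(), "index.bin")
with open(path, "wb") as f:
    f.write(b"\x00" * 16)

# Map the file read-only; pages are pulled in by the OS on demand,
# so the whole file never has to be copied into process memory.
with open(path, "rb") as f, mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as view:
    first_byte = view[0]          # reading through the mapping works
    size_through_view = len(view)
    try:
        view[0] = 1               # writing through a read-only mapping does not
        write_rejected = False
    except TypeError:
        write_rejected = True

print(first_byte, size_through_view, write_rejected)
```

A writable mapping (mmap.ACCESS_WRITE) exists too, but growing a file through it needs extra care, which is one reason an append-style operation is harder than a read-only view.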

leoplusx · 2023-06-12T07:07:18Z

Let me see if I understand how it would work:

Let's say I have a machine with 128 GB RAM and 300 GB of index data - so more data than would fit into RAM.

It sounds as though I could then assemble such an index like this:

1. Create sub-indices:
   - 100 GB -> index1 (create in RAM, then write to disk)
   - 100 GB -> index2 (create in RAM, then write to disk)
   - 100 GB -> index3 (create in RAM, then write to disk)
2. Use merge to merge those indices on disk into one large index on disk, without loading any of them into RAM.
3. Use view to search that large index, without loading it into RAM.

Is that how it would work?

ashvardanian · 2023-06-12T08:08:58Z

Yes, you are right
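The workflow confirmed above can be sketched with a toy stand-in. This is not the USearch API and not how its merge feature works: the "index" here is a flat file of key/vector records, merging is plain chunked concatenation, and search is brute force over a read-only memory map. All names (write_shard, merge_shards, search_view, DIM, RECORD) are hypothetical and only illustrate the build-shards, merge-on-disk, view-to-search sequence.

```python
import heapq
import mmap
import os
import shutil
import struct
import tempfile

DIM = 4                               # hypothetical vector dimensionality
RECORD = struct.Struct(f"<q{DIM}f")   # one record: int64 key + DIM float32 values


def write_shard(path, items):
    """Step 1: build a shard in RAM (here just a list) and write it to disk."""
    with open(path, "wb") as f:
        for key, vector in items:
            f.write(RECORD.pack(key, *vector))


def merge_shards(shard_paths, merged_path):
    """Step 2: concatenate shards on disk in 1 MiB chunks, never holding a whole shard in RAM."""
    with open(merged_path, "wb") as out:
        for path in shard_paths:
            with open(path, "rb") as src:
                shutil.copyfileobj(src, out, length=1 << 20)


def search_view(path, query, count):
    """Step 3: memory-map the merged file read-only and brute-force the nearest keys."""
    with open(path, "rb") as f, mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as view:
        def scored():
            for offset in range(0, len(view), RECORD.size):
                key, *vector = RECORD.unpack_from(view, offset)
                # Squared Euclidean distance between the stored vector and the query.
                yield sum((a - b) ** 2 for a, b in zip(vector, query)), key
        return [key for _, key in heapq.nsmallest(count, scored())]


workdir = tempfile.mkdtemp()
shards = []
for shard_id in range(3):
    path = os.path.join(workdir, f"index{shard_id + 1}.bin")
    # Keys 0-4, 10-14, 20-24; each vector repeats its key DIM times.
    items = [(shard_id * 10 + i, [float(shard_id * 10 + i)] * DIM) for i in range(5)]
    write_shard(path, items)
    shards.append(path)

merged = os.path.join(workdir, "merged.bin")
merge_shards(shards, merged)
nearest = search_view(merged, [11.2] * DIM, 2)
print(nearest)
```

In this toy, peak memory during the merge is bounded by the copy chunk size rather than by shard size, which is the property the workflow relies on. Merging real graph-based indices is more involved than concatenation.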

leoplusx added the "enhancement" (New feature or request) label Jun 9, 2023

ashvardanian closed this as completed Jul 21, 2023
