Build the trees in parallel #32

Kerollmops · 2023-12-02T14:55:15Z

This PR plans to build the trees in parallel and store the tree nodes in files before storing them in the database using the write transactions.

However, the current implementation needs to be fixed. We are creating read transactions to be able to read from different threads in parallel, but those transactions can only read the committed changes, which means that it cannot see the non-committed user items/vectors.

After some research, I found a clean solution to the original problem. As it is valid to keep pointers to the entries' data safely (if not using fancy LMDB features like encryption), I created the ImmutableLeafs data structure that lists the leaf nodes and keeps pointers to them, all of that from the current RwTxn. We, therefore, no longer need to commit the user items transaction before being able to build the trees.

TODO

Check and fix the tests.
Compute the n_trees by ourselves.
Update the README.
Use the right TMPDIR variable.
Document the ImmutableLeafs and specifically the safety of it (because it is).
Rename the build_in_parallel method into the build one.

Kerollmops · 2023-12-06T20:40:24Z

I did some experiments to compare the speed and size of the database with Spotify/Annoy. The results are very good for arroy. We are always faster and the database is always smaller.

That's probably related to the high number of mutex and synchronisation needed to store tree nodes in the Annoy database format where Meilisearch/arroy only requires a single atomic sequential number.

On the other hand, the size of the database probably differs from the fact that we store lists of integers in RoaringBitmaps instead of in uncompressed lists as Annoy does.

When Meilisearch/arroy takes 52s to index 95832 vectors of 678 dimensions in 200 trees by using 12 threads, Spotify/Annoy takes 69s. And when we force both library to use a single thread Arroy takes 225s when Annoy takes 338s. Even if we are using LMDB and documents are stored in a B-Tree that supports atomic operations.

Kerollmops force-pushed the parallel-building branch 5 times, most recently from 6aedb57 to b6a66ca Compare December 3, 2023 21:54

Kerollmops mentioned this pull request Dec 3, 2023

Rayon everything #12

Closed

Kerollmops added this to the v0.2.0 milestone Dec 4, 2023

Kerollmops force-pushed the parallel-building branch 5 times, most recently from dec0e40 to 840cb30 Compare December 6, 2023 09:50

Kerollmops added 18 commits December 6, 2023 11:19

Prefer shuffling once than running a lot of randoms

52cee33

Create the shape of building the trees in parallel

613827e

Finalize building the trees in parallel

66acc66

Make the import_movies work in parallel

ab88693

Fix an example

90eb3e4

Introduce the ImmutableLeafs concurrent struct

773ceba

Reduce the Rng constraints

0e4748c

Introduce the randomly_split_children function

b74d1cd

Make the number of trees a parameter

2ca49ea

Prefer not consuming the Rng

50c4b66

Document the new parallel build method

873aec7

Introduce the Writer::append_item method

d0412c0

Improve the import-vectors example

ff04c03

Add more logging to the Writer::build method

43fd92d

Document internal structs

42e97a6

Make it possible to build without specifying the trees

cbf4fb4

Do not store the leafs in memory

057a88f

Use more roaring bitmaps

3d19230

Kerollmops added 3 commits December 6, 2023 11:25

Add more logging and fix it

ba78f96

Introduce a new special example

3954c98

Fix the tree generator

3fab85f

Kerollmops force-pushed the parallel-building branch from 840cb30 to 44d7120 Compare December 6, 2023 10:35

Make it compile

fc402f0

Kerollmops force-pushed the parallel-building branch from 44d7120 to fc402f0 Compare December 6, 2023 10:40

Document the ImmutableSubsetLeafs methods and type

aea7970

Kerollmops force-pushed the parallel-building branch from 462ab1c to aea7970 Compare December 6, 2023 10:49

Kerollmops added 7 commits December 6, 2023 12:00

Use Relaxed instead of slow atomic operations

84971b0

Fix some stuff

4a47ec8

Move the TmpNodes codec on the type

5a58040

Update the README

15153d4

Introduce the set_tmpdir Writer function

30629f1

Use the tempfile_in feature when the tmpdir is defined

5706b3b

Fix the tests

6b91ae7

Kerollmops force-pushed the parallel-building branch from 901f4ac to 6b91ae7 Compare December 6, 2023 14:09

Kerollmops marked this pull request as ready for review December 6, 2023 14:09

Kerollmops merged commit 25eb41b into main Dec 6, 2023
5 checks passed

Kerollmops deleted the parallel-building branch December 6, 2023 14:11

Kerollmops mentioned this pull request Dec 6, 2023

Multi-thread the make_tree #8

Closed

This was referenced Dec 6, 2023

Spotify-Inspired: Elevating Meilisearch with Hybrid Search and Rust Kerollmops/blog#3

Open

Multithreading and Memory-Mapping: Refining ANN Performance with Arroy Kerollmops/blog#4

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Build the trees in parallel #32

Build the trees in parallel #32

Kerollmops commented Dec 2, 2023 •

edited

Kerollmops commented Dec 6, 2023 •

edited

Build the trees in parallel #32

Build the trees in parallel #32

Conversation

Kerollmops commented Dec 2, 2023 • edited

TODO

Kerollmops commented Dec 6, 2023 • edited

Kerollmops commented Dec 2, 2023 •

edited

Kerollmops commented Dec 6, 2023 •

edited