VectorDB and Judge Parallelism #55

luis-gasparschroeder · 2025-06-10T06:55:29Z

The LLMSimilarityEvaluator requires async execution to ensure performant e2e latency.

To achieve that, I multi-threaded the exploration logic. To ensure thread safety, the metadata objects and Vector DB ID generation maintain RW locks.

AlexCuadron

Quite nice PR! The biggest issue I found is the widespread use of RLock, even when sometimes its not needed and following the google style docstring.

vcache/vcache_core/cache/embedding_store/vector_db/strategies/faiss.py

vcache/vcache_core/cache/embedding_store/embedding_metadata_storage/strategies/in_memory.py

vcache/vcache_core/cache/embedding_store/vector_db/strategies/chroma.py

vcache/vcache_core/cache/cache.py

vcache/vcache_core/cache/embedding_store/vector_db/strategies/faiss.py

vcache/vcache_policy/strategies/dynamic_local_threshold.py

tests/unit/VectorDB/test_thread_safety.py

vcache/vcache_core/cache/embedding_store/vector_db/strategies/faiss.py

luis-gasparschroeder · 2025-06-11T20:59:04Z

Thank you for the comments @AlexCuadron. I fixed them.

AlexCuadron

Is this correct?

vcache/vcache_policy/strategies/dynamic_local_threshold.py

AlexCuadron

Quite nice PR, some threading potential errors, better to fix them now than later

AlexCuadron · 2025-06-12T10:22:25Z

vcache/vcache_core/cache/embedding_store/vector_db/strategies/chroma.py

+                self._init_vector_store(len(embedding))
+            if self.collection.count() == 0:
+                return []
+            k_ = min(k, self.collection.count())


self.index.ntotal could change between the check and usage if another thread adds/removes embeddings.

vcache/vcache_policy/strategies/dynamic_local_threshold.py

vcache/vcache_core/cache/embedding_store/embedding_metadata_storage/strategies/in_memory.py

vcache/vcache_policy/strategies/dynamic_local_threshold.py

AlexCuadron · 2025-06-12T10:24:31Z

vcache/vcache_core/cache/embedding_store/vector_db/strategies/faiss.py

+                self._init_vector_store(len(embedding))
+
+            # Atomic ID generation and assignment
+            embedding_id = self.__next_embedding_id


If add_with_ids() fails, __next_embedding_id is not incremented, but the ID was already assigned and returned, causing ID reuse.

AlexCuadron · 2025-06-12T10:26:03Z

vcache/vcache_core/cache/embedding_store/embedding_metadata_storage/strategies/in_memory.py

+            embedding_id (int): The ID of the embedding to update.
+            observation (Tuple[float, int]): The observation tuple (similarity, label).
+        """
+        entry_lock = self._get_entry_lock(embedding_id)


Lock ordering violation. If another thread holds _store_lock and tries to get entry_lock, deadlock occurs. with entry lock should already fix this I think (?)

vcache/vcache_policy/strategies/dynamic_local_threshold.py

luis-gasparschroeder · 2025-06-13T14:34:59Z

This async logic is too complicated. We have a better logic proposal in #65. I close this PR.

luis-gasparschroeder and others added 2 commits June 9, 2025 22:10

Added async logic

9a324d6

Implemented locking logic for vector db's

25fbe49

luis-gasparschroeder self-assigned this Jun 10, 2025

luis-gasparschroeder linked an issue Jun 10, 2025 that may be closed by this pull request

Async Re-Evaluation #43

Closed

5 tasks

AlexCuadron and others added 4 commits June 10, 2025 00:21

Commented out thread executor

060b449

Implemented thread logic

6566e29

Formatting

6ab0dcf

Implemented locking test for metadata

3260a52

luis-gasparschroeder requested a review from AlexCuadron June 10, 2025 18:48

luis-gasparschroeder marked this pull request as ready for review June 10, 2025 18:50

AlexCuadron mentioned this pull request Jun 11, 2025

Code Review Comments for Parallelism Implementation #62

Closed

AlexCuadron requested changes Jun 11, 2025

View reviewed changes

luis-gasparschroeder mentioned this pull request Jun 11, 2025

Support/test multiple Vector DBs #56

Open

4 tasks

luis-gasparschroeder added 3 commits June 11, 2025 22:45

Adjusted function and class comments to Google docstring guidelines

b7b2378

Fixed return type

4a584d9

Removed brittle assertion

c114b73

Made unit tests runnable by adding unique names

3dcdb05

AlexCuadron self-requested a review June 12, 2025 10:10

AlexCuadron reviewed Jun 12, 2025

View reviewed changes

vcache/vcache_policy/strategies/dynamic_local_threshold.py Show resolved Hide resolved

AlexCuadron requested changes Jun 12, 2025

View reviewed changes

Added e2e threading test

959815e

luis-gasparschroeder closed this Jun 13, 2025

luis-gasparschroeder removed a link to an issue Jun 13, 2025

Async Re-Evaluation #43

Closed

5 tasks

VectorDB and Judge Parallelism #55

VectorDB and Judge Parallelism #55

Uh oh!

Conversation

luis-gasparschroeder commented Jun 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AlexCuadron left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

luis-gasparschroeder commented Jun 11, 2025

Uh oh!

AlexCuadron left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

AlexCuadron left a comment

Choose a reason for hiding this comment

Uh oh!

AlexCuadron Jun 12, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

AlexCuadron Jun 12, 2025

Choose a reason for hiding this comment

Uh oh!

AlexCuadron Jun 12, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

luis-gasparschroeder commented Jun 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

luis-gasparschroeder commented Jun 10, 2025 •

edited

Loading