Multimodal search by DimasfromLavoisier · Pull Request #276 · ssciwr/AMMICO

DimasfromLavoisier · 2025-12-15T16:49:05Z

This is a PR for a new version of the multimodal search module. A few things are left before the final version:

1. Tests
~~2. A method for multi-query search~~
~~3. Small fixes according to Copilot and other AI reviews~~
~~4. Update demo notebook~~

Copilot

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

Copilot

Pull request overview

Copilot reviewed 6 out of 6 changed files in this pull request and generated 4 comments.

ammico/multimodal_search.py

sonarqubecloud · 2025-12-17T16:14:28Z

Quality Gate passed

Issues
10 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

iulusoy

I'm having some issues with GPU memory. Is it possible that there is a memory leak in the image encoding when building the FAISS index?
To reproduce the issue, you could occupy most of your local GPU memory with another process, so that less memory is available to ammico, or use much more data.
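
An alternative way to shrink the memory available to ammico, without a second process, is to cap PyTorch's allocator for the current process. The following is only a sketch: the helper names `memory_fraction` and `cap_gpu_memory` are made up for illustration and are not part of ammico, and the cap applies to PyTorch's caching allocator only, not to other CUDA memory.

```python
def memory_fraction(cap_mib: float, total_mib: float) -> float:
    """Return the fraction of total device memory that matches a cap in MiB."""
    if cap_mib <= 0 or total_mib <= 0:
        raise ValueError("cap_mib and total_mib must be positive")
    # Never ask for more than the whole device.
    return min(1.0, cap_mib / total_mib)


def cap_gpu_memory(cap_mib: float, device: int = 0) -> float:
    """Limit this process's PyTorch allocations on `device` to about cap_mib."""
    import torch  # imported lazily so memory_fraction works without torch

    # total_memory is reported in bytes; convert to MiB.
    total_mib = torch.cuda.get_device_properties(device).total_memory / 2**20
    fraction = memory_fraction(cap_mib, total_mib)
    torch.cuda.set_per_process_memory_fraction(fraction, device)
    return fraction
```

Calling `cap_gpu_memory(2048)` before running the test should make allocations beyond roughly 2 GiB raise an out-of-memory error, which may help reproduce the failure on a larger GPU.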

iulusoy · 2025-12-18T11:18:57Z

ammico/test/test_multimodal_search.py

+
+@pytest.mark.long
+def test_multimodal_search_combined_query(get_path):
+    model = MultimodalEmbeddingsModel()


here my GPU runs out of memory and it fails with

torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 464.00 MiB. GPU 0 has a total capacity of 5.55 GiB of which 337.88 MiB is free. Including non-PyTorch memory, this process has 5.20 GiB memory in use. Of the allocated memory 4.57 GiB is allocated by PyTorch, and 553.08 MiB is reserved by PyTorch but unallocated. If reserved but unallocated memory is large try setting PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True to avoid fragmentation. See documentation for Memory Management (https://pytorch.org/docs/stable/notes/cuda.html#environment-variables)
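
The error message suggests setting `PYTORCH_CUDA_ALLOC_CONF=expandable_segments:True`. A minimal sketch of doing this from Python, assuming it runs before PyTorch initializes CUDA (safest before `import torch`); the helper name `enable_expandable_segments` is hypothetical. This only addresses fragmentation and would not fix a genuine shortage of memory.

```python
import os
from collections.abc import MutableMapping


def enable_expandable_segments(env: MutableMapping[str, str] = os.environ) -> str:
    """Set expandable_segments:True in PYTORCH_CUDA_ALLOC_CONF, keeping other options."""
    key = "PYTORCH_CUDA_ALLOC_CONF"
    # The variable holds comma-separated option:value pairs.
    options = [
        opt
        for opt in env.get(key, "").split(",")
        if opt and not opt.startswith("expandable_segments:")
    ]
    options.append("expandable_segments:True")
    env[key] = ",".join(options)
    return env[key]
```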

Maybe we can add a try/except block here to catch this error and fall back to the CPU?
The problem starts in

ammico/multimodal_search.py:237: in index_images
    embeddings = self.model.encode_image(
../../../miniforge3/envs/ammico/lib/python3.13/site-packages/torch/utils/_contextlib.py:120: in decorate_context
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
ammico/model.py:285: in encode_image
    embeddings = self.model.encode(
../../../miniforge3/envs/ammico/lib/python3.13/site-packages/torch/utils/_contextlib.py:120: in decorate_context
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
../../../miniforge3/envs/ammico/lib/python3.13/site-packages/sentence_transformers/SentenceTransformer.py:1094: in encode
    out_features = self.forward(features, **kwargs)
...

If I set

def test_multimodal_search_combined_query(get_path):
    model = MultimodalEmbeddingsModel(device="cpu")
    mms = MultimodalSearch(model=model)

the test runs fine.
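
The try/except fallback suggested above could look roughly like the sketch below. It is not the PR's implementation: `run_with_cpu_fallback` and its parameters are hypothetical, and the exception types are passed in by the caller so the helper itself does not depend on torch.

```python
from typing import Callable, Tuple, Type, TypeVar

T = TypeVar("T")


def run_with_cpu_fallback(
    run: Callable[[str], T],
    oom_errors: Tuple[Type[BaseException], ...],
    device: str = "cuda",
    on_oom: Callable[[], None] = lambda: None,
) -> Tuple[T, str]:
    """Call run(device); on an out-of-memory error, retry once on the CPU."""
    try:
        return run(device), device
    except oom_errors:
        if device == "cpu":
            raise  # already on the CPU, nothing to fall back to
    # Retry outside the except block so the failed attempt's traceback
    # (and any GPU tensors it references) can be released first.
    on_oom()
    return run("cpu"), "cpu"


# Hypothetical usage with torch (build_and_index is an assumed helper that
# creates MultimodalEmbeddingsModel(device=dev) and indexes the images):
# result, used = run_with_cpu_fallback(
#     build_and_index,
#     oom_errors=(torch.OutOfMemoryError,),
#     on_oom=torch.cuda.empty_cache,
# )
```

Returning the device that was actually used lets the caller log or warn about the fallback, since CPU encoding may be much slower.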

iulusoy · 2025-12-18T11:24:33Z

ammico/notebooks/DemoNotebook_ammico.ipynb

+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "multim_s_model.index_images(\n",


The kernel also crashes here; I assume it is the same memory problem. I suspect a memory leak, with torch possibly not releasing memory when it should. Otherwise, why does memory use accumulate so much during the run? (The first few encodings are usually fine.)
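
Whether there is an actual leak is not established in this thread. One common cause of growing GPU memory use is keeping every batch's embeddings on the GPU until the end. A sketch of encoding in small batches, assuming a caller-supplied `encode_batch` that returns host-side (CPU) results; `encode_in_batches` and `after_batch` are hypothetical names, not ammico's API.

```python
from typing import Callable, List, Optional, Sequence, TypeVar

Item = TypeVar("Item")
Emb = TypeVar("Emb")


def encode_in_batches(
    items: Sequence[Item],
    encode_batch: Callable[[Sequence[Item]], Sequence[Emb]],
    batch_size: int = 8,
    after_batch: Optional[Callable[[], None]] = None,
) -> List[Emb]:
    """Encode items batch by batch and collect the results in order."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    embeddings: List[Emb] = []
    for start in range(0, len(items), batch_size):
        batch = items[start : start + batch_size]
        # encode_batch is assumed to return CPU-side data, so no GPU
        # tensors stay referenced from the growing result list.
        embeddings.extend(encode_batch(batch))
        if after_batch is not None:
            after_batch()  # e.g. torch.cuda.empty_cache
    return embeddings
```

If memory still grows with a small `batch_size`, that would point more strongly toward a leak than toward peak batch size.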

iulusoy · 2025-12-18T11:26:53Z

Also, somehow the code coverage is not showing, I assume this is because the PR is opened from a branch.
Other than the memory issue, the implementation looks very good.

iulusoy

With the other faiss library version, I also could not get it to run locally. But since it runs fine on the CPU, I would postpone this to the testing stage and merge the PR now.

DimasfromLavoisier requested review from Copilot and iulusoy December 15, 2025 16:49

Copilot AI reviewed Dec 15, 2025

Copilot started reviewing on behalf of DimasfromLavoisier December 15, 2025 17:19

iulusoy and others added 5 commits December 16, 2025 16:06

fix: include audio model class in init

86abf73

fix: remove model from init, and reference model module in notebook instead

cb0a9fc

add multimodal search module

f3380eb

[pre-commit.ci] auto fixes from pre-commit.com hooks

9d0ccb3

for more information, see https://pre-commit.ci

add multi query support

4d07e68

DimasfromLavoisier force-pushed the multimodal_search branch 3 times, most recently from f56ea29 to 6781827 Compare December 16, 2025 16:05

small fixes for code improvement

bf22f6c

DimasfromLavoisier force-pushed the multimodal_search branch from 6781827 to bf22f6c Compare December 16, 2025 16:16

DimasfromLavoisier requested a review from Copilot December 16, 2025 16:18

Copilot AI reviewed Dec 16, 2025

4 review comments on ammico/multimodal_search.py (outdated, resolved)

small adjustments

79d900e

DimasfromLavoisier force-pushed the multimodal_search branch 3 times, most recently from 08228b3 to fcc6934 Compare December 17, 2025 15:14

upd notebook

814ba54

DimasfromLavoisier force-pushed the multimodal_search branch from fcc6934 to 814ba54 Compare December 17, 2025 15:34

add tests

68a2d50

iulusoy requested changes Dec 18, 2025

iulusoy approved these changes Dec 22, 2025

iulusoy merged commit a408b99 into ssciwr:main Jan 5, 2026
4 checks passed

iulusoy deleted the multimodal_search branch January 5, 2026 11:51

Multimodal search #276: iulusoy merged 9 commits into ssciwr:main from DimasfromLavoisier:multimodal_search

3 participants
