allow providing vlm captions with surrounding page text by edknv · Pull Request #1389 · NVIDIA/NeMo-Retriever

edknv · 2026-02-10T03:28:23Z

Description

This PR adds context_text_max_chars parameter to allow enriching VLM image captions with surrounding page text. When enabled, each image's caption prompt is prepended with nearby text, improving retrieval accuracy for documents where images and surrounding text are semantically linked.

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.
If adjusting docker-compose.yaml environment variables have you ensured those are mimicked in the Helm values.yaml file.

edknv and others added 6 commits February 9, 2026 19:25

allow providing vlm captions with surrounding page text

8f85a3a

lint

d768420

Merge branch 'main' into edwardk/vlm-caption-context-text

91dfafa

lint

8430ae0

Merge branch 'main' into edwardk/vlm-caption-context-text

3cd77e9

Merge branch 'main' into edwardk/vlm-caption-context-text

24436ae

edknv marked this pull request as ready for review February 23, 2026 21:25

edknv requested a review from a team as a code owner February 23, 2026 21:25

edknv requested review from ChrisJar, charlesbluca and jioffe502 February 23, 2026 21:25

jioffe502 approved these changes Feb 23, 2026

View reviewed changes

lint

6e48feb

edknv merged commit 9e99bb9 into NVIDIA:main Feb 24, 2026
10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

allow providing vlm captions with surrounding page text#1389

allow providing vlm captions with surrounding page text#1389
edknv merged 7 commits intoNVIDIA:mainfrom
edknv:edwardk/vlm-caption-context-text

edknv commented Feb 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

edknv commented Feb 10, 2026

Description

Checklist

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants