Skip to content

allow providing vlm captions with surrounding page text#1389

Merged
edknv merged 7 commits intoNVIDIA:mainfrom
edknv:edwardk/vlm-caption-context-text
Feb 24, 2026
Merged

allow providing vlm captions with surrounding page text#1389
edknv merged 7 commits intoNVIDIA:mainfrom
edknv:edwardk/vlm-caption-context-text

Conversation

@edknv
Copy link
Collaborator

@edknv edknv commented Feb 10, 2026

Description

This PR adds context_text_max_chars parameter to allow enriching VLM image captions with surrounding page text. When enabled, each image's caption prompt is prepended with nearby text, improving retrieval accuracy for documents where images and surrounding text are semantically linked.

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.
  • If adjusting docker-compose.yaml environment variables have you ensured those are mimicked in the Helm values.yaml file.

@edknv edknv marked this pull request as ready for review February 23, 2026 21:25
@edknv edknv requested a review from a team as a code owner February 23, 2026 21:25
@edknv edknv merged commit 9e99bb9 into NVIDIA:main Feb 24, 2026
10 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants