[Model] Support nested structures for TensorSchema #26212

DarkLight1337 · 2025-10-04T06:22:55Z

Purpose

This helps simplify the implementation of HyperCLOVAX-SEED-Vision and enable merge_by_field_config=True for that model.

Other changes:

Improve the name of the field in Phi3VImagePixelInputs
Fix invalid import for GLM4.1V caused by [Multi Modal] Configurable MM Profiling #25631

cc @bbeckca

Test Plan

Test Result

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

gemini-code-assist

Code Review

This pull request introduces support for nested tensor structures within TensorSchema, a significant enhancement for managing complex multimodal inputs. The refactoring in hyperclovax_vision.py to adopt this new schema notably improves code clarity and maintainability. I have identified two critical issues in the _call_hf_processor method where video input data structures are incorrectly constructed, which would result in runtime errors. My review provides specific suggestions to rectify these problems. The other modifications, including the core logic updates in tensor_schema.py and the corresponding test adjustments, appear to be well-implemented.

vllm/model_executor/models/hyperclovax_vision.py

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

vllm/model_executor/models/hyperclovax_vision.py

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

DarkLight1337 · 2025-10-04T06:28:32Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces support for nested structures in TensorSchema, which simplifies the implementation for models like HyperCLOVAX-SEED-Vision. The changes include refactoring hyperclovax_vision.py to use the new TensorSchema capabilities, renaming a field in Phi3VImagePixelInputs for clarity, and fixing a minor bug in glm4_1v.py. The refactoring in hyperclovax_vision.py is substantial and improves code clarity and maintainability. I have one suggestion to further improve the maintainability of the parsing logic for multimodal inputs.

vllm/model_executor/models/hyperclovax_vision.py

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

Isotr0py

LGTM!

bbeckca · 2025-10-04T18:14:59Z

Looks good. Thanks for improving @DarkLight1337!

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Tomer Asida <57313761+tomeras91@users.noreply.github.com>

[Model] Support nested structures for TensorSchema

f9bc726

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

DarkLight1337 requested review from Isotr0py and ywang96 October 4, 2025 06:22

DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 4, 2025

gemini-code-assist bot reviewed Oct 4, 2025

View reviewed changes

vllm/model_executor/models/hyperclovax_vision.py Show resolved Hide resolved

vllm/model_executor/models/hyperclovax_vision.py Outdated Show resolved Hide resolved

chatgpt-codex-connector bot reviewed Oct 4, 2025

View reviewed changes

vllm/model_executor/models/hyperclovax_vision.py Outdated Show resolved Hide resolved

vllm/model_executor/models/hyperclovax_vision.py Show resolved Hide resolved

Address comments

beb0b1b

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

gemini-code-assist bot reviewed Oct 4, 2025

View reviewed changes

vllm/model_executor/models/hyperclovax_vision.py Show resolved Hide resolved

Simplify

dde5433

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

Isotr0py approved these changes Oct 4, 2025

View reviewed changes

vllm-bot merged commit 44ea851 into vllm-project:main Oct 4, 2025
51 of 53 checks passed

DarkLight1337 deleted the tensor-schema-nested branch October 4, 2025 08:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Model] Support nested structures for TensorSchema #26212

[Model] Support nested structures for TensorSchema #26212

DarkLight1337 commented Oct 4, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

Uh oh!

Uh oh!

DarkLight1337 commented Oct 4, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Isotr0py left a comment

Uh oh!

Uh oh!

bbeckca commented Oct 4, 2025

Uh oh!

Uh oh!

Uh oh!

[Model] Support nested structures for TensorSchema #26212

[Model] Support nested structures for TensorSchema #26212

Conversation

DarkLight1337 commented Oct 4, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

DarkLight1337 commented Oct 4, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Isotr0py left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

bbeckca commented Oct 4, 2025

Uh oh!

Uh oh!

DarkLight1337 commented Oct 4, 2025 •

edited by github-actions bot

Loading