Adding multi_specializations_frames by mohiso22 · Pull Request #909 · quic/efficient-transformers

mohiso22 · 2026-04-07T04:19:45Z

Adds multi-specialization support for Qwen2.5-VL and Qwen3-VL, enabling the vision encoder to be compiled with multiple resolution configurations (height/width/num_frames as lists) in one shot.
Introduces a dedicated _generate_multi_frame_specialization inference path that selects the right specialization at runtime, along with example scripts for both model families.
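
As an illustration of what selecting a specialization at runtime could look like, here is a minimal sketch. It is not the code from this PR: VisionSpec, build_specs and select_spec are hypothetical names, and pairing the height/width/num_frames lists by index (rather than, say, taking their cross product) is an assumption.

```python
from typing import List, NamedTuple


class VisionSpec(NamedTuple):
    """One compiled vision-encoder configuration (hypothetical container)."""

    height: int
    width: int
    num_frames: int


def build_specs(heights: List[int], widths: List[int], num_frames: List[int]) -> List[VisionSpec]:
    """Pair the three lists by index, one spec per position (assumed pairing rule)."""
    if not (len(heights) == len(widths) == len(num_frames)):
        raise ValueError("height, width and num_frames lists must have the same length")
    return [VisionSpec(h, w, f) for h, w, f in zip(heights, widths, num_frames)]


def select_spec(specs: List[VisionSpec], height: int, width: int, num_frames: int) -> VisionSpec:
    """Return the compiled spec that exactly matches the incoming input shape."""
    wanted = VisionSpec(height, width, num_frames)
    for spec in specs:
        if spec == wanted:
            return spec
    raise ValueError(f"no compiled specialization matches {wanted}; available: {specs}")
```

Failing loudly on an unmatched shape keeps a request from silently running against a specialization compiled for a different resolution.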

vbaddi · 2026-04-07T09:13:55Z

@mohiso22 can you please fix the quickcheck?

quic-rishinr · 2026-04-08T11:05:56Z


+    def _generate_multi_frame_specialization(
+        self,
+        inputs: List[str] = None,


nit: update the type annotation for inputs to inputs: Dict[str, torch.Tensor]
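
A small sketch of the annotation this comment asks for. describe_inputs is a hypothetical stand-in, not the method from the diff; the string form "torch.Tensor" is used only so the snippet runs without torch installed. Wrapping the type in Optional is an extra suggestion here, since the parameter defaults to None.

```python
from typing import Dict, List, Optional


def describe_inputs(inputs: Optional[Dict[str, "torch.Tensor"]] = None) -> List[str]:
    """Return the input names in sorted order (illustrative body only).

    The annotation says what the function receives: a mapping from input
    name to tensor, not a list of strings.
    """
    if inputs is None:
        return []
    return sorted(inputs)
```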

quic-rishinr · 2026-04-10T11:00:10Z

-            }
-        ]
+            else:
+                assert vision_size * f < user_vision_size, (


nit: better to raise an exception instead of using assert.

quic-rishinr · 2026-04-10T11:00:16Z

+            grid_height = grid_height * time * batch_size
+            if not user_vision_size:
+                max_vision_size = max(max_vision_size, vision_size * f)
+                assert max_vision_size < ctx_len, (


nit: better to raise an exception instead of using assert.
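
One way to act on the two assert comments above, as a hedged sketch: assert statements are skipped when Python runs with -O, so a size check that must always hold is safer as an explicit raise. check_vision_size is a hypothetical helper, and the error message wording is invented because the original messages are cut off in the hunks.

```python
def check_vision_size(vision_size: int, num_frames: int, limit: int, limit_name: str) -> int:
    """Return vision_size * num_frames, raising if it is not strictly below limit.

    Keeps the strict '<' comparison used by the asserts in the diff.
    """
    total = vision_size * num_frames
    if not total < limit:
        raise ValueError(
            f"vision size {vision_size} * {num_frames} frames = {total} "
            f"must be smaller than {limit_name} ({limit})"
        )
    return total


# Usage: the same helper covers both checks, against user_vision_size or ctx_len.
max_vision_size = 0
max_vision_size = max(max_vision_size, check_vision_size(100, 4, 1024, "ctx_len"))
```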

quic-rishinr · 2026-04-10T11:01:11Z

            return self._generate_regular_batching(vision_prompts, generation_len, stream, **kwargs)

+    def run_prefill_multi_frame_specialization(
+        self, inputs: Optional[torch.Tensor], num_frames: Optional[int] = 1, generation_len: int = None


Please add a docstring for the method.
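
A possible shape for the requested docstring, offered as a template only. The signature is taken from the hunk above (with generation_len widened to Optional, since it defaults to None); the class name is hypothetical, the parameter descriptions are assumptions about intent, and the return value is left open because it is not visible in the excerpt.

```python
from typing import Optional


class VisionPrefillSketch:
    """Hypothetical holder class; only the method signature comes from the diff."""

    def run_prefill_multi_frame_specialization(
        self,
        inputs: Optional["torch.Tensor"],
        num_frames: Optional[int] = 1,
        generation_len: Optional[int] = None,
    ):
        """Run prefill for a multi-frame vision specialization.

        Args:
            inputs: Preprocessed model inputs for the prompt (assumed meaning).
            num_frames: Number of frames in the input. Defaults to 1.
            generation_len: Optional cap on generated tokens (assumed meaning).

        Returns:
            To be filled in by the author; not visible in the quoted hunk.
        """
        # Body is elided in the review excerpt, so this sketch does not guess it.
        raise NotImplementedError("body elided in the review excerpt")
```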

quic-rishinr · 2026-04-10T11:04:25Z

+        self._session.deactivate()
+        self._vision_session.activate()
+
+        if not num_frames:


nit: better to make the check explicit, e.g. if num_frames == 0: num_frames = 1, and add a warning, if that was the intent. Since the default value is already 1, ideally we should not need this condition unless None is being passed somewhere.
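
The suggestion above could be sketched as follows. normalize_num_frames is a hypothetical helper, and using the standard warnings module is an assumption; the project may prefer its own logger. The point is that "not num_frames" treats 0 and None alike without saying so, while the explicit form names both cases and reports the fallback.

```python
import warnings
from typing import Optional


def normalize_num_frames(num_frames: Optional[int] = 1) -> int:
    """Return a usable frame count, warning when falling back to 1."""
    if num_frames is None or num_frames == 0:
        # Explicit about which values trigger the fallback, and visible to the caller.
        warnings.warn(f"num_frames={num_frames!r} is not usable; falling back to 1", stacklevel=2)
        return 1
    return num_frames
```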

Signed-off-by: Mohit Soni <mohisoni@qti.qualcomm.com>

quic-rishinr · 2026-04-10T11:15:02Z

Merging the PR into release/v1.21.6. @mohiso22 please raise a new PR on mainline with the couple of changes requested.

…models as per reference from quic#909. Signed-off-by: Dhiraj Kumar Sah <dhirajku@qti.qualcomm.com>

mohiso22 requested review from quic-rishinr and vbaddi April 7, 2026 04:20

vbaddi assigned mohiso22 Apr 7, 2026

mohiso22 force-pushed the qwen_multi branch from bdf55f7 to c1038d4 April 7, 2026 11:30

vbaddi reviewed Apr 7, 2026


Comment thread examples/image_text_to_text/models/qwen2_5_vl/multi_specialization_inference.py

quic-rishinr requested changes Apr 8, 2026


quic-sanising mentioned this pull request Apr 8, 2026: Multi Resolution Support for Qwen2.5-VL Model #875 (Closed, 4 tasks)

mohiso22 marked this pull request as ready for review April 9, 2026 14:27

quic-rishinr requested changes Apr 10, 2026


quic-rishinr changed the base branch from main to release/v1.21.6 April 10, 2026 11:13

Mohit Soni added 10 commits April 10, 2026 16:44

Adding multi_specialization

387e6fa

Signed-off-by: Mohit Soni <mohisoni@qti.qualcomm.com>

Adding qwen3vl multi specialization

55375aa

Signed-off-by: Mohit Soni <mohisoni@qti.qualcomm.com>

Adding qwen3_vl_moe_multispecs

eb141c0

Signed-off-by: Mohit Soni <mohisoni@qti.qualcomm.com>

Renaming Folder Name

cfc9adf

Signed-off-by: Mohit Soni <mohisoni@qti.qualcomm.com>

Comments Addressed

d7c4a82

Signed-off-by: Mohit Soni <mohisoni@qti.qualcomm.com>

Minor fix

ddb3934

Signed-off-by: Mohit Soni <mohisoni@qti.qualcomm.com>

Minor Fixes

6113355

Signed-off-by: Mohit Soni <mohisoni@qti.qualcomm.com>

qwen3vl_moe_changes

9da2c79

Signed-off-by: Mohit Soni <mohisoni@qti.qualcomm.com>

Adding quickcheck

c90cacf

Signed-off-by: Mohit Soni <mohisoni@qti.qualcomm.com>

Adding qwen-vl-utils in project.toml

d50cf6d

Signed-off-by: Mohit Soni <mohisoni@qti.qualcomm.com>

quic-rishinr force-pushed the qwen_multi branch from ccaac2c to d50cf6d April 10, 2026 11:14

quic-rishinr merged commit adc4c18 into quic:release/v1.21.6 Apr 10, 2026
4 checks passed

quic-rishinr merged 10 commits into quic:release/v1.21.6 from mohiso22:qwen_multi
