Support Deepstack in qwen3-omni by aireenmei · Pull Request #3214 · AI-Hypercomputer/maxtext

aireenmei · 2026-02-23T06:13:16Z

Description

Original work #2729

rebase and minor fixes
add unit test TestDeepstackProcess

Tests

qwen3_omni_layer_test.py pass locally
The pylint error is unrelated and will be fixed by #3219

Checklist

Before submitting this PR, please make sure (put X in square brackets):

I have performed a self-review of my code. For an optional AI review, add the gemini-review label.
I have necessary comments in my code, particularly in hard-to-understand areas.
I have run end-to-end tests tests and provided workload links above if applicable.
I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation.

codecov · 2026-02-23T06:23:26Z

Codecov Report

❌ Patch coverage is 11.76471% with 15 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/maxtext/layers/decoders.py	9.09%	9 Missing and 1 partial ⚠️
src/maxtext/models/models.py	25.00%	3 Missing ⚠️
src/maxtext/layers/encoders.py	0.00%	2 Missing ⚠️

📢 Thoughts on this report? Let us know!

github-actions · 2026-02-23T22:27:37Z

🤖 Hi @aireenmei, I've received your request, and I'm working on it now! You can track my progress in the logs for more details.

github-actions

## 📋 Review Summary

This Pull Request effectively implements the Deepstack visual embedding injection into the Qwen3 Omni model. The integration points in the Decoder and vision_encoder look mostly correct, and the vectorized implementation of _deepstack_process using jnp.cumsum is clever and highly efficient.

🔍 General Feedback

Efficiency Highlight: The use of boolean masks with cumsum for aligning visual embeddings with sequence positions is an excellent, TPU-friendly pattern that avoids slow dynamic slice operations.
A logic bug in the scan_layers check could cause the model to silently drop visual embeddings if image_masks are not explicitly provided by the caller but deepstack_visual_embeds are present.
A minor style improvement can be made by making _deepstack_process a static method, aligning with typical JAX pure-function idioms.

github-actions · 2026-02-23T23:00:44Z

🤖 Hi @aireenmei, I've received your request, and I'm working on it now! You can track my progress in the logs for more details.

github-actions

## 📋 Review Summary

This PR successfully integrates Deepstack visual embeddings into the decoder layers for the qwen3-omni models. The implementation effectively extends the vision encoder to output deep features and seamlessly processes them within the transformer decoder architecture.

🔍 General Feedback

The changes correctly decouple deep feature extraction from the main projection logic while adhering to existing layer configurations.
The unit tests added are comprehensive and correctly validate the scattering of visual tokens across the sequences based on the bidirectional mask.
A few minor inline suggestions are provided to handle edge cases relating to implicit type promotions and to prevent potential NaN propagation when computing masked visual embeddings.

hengtaoguo

LGTM, thank you Aireen, let's get this merged!

Co-authored-by: Eitan Porat <eporat@lightricks.com>

aireenmei force-pushed the qwen-deepstack branch from 49b62bc to c794255 Compare February 23, 2026 06:15

aireenmei force-pushed the qwen-deepstack branch from c794255 to dfa048a Compare February 23, 2026 07:01

aireenmei marked this pull request as ready for review February 23, 2026 21:16

aireenmei requested review from A9isha, NicoGrande, NuojCheng, RissyRan, SurbhiJainUSC, bvandermoon, gagika, gobbleturk, hengtaoguo, jesselu-google, jiangjy1982, khatwanimohit, richjames0, shralex, suexu1025 and vipannalla as code owners February 23, 2026 21:16

aireenmei force-pushed the qwen-deepstack branch 2 times, most recently from f19d310 to 7a3a9e2 Compare February 23, 2026 22:26

aireenmei added the gemini-review label Feb 23, 2026

github-actions Bot reviewed Feb 23, 2026

View reviewed changes

Comment thread src/maxtext/layers/decoders.py Outdated

Comment thread src/maxtext/layers/decoders.py Outdated

Comment thread src/maxtext/layers/decoders.py Outdated

Comment thread tests/unit/qwen3_omni_layers_test.py Outdated

aireenmei force-pushed the qwen-deepstack branch from 7a3a9e2 to c52f251 Compare February 23, 2026 22:55

aireenmei added gemini-review and removed gemini-review labels Feb 23, 2026

github-actions Bot reviewed Feb 23, 2026

View reviewed changes

Comment thread src/maxtext/layers/decoders.py

Comment thread tests/unit/qwen3_omni_layers_test.py

aireenmei force-pushed the qwen-deepstack branch from c52f251 to 58bf14e Compare February 23, 2026 23:33

hengtaoguo approved these changes Feb 23, 2026

View reviewed changes

entrpn approved these changes Feb 24, 2026

View reviewed changes

aireenmei force-pushed the qwen-deepstack branch from 58bf14e to 57c958a Compare February 24, 2026 20:21

Support Deepstack in qwen3-omni

7da6a17

Co-authored-by: Eitan Porat <eporat@lightricks.com>

aireenmei force-pushed the qwen-deepstack branch from 57c958a to 7da6a17 Compare February 24, 2026 20:27

aireenmei added the pull ready label Feb 24, 2026

copybara-service Bot merged commit 973d8e1 into main Feb 24, 2026
26 of 30 checks passed

copybara-service Bot deleted the qwen-deepstack branch February 24, 2026 22:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support Deepstack in qwen3-omni#3214

Support Deepstack in qwen3-omni#3214
copybara-service[bot] merged 1 commit intomainfrom
qwen-deepstack

aireenmei commented Feb 23, 2026 •

edited

Loading

Uh oh!

codecov Bot commented Feb 23, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Feb 23, 2026

Uh oh!

github-actions Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions Bot commented Feb 23, 2026

Uh oh!

github-actions Bot left a comment

Uh oh!

Uh oh!

Uh oh!

hengtaoguo left a comment •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

aireenmei commented Feb 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Tests

Checklist

Uh oh!

codecov Bot commented Feb 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

github-actions Bot commented Feb 23, 2026

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

🔍 General Feedback

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions Bot commented Feb 23, 2026

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

🔍 General Feedback

Uh oh!

Uh oh!

Uh oh!

hengtaoguo left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

aireenmei commented Feb 23, 2026 •

edited

Loading

codecov Bot commented Feb 23, 2026 •

edited

Loading

hengtaoguo left a comment •

edited

Loading