
Conversation

Jintao-Huang (Collaborator)

No description provided.

gemini-code-assist bot (Contributor)

Summary of Changes

Hello @Jintao-Huang, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request adds support for the FP8 quantized variants of the Qwen3-Next 80B Instruct and Thinking models, enabling more memory-efficient deployment and inference. It also updates the supported-models list in the documentation, marking several Qwen3-Coder and Kimi-VL models as supported, and tightens a transformers version constraint for the Intern-S1 models to improve compatibility and stability.

Highlights

  • Qwen3-Next FP8 Model Support: Added explicit support for the FP8 quantized versions of the Qwen3-Next-80B-A3B-Instruct and Qwen3-Next-80B-A3B-Thinking models, enabling more efficient deployment and inference.
  • Documentation Updates: Updated both the English and Chinese supported-models documentation, reflecting the newly added Qwen3-Next FP8 models and marking several existing Qwen3-Coder and Kimi-VL models as fully supported.
  • Dependency Version Constraint Refinement: Narrowed the transformers version requirement for Intern-S1 models to the explicit range >=4.55.2,<4.56; a self-contained sketch of these changes follows this list.
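
To make the three bullets concrete, here is a minimal, self-contained sketch of what the changes amount to. It uses plain Python dictionaries with hypothetical keys rather than ms-swift's actual registration API; only the model IDs and version constraints are taken from this PR's description.

```python
# A minimal sketch (hypothetical structure, not ms-swift's actual API)
# of what this PR's registry changes amount to: new FP8 model IDs under
# qwen3_next, plus per-family transformers version constraints.
SUPPORTED_MODELS = {
    'qwen3_next': {
        'model_ids': [
            'Qwen/Qwen3-Next-80B-A3B-Instruct',
            'Qwen/Qwen3-Next-80B-A3B-Thinking',
            # Newly added FP8 variants:
            'Qwen/Qwen3-Next-80B-A3B-Instruct-FP8',
            'Qwen/Qwen3-Next-80B-A3B-Thinking-FP8',
        ],
        # qwen3_next currently needs a development build of transformers.
        'requires': ['transformers>=4.57.0.dev'],
    },
    'intern_s1': {
        # Constraint narrowed from '>=4.55.2' to an explicit range:
        'requires': ['transformers>=4.55.2,<4.56'],
    },
}
```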
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page; here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature               Command               Description
Code Review           /gemini review        Performs a code review for the current pull request in its current state.
Pull Request Summary  /gemini summary       Provides a summary of the current pull request in its current state.
Comment               @gemini-code-assist   Responds in comments when explicitly tagged, both in issue comments and review comments.
Help                  /gemini help          Displays a list of available commands.

Customization

To customize the Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code-review style guide (such as PEP-8 for Python) by adding files to a .gemini/ folder at the base of the repository. Detailed instructions can be found here.
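
As a concrete illustration (an assumption based on the linked documentation, not something present in this repository): adding a .gemini/styleguide.md file containing plain-English rules such as "Follow PEP-8" or "Prefer f-strings over %-formatting" lets subsequent Gemini code reviews take those rules into account.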

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counterproductive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

You can also get AI-powered code generation, chat, and code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double-check its output and use code with caution.


@gemini-code-assist bot (Contributor) left a comment


Code Review

This pull request adds support for FP8 variants of qwen3_next models. It also includes several other unrelated updates: it changes Megatron support flags for Qwen3-Coder and Kimi-VL models, and pins the transformers dependency for Intern-S1 models. While it's generally better to keep pull requests focused on a single concern for clarity and easier reviews, the changes appear correct. The core change for qwen3_next FP8 support is sound. However, a key concern is the new dependency on a development version of transformers (>=4.57.0.dev), which could introduce instability for users.
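
A practical note for readers, not part of the bot's review: a development build of transformers is normally installed from source, for example with pip install git+https://github.com/huggingface/transformers.git, until the pinned 4.57.0 release ships. This is a common workaround, not guidance taken from this PR.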

@Jintao-Huang Jintao-Huang merged commit 2efd7de into modelscope:main Sep 22, 2025
1 of 2 checks passed


3 participants