Skip to content

add qwen35 ut#1482

Merged
wenhuach21 merged 32 commits intomainfrom
support_qwen35
Mar 2, 2026
Merged

add qwen35 ut#1482
wenhuach21 merged 32 commits intomainfrom
support_qwen35

Conversation

@wenhuach21
Copy link
Copy Markdown
Contributor

@wenhuach21 wenhuach21 commented Mar 2, 2026

Description

Please briefly describe your main changes, the motivation.

Type of Change

  • Bug fix
  • New feature
  • Documentation update
  • Performance improvement
  • Code refactoring
  • Other (please specify):

Related Issues

Fixes or relates to #

Checklist Before Submitting

  • My code has been tested locally.
  • Documentation has been updated as needed.
  • New or updated tests are included where applicable.

wenhuach21 and others added 20 commits February 27, 2026 16:13
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Copilot AI review requested due to automatic review settings March 2, 2026 03:45
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Adds a new CUDA test for quantizing/loading a Qwen3.5 MoE model and introduces a helper to conditionally skip based on transformers version, while also disabling several existing MoE tests via commenting.

Changes:

  • Added check_version() helper and used it to gate a new Qwen3.5 MoE test with pytest.mark.skipif.
  • Added a new setup_qwen35_moe fixture and test_qwen3_5_moe test that quantizes + reloads a Qwen3.5 MoE model.
  • Commented out previously active quantization helpers/tests in test_moe_model.py.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 8 comments.

File Description
test/test_cuda/models/test_moe_model.py Adds Qwen3.5 MoE quantize/load test + version gating; comments out multiple existing tests/helpers
test/helpers.py Adds check_version() utility used by tests to skip based on dependency versions

wenhuach21 and others added 3 commits March 2, 2026 15:41
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
pre-commit-ci bot and others added 3 commits March 2, 2026 07:44
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated 4 comments.

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
@wenhuach21 wenhuach21 merged commit 2b5d117 into main Mar 2, 2026
15 checks passed
@wenhuach21 wenhuach21 deleted the support_qwen35 branch March 2, 2026 11:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants