Skip to content

Conversation

@yuhao-zh
Copy link
Collaborator

@yuhao-zh yuhao-zh commented Oct 23, 2025

📝 Change Type

Please select the type of change this PR introduces (choose one or more):

  • feat: New feature.
  • fix: Bug fix.
  • docs: Documentation only changes.
  • refactor: A code change that neither fixes a bug nor adds a feature.
  • perf: Performance improvement.
  • test: Adding missing tests or correcting existing tests.
  • chore: Maintenance tasks (e.g., updating dependencies).

💡 Description

Fix Qwen3 next attention layer's bug which make model run failed

Key Changes

  1. update backend to hybrid model
  2. fix qwen3_next model bug, force paged_size to 1
  3. update sglang to 0.5.3

🔗 Related Issues

List any issues this PR closes or relates to:

#101

✅ Checklist

Please ensure the following points are addressed before merging:

  • I have performed a self-review of my own code.
  • I have added/updated tests that prove my fix or feature works (if applicable).
  • I have updated the documentation (if necessary).
  • My code follows the project's style guidelines.

@yuhao-zh yuhao-zh changed the title Fix(model): fix qwen3 next bug & update sgalng to 0.5.3 fix(model): fix qwen3 next bug & update sgalng to 0.5.3 Oct 23, 2025
@yuhao-zh yuhao-zh changed the title fix(model): fix qwen3 next bug & update sgalng to 0.5.3 fix(model): fix qwen3 next bug & update sgalng to 0.5.4 Oct 24, 2025
@yuhao-zh yuhao-zh changed the title fix(model): fix qwen3 next bug & update sgalng to 0.5.4 fix(model): fix qwen3 next bug & update sgalng to 0.5.4.post1 Oct 27, 2025
@yuhao-zh yuhao-zh merged commit 988422f into main Oct 27, 2025
4 checks passed
@yuhao-zh yuhao-zh deleted the fix/model_qwen3_next branch October 27, 2025 10:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants