
Fix group topk dispatch for glm5 #2611

Merged
valarLip merged 2 commits into main from jun/group_topk_glm5 on Apr 4, 2026

Conversation

@junhaha666 (Contributor)

Motivation

moe_fused_gate does not support configurations with more than 32 experts per group, which breaks grouped top-k dispatch for GLM5.

Technical Details

Route biased_grouped_topk to the HIP implementation when experts-per-group exceeds 32, and skip the moe_fused_gate comparison in the benchmark test for those configurations.

Test Plan

Test Result

Submission Checklist
@junhaha666 junhaha666 requested review from a team and Copilot April 3, 2026 13:23
github-actions bot (Contributor) commented Apr 3, 2026

🏷️ CI Guide

Runs automatically on every PR:

  • ✅ Pre-checks (submodule verification, code formatting)
  • ✅ Aiter op tests (gfx942 + gfx950)
  • ✅ Triton tests (only when aiter/ops/triton/** or related paths are changed)

Extended tests (opt-in via labels):

  • ci:triton-355: Run Triton tests on MI355 in addition to MI325
  • ci:sglang: SGLang integration tests
  • ci:atom: ATOM benchmark (DeepSeek-R1 + GPT-OSS)
  • ci:vllm: vLLM benchmark
  • ci:all: All of the above

Add labels via the sidebar or gh pr edit 2611 --add-label <label>

Copilot AI (Contributor) left a comment

Pull request overview

Updates the grouped top‑k dispatch logic to avoid using moe_fused_gate for configurations with large experts-per-group (notably affecting GLM5), and adjusts the associated benchmark test to skip the moe_fused_gate comparison in those cases.

Changes:

  • Update biased_grouped_topk dispatch to route to the HIP implementation when experts-per-group exceeds 32.
  • Gate the moe_fused_gate (“sglang”) performance/correctness comparison in the benchmark test to only run when experts-per-group is ≤ 32.
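
The dispatch rule the overview describes can be sketched as follows. This is an illustrative reconstruction, not the actual aiter/ops/topk.py code; the function name, constant name, and return values are hypothetical, and only the greater-than-32 experts-per-group condition comes from this PR.

```python
# Hypothetical sketch of the dispatch change; names are illustrative.
MAX_FUSED_GATE_EXPERTS_PER_GROUP = 32  # assumed moe_fused_gate limit


def pick_biased_grouped_topk_backend(num_experts: int, num_expert_group: int) -> str:
    """Choose a backend for biased grouped top-k.

    Configurations with large experts-per-group (e.g. GLM5) exceed what
    moe_fused_gate supports, so they route to the HIP implementation.
    """
    experts_per_group = num_experts // num_expert_group
    if experts_per_group > MAX_FUSED_GATE_EXPERTS_PER_GROUP:
        return "hip"  # biased_grouped_topk HIP kernel
    return "moe_fused_gate"
```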

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 4 comments.

  • op_tests/test_moeTopkSoftmax.py: Skips the moe_fused_gate comparison when experts-per-group exceeds the supported threshold.
  • aiter/ops/topk.py: Adjusts the dispatch logic to avoid moe_fused_gate when experts-per-group is too large.
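
The benchmark test applies the same cutoff. A minimal sketch of the guard, with a hypothetical helper name (the actual logic in op_tests/test_moeTopkSoftmax.py may be structured differently):

```python
def should_compare_moe_fused_gate(num_experts: int, num_expert_group: int) -> bool:
    """Run the moe_fused_gate ("sglang") comparison only when the
    experts-per-group count is within the kernel's supported range."""
    return (num_experts // num_expert_group) <= 32


# Example: 256 experts split into 2 groups gives 128 experts per group,
# so the comparison is skipped for such a configuration.
assert not should_compare_moe_fused_gate(256, 2)
assert should_compare_moe_fused_gate(256, 8)  # 32 per group: still allowed
```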


Copilot opened two comment threads on aiter/ops/topk.py and two on op_tests/test_moeTopkSoftmax.py.
@valarLip valarLip merged commit 35d347b into main Apr 4, 2026
23 of 25 checks passed
@valarLip valarLip deleted the jun/group_topk_glm5 branch April 4, 2026 04:31
yzhou103 pushed a commit that referenced this pull request Apr 8, 2026
* Fix group topk dispatch for glm5

* update grouped_topk to not compute the top-k group when group=1
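
The second commit's group=1 short-circuit can be illustrated with a generic grouped top-k masking routine. This is a sketch of the common pattern (DeepSeek-style group masking), not the repository's actual grouped_topk implementation:

```python
import torch


def grouped_topk_mask(scores: torch.Tensor, num_expert_group: int,
                      topk_group: int) -> torch.Tensor:
    """Mask expert scores outside the top-k expert groups.

    With a single group there is nothing to select between, so the
    per-group top-k computation is skipped entirely.
    """
    if num_expert_group == 1:
        return scores  # every expert is in the only (trivially top) group
    tokens, num_experts = scores.shape
    experts_per_group = num_experts // num_expert_group
    # Score each group by its best expert, then keep the topk_group groups.
    group_scores = scores.view(tokens, num_expert_group,
                               experts_per_group).amax(dim=-1)
    group_idx = torch.topk(group_scores, k=topk_group, dim=-1).indices
    group_mask = torch.zeros_like(group_scores).scatter_(1, group_idx, 1.0)
    expert_mask = group_mask.unsqueeze(-1).expand(
        tokens, num_expert_group, experts_per_group).reshape(tokens, num_experts)
    return scores.masked_fill(expert_mask == 0, float("-inf"))
```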
