added 2 commits
March 3, 2026 18:13
3 problems targeting MI355X from AMD-AIM/reference-kernels@20260209. Also fixes eval.py regex to support underscored keys and booleans.
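For illustration, a minimal sketch of what a test-case parser that accepts underscored keys and booleans could look like. It assumes test cases are written as `key: value` fields separated by `;`, which is an assumption and not taken from this PR. The function name `parse_case` and the key `use_flag` are hypothetical, and this is not the actual eval.py code.

```python
import re

# Hypothetical field pattern: a key may contain underscores (and digits after
# the first character); a value is either a word or a signed integer.
_FIELD = re.compile(r"\s*([A-Za-z_][A-Za-z0-9_]*)\s*:\s*([A-Za-z_]+|[+-]?[0-9]+)\s*")


def parse_case(line):
    """Parse 'key: value; key: value' into a dict with int/bool/str values."""
    case = {}
    for part in line.split(";"):
        if not part.strip():
            continue  # tolerate a trailing ';'
        m = _FIELD.fullmatch(part)
        if m is None:
            raise ValueError(f"invalid test case field: {part!r}")
        key, raw = m.group(1), m.group(2)
        if raw in ("True", "true"):
            value = True
        elif raw in ("False", "false"):
            value = False
        else:
            try:
                value = int(raw)
            except ValueError:
                value = raw  # keep non-numeric, non-boolean words as strings
        case[key] = value
    return case


print(parse_case("num_heads: 16; tp: 4; use_flag: true"))
```

Checking booleans before the integer conversion keeps `true`/`false` from being stored as plain strings.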
aiter's fused_moe kernel produces different results across calls with identical inputs on gfx950, causing the reference submission to fail correctness checks against itself.
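The failure mode described here can be detected with a simple repeat-call check. The sketch below is generic and does not call aiter. `check_deterministic`, `fn`, and `make_inputs` are hypothetical names, and the comparison function is a parameter so a tensor comparison (for example `torch.equal`) could be passed in.

```python
def check_deterministic(fn, make_inputs, runs=3, equal=lambda a, b: a == b):
    """Return True if fn gives the same output on every run.

    make_inputs is called before each run so every call sees fresh but
    identical inputs, in case fn mutates its arguments.
    """
    baseline = fn(*make_inputs())
    for _ in range(runs - 1):
        if not equal(fn(*make_inputs()), baseline):
            return False
    return True


# Stand-in for a kernel call; a real check would wrap the kernel under test.
print(check_deterministic(lambda xs: sum(xs), lambda: ([1, 2, 3],)))
```

A kernel that fails this check cannot pass an exact comparison against its own earlier output, which matches the symptom reported for the reference submission.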
added 2 commits
March 4, 2026 07:29
- mixed-mla: Add tp (tensor parallel) parameter, variable num_heads, qseqlen=4 prefill cases, updated test/benchmark shapes
- moe-mxfp4: Updated benchmark shapes with TP=4/TP=8 variants, different batch sizes
- mxfp4-mm: Added m=32 benchmark, adjusted shape set
aiter JIT compilation on first run can take 10+ minutes on MI355X, causing test timeouts. Bump all timeouts to 1800s (30 min).
Summary
20260209
amd_202602.yaml
eval.py includes regex fix for underscore-containing test case keys and boolean value parsing
Known issue
The moe-mxfp4 reference submission calls aiter's fused_moe kernel, which is non-deterministic on MI355X: it produces different results across identical calls, failing correctness checks against itself. This is documented in the task.yml.
Test plan
mia1-p02-g29)