Revert "[WIP][Megatron-LM] feat: reduce extra qkv transpose in attn" by Xiaoming-AMD · Pull Request #641 · AMD-AGI/Primus

Xiaoming-AMD · 2026-04-01T08:26:09Z

Reverts #625

…625)" This reverts commit 7665157.

Copilot

Pull request overview

Reverts prior work in Megatron-LM Primus-Turbo attention intended to reduce extra QKV transposes, restoring earlier QKV layout handling and output layout conversion behavior.

Changes:

Updates PrimusTurboAttention.forward() QKV layout validation and conditional transpose behavior based on qkv_format.
Adjusts attention output reshaping/transposition back to expected (S, B, ...) layout.

Copilot · 2026-04-01T08:29:55Z

-        ), f"qkv_format only support {SUPPORTED_QKV_FORMATS}, but got {qkv_format}"
-        # NOTE(ruibin): The layout of q, k and v is (S, B, H, D). But attn accept the shape of qkv is (B, S, H, D).
-        query, key, value = [x.permute(1, 0, 2, 3) for x in (query, key, value)]
+        assert qkv_format in ("sbhd", "bhsd"), "qkv_format only support bshd, but got {qkv_format}"


The qkv_format validation looks inconsistent: it allows ("sbhd", "bhsd") but the error text says "bshd", and {qkv_format} won’t be interpolated because this isn’t an f-string. If the intended formats are SBHD and BSHD (matching the comment removed in this revert), update the allowed tuple and make the message an f-string (or use ValueError) so the reported format is correct.

Suggested change

assert qkv_format in ("sbhd", "bhsd"), "qkv_format only support bshd, but got {qkv_format}"

assert qkv_format in ("sbhd", "bshd"), f"qkv_format only supports 'sbhd' and 'bshd', but got {qkv_format}"

Revert "[WIP][Megatron-LM] feat: reduce extra qkv transpose in attn (#…

4b1dca6

…625)" This reverts commit 7665157.

Xiaoming-AMD requested a review from wenxie-amd as a code owner April 1, 2026 08:26

Copilot AI review requested due to automatic review settings April 1, 2026 08:26

Xiaoming-AMD requested a review from limou102 as a code owner April 1, 2026 08:26

Xiaoming-AMD merged commit b61cddc into main Apr 1, 2026
5 of 7 checks passed

Copilot AI reviewed Apr 1, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Revert "[WIP][Megatron-LM] feat: reduce extra qkv transpose in attn"#641

Revert "[WIP][Megatron-LM] feat: reduce extra qkv transpose in attn"#641
Xiaoming-AMD merged 1 commit into
mainfrom
revert-625-dev/zhangrb/refine_turbo_attn

Xiaoming-AMD commented Apr 1, 2026

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Apr 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	assert qkv_format in ("sbhd", "bhsd"), "qkv_format only support bshd, but got {qkv_format}"
	assert qkv_format in ("sbhd", "bshd"), f"qkv_format only supports 'sbhd' and 'bshd', but got {qkv_format}"

Conversation

Xiaoming-AMD commented Apr 1, 2026

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Apr 1, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants