Skip to content

Conversation

@Mogball
Copy link
Collaborator

@Mogball Mogball commented Apr 14, 2025

No description provided.

@Mogball Mogball requested a review from ptillet as a code owner April 14, 2025 18:53
@Mogball Mogball requested a review from peterbell10 April 14, 2025 18:53
@Mogball Mogball merged commit 31b2b23 into main Apr 14, 2025
8 checks passed
@Mogball Mogball deleted the mogball/fix_side_effects branch April 14, 2025 19:10
peterbell10 added a commit that referenced this pull request Apr 16, 2025
Mogball added a commit that referenced this pull request Apr 16, 2025
The first PR in this revert stack is somehow changing bitwise
equivalence. To unblock work, I am reverting them and relanding them
with better granularity.

Revert "[Blackwell] Optimize MMA warp specialization to allow multiple consumers of MMAv5 result (#6487)"

This reverts commit 96fc40d.

Revert "[TritonNVIDIAGPU] Add `MemRead<GlobalMemory>` to async TMA write ops (#6485)"

This reverts commit 31b2b23.

Revert "[TritonNVIDIAGPU] Revert MMAv5 write effect on barrier (#6484)"

This reverts commit f8a19d1.

Revert "[Dialect] Cleanup and improve granularity of side effects (#6476)"

This reverts commit f60465e.
Mogball added a commit that referenced this pull request Apr 16, 2025
Reland of #6485

All async TMA ops read the descriptor. MMAScaled is missing read effects
on the scale arguments, which are always in TMEM.
Mogball added a commit that referenced this pull request Apr 17, 2025
The first PR in this revert stack is somehow changing bitwise
equivalence. To unblock work, I am reverting them and relanding them
with better granularity.

Revert "[Blackwell] Optimize MMA warp specialization to allow multiple
consumers of MMAv5 result (#6487)"

This reverts commit 96fc40d.

Revert "[TritonNVIDIAGPU] Add `MemRead<GlobalMemory>` to async TMA write
ops (#6485)"

This reverts commit 31b2b23.

Revert "[TritonNVIDIAGPU] Revert MMAv5 write effect on barrier (#6484)"

This reverts commit f8a19d1.

Revert "[Dialect] Cleanup and improve granularity of side effects
(#6476)"

This reverts commit f60465e.
jtang10 pushed a commit to ROCm/triton that referenced this pull request Apr 17, 2025
…ang#6476 (triton-lang#6512)

The first PR in this revert stack is somehow changing bitwise
equivalence. To unblock work, I am reverting them and relanding them
with better granularity.

Revert "[Blackwell] Optimize MMA warp specialization to allow multiple
consumers of MMAv5 result (triton-lang#6487)"

This reverts commit 96fc40d.

Revert "[TritonNVIDIAGPU] Add `MemRead<GlobalMemory>` to async TMA write
ops (triton-lang#6485)"

This reverts commit 31b2b23.

Revert "[TritonNVIDIAGPU] Revert MMAv5 write effect on barrier (triton-lang#6484)"

This reverts commit f8a19d1.

Revert "[Dialect] Cleanup and improve granularity of side effects
(triton-lang#6476)"

This reverts commit f60465e.
njriasan pushed a commit to njriasan/triton that referenced this pull request Apr 18, 2025
…ang#6476 (triton-lang#6512)

The first PR in this revert stack is somehow changing bitwise
equivalence. To unblock work, I am reverting them and relanding them
with better granularity.

Revert "[Blackwell] Optimize MMA warp specialization to allow multiple
consumers of MMAv5 result (triton-lang#6487)"

This reverts commit 96fc40d.

Revert "[TritonNVIDIAGPU] Add `MemRead<GlobalMemory>` to async TMA write
ops (triton-lang#6485)"

This reverts commit 31b2b23.

Revert "[TritonNVIDIAGPU] Revert MMAv5 write effect on barrier (triton-lang#6484)"

This reverts commit f8a19d1.

Revert "[Dialect] Cleanup and improve granularity of side effects
(triton-lang#6476)"

This reverts commit f60465e.
Mogball added a commit that referenced this pull request Apr 23, 2025
Reland of #6485

All async TMA ops read the descriptor. MMAScaled is missing read effects
on the scale arguments, which are always in TMEM.
FindHao pushed a commit to FindHao/triton that referenced this pull request Apr 30, 2025
…ang#6476 (triton-lang#6512)

The first PR in this revert stack is somehow changing bitwise
equivalence. To unblock work, I am reverting them and relanding them
with better granularity.

Revert "[Blackwell] Optimize MMA warp specialization to allow multiple
consumers of MMAv5 result (triton-lang#6487)"

This reverts commit 96fc40d.

Revert "[TritonNVIDIAGPU] Add `MemRead<GlobalMemory>` to async TMA write
ops (triton-lang#6485)"

This reverts commit 31b2b23.

Revert "[TritonNVIDIAGPU] Revert MMAv5 write effect on barrier (triton-lang#6484)"

This reverts commit f8a19d1.

Revert "[Dialect] Cleanup and improve granularity of side effects
(triton-lang#6476)"

This reverts commit f60465e.
FindHao pushed a commit to FindHao/triton that referenced this pull request Apr 30, 2025
…#6518)

Reland of triton-lang#6485

All async TMA ops read the descriptor. MMAScaled is missing read effects
on the scale arguments, which are always in TMEM.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants