[AMDGPU] Respect existing glue when lowering convergence tokens #90834

ssahasra · 2024-05-02T08:32:45Z

No description provided.

llvmbot · 2024-05-02T08:33:18Z

@llvm/pr-subscribers-backend-amdgpu

Author: Sameer Sahasrabuddhe (ssahasra)

Changes

Full diff: https://github.com/llvm/llvm-project/pull/90834.diff

1 Files Affected:

(modified) llvm/lib/Target/AMDGPU/SIISelLowering.cpp (+10-11)

diff --git a/llvm/lib/Target/AMDGPU/SIISelLowering.cpp b/llvm/lib/Target/AMDGPU/SIISelLowering.cpp
index cb4efdc7cf657c..68dffdf8060486 100644
--- a/llvm/lib/Target/AMDGPU/SIISelLowering.cpp
+++ b/llvm/lib/Target/AMDGPU/SIISelLowering.cpp
@@ -3859,20 +3859,19 @@ SDValue SITargetLowering::LowerCall(CallLoweringInfo &CLI,
   assert(Mask && "Missing call preserved mask for calling convention");
   Ops.push_back(DAG.getRegisterMask(Mask));
 
-  if (InGlue.getNode())
-    Ops.push_back(InGlue);
-
-  // NOTE: This potentially results in *two* glue operands, and the wrong one
-  // might possibly show up where the other was intended. In particular,
-  // Emitter::EmitMachineNode() expects only the glued convergence token if it
-  // exists. Similarly, the selection of the call expects to match only the
-  // InGlue operand if it exists.
   if (SDValue Token = CLI.ConvergenceControlToken) {
-    Ops.push_back(SDValue(DAG.getMachineNode(TargetOpcode::CONVERGENCECTRL_GLUE,
-                                             DL, MVT::Glue, Token),
-                          0));
+    SmallVector<SDValue, 2> GlueOps;
+    GlueOps.push_back(Token);
+    if (InGlue.getNode())
+      GlueOps.push_back(InGlue);
+
+    InGlue = SDValue(DAG.getMachineNode(TargetOpcode::CONVERGENCECTRL_GLUE,
+                                       DL, MVT::Glue, GlueOps), 0);
   }
 
+  if (InGlue.getNode())
+    Ops.push_back(InGlue);
+
   SDVTList NodeTys = DAG.getVTList(MVT::Other, MVT::Glue);
 
   // If we're doing a tall call, use a TC_RETURN here rather than an

github-actions · 2024-05-02T08:35:42Z

✅ With the latest revision this PR passed the C/C++ code formatter.

jayfoad · 2024-05-02T09:00:26Z

I guess no nodes actually require this yet. Are you expecting any? Or is this just defensive programming?

arsenm · 2024-05-02T09:29:31Z

llvm/lib/Target/AMDGPU/SIISelLowering.cpp

-                          0));
+    SmallVector<SDValue, 2> GlueOps;
+    GlueOps.push_back(Token);
+    if (InGlue.getNode())


Don't need .getNode()

arsenm · 2024-05-02T09:29:40Z

llvm/lib/Target/AMDGPU/SIISelLowering.cpp

  }

+  if (InGlue.getNode())


Don't need .getNode()

ssahasra · 2024-05-03T07:13:29Z

I guess no nodes actually require this yet. Are you expecting any? Or is this just defensive programming?

What is "this" in this context?? When lowering calls, we do glue stuff onto the actually call instruction. Now we also need to glue the convergence token to the same call. The existing attempt happily created a second glue operand, and this change fixes that to have a single glue operand which could itself have a chain of glued operands.

jayfoad · 2024-05-14T09:45:48Z

I guess no nodes actually require this yet. Are you expecting any? Or is this just defensive programming?

What is "this" in this context??

"This" meant "this patch". I was trying to work out what kind of patch this is. Does it fix a bug? Why is there no test case?

ssahasra · 2024-05-15T03:51:01Z

I guess no nodes actually require this yet. Are you expecting any? Or is this just defensive programming?

"This" meant "this patch". I was trying to work out what kind of patch this is. Does it fix a bug? Why is there no test case?

This patch merely cements my improved understanding of how glue is supposed to be set on an operation. Callsites already have a glue operand that holds on to the arg nodes. The lowering of control flow tokens at a callsite was adding a second glue operand, which is unexpected. This patch cleans that up by chaining the token with the existing glue operand. In this narrow window inside the AMDGPU backend, this "bug" would have been triggered if those arg operands were actually accessed as glue. But they are not, so nothing broke when I put a second glue operand.

ssahasra requested review from jayfoad and arsenm May 2, 2024 08:32

llvmbot added the backend:AMDGPU label May 2, 2024

arsenm reviewed May 2, 2024

View reviewed changes

[AMDGPU] Respect existing glue when lowering convergence tokens

897cc6d

ssahasra force-pushed the ssahasra/convergence-glue branch from 3a5ee8a to 897cc6d Compare May 6, 2024 09:14

ssahasra merged commit 11e5d1c into llvm:main May 14, 2024
4 checks passed

ssahasra deleted the ssahasra/convergence-glue branch May 14, 2024 08:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AMDGPU] Respect existing glue when lowering convergence tokens #90834

[AMDGPU] Respect existing glue when lowering convergence tokens #90834

ssahasra commented May 2, 2024

llvmbot commented May 2, 2024

github-actions bot commented May 2, 2024 •

edited

Loading

jayfoad commented May 2, 2024

arsenm May 2, 2024

arsenm May 2, 2024

ssahasra commented May 3, 2024

jayfoad commented May 14, 2024

ssahasra commented May 15, 2024

[AMDGPU] Respect existing glue when lowering convergence tokens #90834

[AMDGPU] Respect existing glue when lowering convergence tokens #90834

Conversation

ssahasra commented May 2, 2024

llvmbot commented May 2, 2024

github-actions bot commented May 2, 2024 • edited Loading

jayfoad commented May 2, 2024

arsenm May 2, 2024

Choose a reason for hiding this comment

arsenm May 2, 2024

Choose a reason for hiding this comment

ssahasra commented May 3, 2024

jayfoad commented May 14, 2024

ssahasra commented May 15, 2024

github-actions bot commented May 2, 2024 •

edited

Loading