[GISEL] More accounting for scalable vectors when operating on LLTs #80372

michaelmaitland · 2024-02-02T01:41:05Z

This will be stacked on by a PR I am currently working on.

llvmbot · 2024-02-02T01:41:37Z

@llvm/pr-subscribers-llvm-globalisel

Author: Michael Maitland (michaelmaitland)

Changes

This will be stacked on by a PR I am currently working on.

Full diff: https://github.com/llvm/llvm-project/pull/80372.diff

2 Files Affected:

(modified) llvm/lib/CodeGen/GlobalISel/CallLowering.cpp (+7-6)
(modified) llvm/lib/CodeGen/MachineVerifier.cpp (+4-3)

diff --git a/llvm/lib/CodeGen/GlobalISel/CallLowering.cpp b/llvm/lib/CodeGen/GlobalISel/CallLowering.cpp
index ccd9b13d730b6..3bd1542eeb746 100644
--- a/llvm/lib/CodeGen/GlobalISel/CallLowering.cpp
+++ b/llvm/lib/CodeGen/GlobalISel/CallLowering.cpp
@@ -412,7 +412,7 @@ static void buildCopyFromRegs(MachineIRBuilder &B, ArrayRef<Register> OrigRegs,
     // size, e.g. PartLLT == v2s64 and LLTy is v3s32, then first coerce it to
     // have the same elt type, i.e. v4s32.
     // TODO: Extend this coersion to element multiples other than just 2.
-    if (PartLLT.getSizeInBits() > LLTy.getSizeInBits() &&
+    if (TypeSize::isKnownGT(PartLLT.getSizeInBits(), LLTy.getSizeInBits()) &&
         PartLLT.getScalarSizeInBits() == LLTy.getScalarSizeInBits() * 2 &&
         Regs.size() == 1) {
       LLT NewTy = PartLLT.changeElementType(LLTy.getElementType())
@@ -529,7 +529,7 @@ static void buildCopyToRegs(MachineIRBuilder &B, ArrayRef<Register> DstRegs,
   // We could just insert a regular copy, but this is unreachable at the moment.
   assert(SrcTy != PartTy && "identical part types shouldn't reach here");
 
-  const unsigned PartSize = PartTy.getSizeInBits();
+  const TypeSize PartSize = PartTy.getSizeInBits();
 
   if (PartTy.isVector() == SrcTy.isVector() &&
       PartTy.getScalarSizeInBits() > SrcTy.getScalarSizeInBits()) {
@@ -539,7 +539,7 @@ static void buildCopyToRegs(MachineIRBuilder &B, ArrayRef<Register> DstRegs,
   }
 
   if (SrcTy.isVector() && !PartTy.isVector() &&
-      PartSize > SrcTy.getElementType().getSizeInBits()) {
+      TypeSize::isKnownGT(PartSize, SrcTy.getElementType().getSizeInBits())) {
     // Vector was scalarized, and the elements extended.
     auto UnmergeToEltTy = B.buildUnmerge(SrcTy.getElementType(), SrcReg);
     for (int i = 0, e = DstRegs.size(); i != e; ++i)
@@ -548,9 +548,10 @@ static void buildCopyToRegs(MachineIRBuilder &B, ArrayRef<Register> DstRegs,
   }
 
   if (SrcTy.isVector() && PartTy.isVector() &&
-      PartTy.getScalarSizeInBits() == SrcTy.getScalarSizeInBits() &&
-      SrcTy.getNumElements() < PartTy.getNumElements()) {
-    // A coercion like: v2f32 -> v4f32.
+      PartTy.getSizeInBits() == SrcTy.getSizeInBits() &&
+      ElementCount::isKnownLT(SrcTy.getElementCount(),
+                              PartTy.getElementCount())) {
+    // A coercion like: v2f32 -> v4f32 or nxv2f32 -> nxv4f32
     Register DstReg = DstRegs.front();
     B.buildPadVectorWithUndefElements(DstReg, SrcReg);
     return;
diff --git a/llvm/lib/CodeGen/MachineVerifier.cpp b/llvm/lib/CodeGen/MachineVerifier.cpp
index b182000a3d705..fe865ed6f8a0e 100644
--- a/llvm/lib/CodeGen/MachineVerifier.cpp
+++ b/llvm/lib/CodeGen/MachineVerifier.cpp
@@ -1400,7 +1400,8 @@ void MachineVerifier::verifyPreISelGenericInstruction(const MachineInstr *MI) {
     if (DstTy.isVector()) {
       // This case is the converse of G_CONCAT_VECTORS.
       if (!SrcTy.isVector() || SrcTy.getScalarType() != DstTy.getScalarType() ||
-          SrcTy.getNumElements() != NumDsts * DstTy.getNumElements())
+          SrcTy.isScalableVector() != DstTy.isScalableVector() ||
+          SrcTy.getSizeInBits() != NumDsts * DstTy.getSizeInBits())
         report("G_UNMERGE_VALUES source operand does not match vector "
                "destination operands",
                MI);
@@ -1477,8 +1478,8 @@ void MachineVerifier::verifyPreISelGenericInstruction(const MachineInstr *MI) {
     for (const MachineOperand &MO : llvm::drop_begin(MI->operands(), 2))
       if (MRI->getType(MI->getOperand(1).getReg()) != MRI->getType(MO.getReg()))
         report("G_CONCAT_VECTOR source operand types are not homogeneous", MI);
-    if (DstTy.getNumElements() !=
-        SrcTy.getNumElements() * (MI->getNumOperands() - 1))
+    if (DstTy.getElementCount() !=
+        SrcTy.getElementCount() * (MI->getNumOperands() - 1))
       report("G_CONCAT_VECTOR num dest and source elements should match", MI);
     break;
   }

…lvm#80372) This is stacked on by llvm#80377 and llvm#80378

…or type (#80377) This patch is stacked on #80372, #80307, and #80306.

…80378) This patch is stacked on #80372, #80307, and #80306. ShuffleVector on scalable vector types gets IRTranslate'd to G_SPLAT_VECTOR since a ShuffleVector that has operates on scalable vectors is a splat vector where the value of the splat vector is the 0th element of the first operand, because the index mask operand is the zeroinitializer (undef and poison are treated as zeroinitializer here). This is analogous to what happens in SelectionDAG for ShuffleVector. `buildSplatVector` is renamed to`buildBuildVectorSplatVector`. I did not make this a separate patch because it would cause problems to revert that change without reverting this change too.

michaelmaitland requested review from arsenm and tschuett February 2, 2024 01:41

llvmbot added the llvm:globalisel label Feb 2, 2024

This was referenced Feb 2, 2024

[RISCV][GISEL] Add IRTranslation for insertelement with scalable vector type #80377

Merged

[GISEL] Add IRTranslation for shufflevector on scalable vector types #80378

Merged

[GISEL] More accounting for scalable vectors when operating on LLTs

b731257

michaelmaitland force-pushed the more-scalable branch from ff32ae5 to b731257 Compare February 2, 2024 14:51

arsenm approved these changes Feb 2, 2024

View reviewed changes

michaelmaitland merged commit ad0acf9 into llvm:main Feb 2, 2024
4 checks passed

agozillon pushed a commit to agozillon/llvm-project that referenced this pull request Feb 5, 2024

[GISEL] More accounting for scalable vectors when operating on LLTs (l…

6e5a5f9

…lvm#80372) This is stacked on by llvm#80377 and llvm#80378

michaelmaitland added a commit that referenced this pull request Feb 19, 2024

[RISCV][GISEL] Add IRTranslation for insertelement with scalable vect…

44a46a0

…or type (#80377) This patch is stacked on #80372, #80307, and #80306.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[GISEL] More accounting for scalable vectors when operating on LLTs #80372

[GISEL] More accounting for scalable vectors when operating on LLTs #80372

michaelmaitland commented Feb 2, 2024

llvmbot commented Feb 2, 2024

[GISEL] More accounting for scalable vectors when operating on LLTs #80372

[GISEL] More accounting for scalable vectors when operating on LLTs #80372

Conversation

michaelmaitland commented Feb 2, 2024

llvmbot commented Feb 2, 2024