[MemCpyOpt] Handle scalable aggregate types in memmove/memset formation #80487

preames · 2024-02-02T20:24:11Z

Without this change, the included test cases crash the compiler. I believe this is fallout from the homogenous scalable struct work from a while back; I think we just forgot to update this case.

Likely to fix #80463.

Without this change, the included test cases crash the compiler. I believe this is fallout from the homogenous scalable struct work from a while back; I think we just forgot to update this case. Likely to fix llvm#80463.

llvmbot · 2024-02-02T20:24:38Z

@llvm/pr-subscribers-llvm-transforms

Author: Philip Reames (preames)

Changes

Without this change, the included test cases crash the compiler. I believe this is fallout from the homogenous scalable struct work from a while back; I think we just forgot to update this case.

Likely to fix #80463.

Full diff: https://github.com/llvm/llvm-project/pull/80487.diff

2 Files Affected:

(modified) llvm/lib/Transforms/Scalar/MemCpyOptimizer.cpp (+2-2)
(modified) llvm/test/Transforms/MemCpyOpt/vscale-crashes.ll (+38)

diff --git a/llvm/lib/Transforms/Scalar/MemCpyOptimizer.cpp b/llvm/lib/Transforms/Scalar/MemCpyOptimizer.cpp
index 805bbe40bd7c7..1036b8ae963a2 100644
--- a/llvm/lib/Transforms/Scalar/MemCpyOptimizer.cpp
+++ b/llvm/lib/Transforms/Scalar/MemCpyOptimizer.cpp
@@ -677,9 +677,9 @@ bool MemCpyOptPass::processStoreOfLoad(StoreInst *SI, LoadInst *LI,
       if (isModSet(AA->getModRefInfo(SI, LoadLoc)))
         UseMemMove = true;
 
-      uint64_t Size = DL.getTypeStoreSize(T);
-
       IRBuilder<> Builder(P);
+      Value *Size = Builder.CreateTypeSize(Builder.getInt64Ty(),
+                                           DL.getTypeStoreSize(T));
       Instruction *M;
       if (UseMemMove)
         M = Builder.CreateMemMove(
diff --git a/llvm/test/Transforms/MemCpyOpt/vscale-crashes.ll b/llvm/test/Transforms/MemCpyOpt/vscale-crashes.ll
index 84b06f6071ff6..42ad92ee03f4d 100644
--- a/llvm/test/Transforms/MemCpyOpt/vscale-crashes.ll
+++ b/llvm/test/Transforms/MemCpyOpt/vscale-crashes.ll
@@ -85,5 +85,43 @@ define void @callslotoptzn(<vscale x 4 x float> %val, ptr %out) {
   ret void
 }
 
+%0 = type { <vscale x 8 x i8> }
+%1 = type { <vscale x 8 x i8>, <vscale x 8 x i8> }
+
+define void @memmove_vector(ptr %a, ptr %b) {
+; CHECK-LABEL: @memmove_vector(
+; CHECK-NEXT:    [[V:%.*]] = load <vscale x 8 x i8>, ptr [[A:%.*]], align 1
+; CHECK-NEXT:    store <vscale x 8 x i8> [[V]], ptr [[B:%.*]], align 1
+; CHECK-NEXT:    ret void
+;
+  %v = load <vscale x 8 x i8>, ptr %a, align 1
+  store <vscale x 8 x i8> %v, ptr %b, align 1
+  ret void
+}
+
+define void @memmove_agg1(ptr %a, ptr %b) {
+; CHECK-LABEL: @memmove_agg1(
+; CHECK-NEXT:    [[TMP1:%.*]] = call i64 @llvm.vscale.i64()
+; CHECK-NEXT:    [[TMP2:%.*]] = mul i64 [[TMP1]], 8
+; CHECK-NEXT:    call void @llvm.memmove.p0.p0.i64(ptr align 1 [[B:%.*]], ptr align 1 [[A:%.*]], i64 [[TMP2]], i1 false)
+; CHECK-NEXT:    ret void
+;
+  %v = load %0, ptr %a, align 1
+  store %0 %v, ptr %b, align 1
+  ret void
+}
+
+define void @memmove_agg2(ptr %a, ptr %b) {
+; CHECK-LABEL: @memmove_agg2(
+; CHECK-NEXT:    [[TMP1:%.*]] = call i64 @llvm.vscale.i64()
+; CHECK-NEXT:    [[TMP2:%.*]] = mul i64 [[TMP1]], 16
+; CHECK-NEXT:    call void @llvm.memmove.p0.p0.i64(ptr align 1 [[B:%.*]], ptr align 1 [[A:%.*]], i64 [[TMP2]], i1 false)
+; CHECK-NEXT:    ret void
+;
+  %v = load %1, ptr %a, align 1
+  store %1 %v, ptr %b, align 1
+  ret void
+}
+
 declare <vscale x 4 x i32> @llvm.experimental.stepvector.nxv4i32()
 declare void @llvm.masked.scatter.nxv4f32.nxv4p0f32(<vscale x 4 x float> , <vscale x 4 x ptr> , i32, <vscale x 4 x i1>)

github-actions · 2024-02-02T20:27:04Z

⚠️ C/C++ code formatter, clang-format found issues in your code. ⚠️

You can test this locally with the following command:

git-clang-format --diff b78b264518e0f341d99a4291cbf24134c7536f6d 22a3d91d6c0c157d3e6529b18ac30292fbb30a97 -- llvm/lib/Transforms/Scalar/MemCpyOptimizer.cpp

View the diff from clang-format here.

diff --git a/llvm/lib/Transforms/Scalar/MemCpyOptimizer.cpp b/llvm/lib/Transforms/Scalar/MemCpyOptimizer.cpp
index 1036b8ae96..7802926d20 100644
--- a/llvm/lib/Transforms/Scalar/MemCpyOptimizer.cpp
+++ b/llvm/lib/Transforms/Scalar/MemCpyOptimizer.cpp
@@ -678,8 +678,8 @@ bool MemCpyOptPass::processStoreOfLoad(StoreInst *SI, LoadInst *LI,
         UseMemMove = true;
 
       IRBuilder<> Builder(P);
-      Value *Size = Builder.CreateTypeSize(Builder.getInt64Ty(),
-                                           DL.getTypeStoreSize(T));
+      Value *Size =
+          Builder.CreateTypeSize(Builder.getInt64Ty(), DL.getTypeStoreSize(T));
       Instruction *M;
       if (UseMemMove)
         M = Builder.CreateMemMove(

nikic

LGTM

…on (llvm#80487) Without this change, the included test cases crash the compiler. I believe this is fallout from the homogenous scalable struct work from a while back; I think we just forgot to update this case. Likely to fix llvm#80463.

asb · 2024-02-07T13:32:59Z

@preames I believe this is a candidate for backporting? (the test case does indeed seem to crash on the release/18.x branch right now).

preames · 2024-02-07T15:11:27Z

@preames I believe this is a candidate for backporting? (the test case does indeed seem to crash on the release/18.x branch right now).

It could be, but I don't know we need to. It was a fuzzer failure, nothing from a real workload.

preames requested review from davemgreen, topperc and wangpc-pp February 2, 2024 20:24

preames requested a review from nikic as a code owner February 2, 2024 20:24

llvmbot added the llvm:transforms label Feb 2, 2024

nikic approved these changes Feb 2, 2024

View reviewed changes

hiraditya requested a review from alexey-bataev February 2, 2024 23:25

preames merged commit 42d6eb5 into llvm:main Feb 3, 2024
4 of 5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MemCpyOpt] Handle scalable aggregate types in memmove/memset formation #80487

[MemCpyOpt] Handle scalable aggregate types in memmove/memset formation #80487

preames commented Feb 2, 2024

llvmbot commented Feb 2, 2024

github-actions bot commented Feb 2, 2024

nikic left a comment

asb commented Feb 7, 2024

preames commented Feb 7, 2024

[MemCpyOpt] Handle scalable aggregate types in memmove/memset formation #80487

[MemCpyOpt] Handle scalable aggregate types in memmove/memset formation #80487

Conversation

preames commented Feb 2, 2024

llvmbot commented Feb 2, 2024

github-actions bot commented Feb 2, 2024

nikic left a comment

Choose a reason for hiding this comment

asb commented Feb 7, 2024

preames commented Feb 7, 2024