[GlobalOpt] Prevent widenDestArray from shrinking an alloca. #144641

topperc · 2025-06-18T05:54:49Z

If the destination alloca for one of the memcpy calls we are
modifying is already larger than our desired size, we shouldn't
replace it with a smaller alloca.
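As a rough illustration of the intended behaviour, here is a minimal standalone sketch of the size decision, written without any LLVM types. The function name `widenedNumElements` and its parameters are hypothetical and only model the arithmetic; `divideCeil` is re-implemented locally to mirror the helper visible in the patch context.

```cpp
#include <cassert>
#include <cstdint>

// Local stand-in for the divideCeil helper used in the patch context.
inline std::uint64_t divideCeil(std::uint64_t Numerator,
                                std::uint64_t Denominator) {
  assert(Denominator != 0 && "element width must be non-zero");
  return (Numerator + Denominator - 1) / Denominator;
}

// Hypothetical model of the guard: returns the element count the destination
// array should end up with. It never returns fewer elements than the
// destination already has.
inline std::uint64_t widenedNumElements(std::uint64_t CurrentNumElements,
                                        std::uint64_t NumBytesToCopy,
                                        std::uint64_t NumBytesToPad,
                                        std::uint64_t ElementByteWidth) {
  std::uint64_t Wanted =
      divideCeil(NumBytesToCopy + NumBytesToPad, ElementByteWidth);
  // Already wide enough: keep the existing size instead of shrinking.
  if (CurrentNumElements >= Wanted)
    return CurrentNumElements;
  return Wanted;
}
```

With the values from the added test (3 bytes copied, 1 byte of padding, 1-byte elements), a 3-element destination grows to 4 elements and a 5-element destination keeps its 5 elements.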


llvmbot · 2025-06-18T05:55:20Z

@llvm/pr-subscribers-llvm-transforms

Author: Craig Topper (topperc)

Changes

If the destination alloca for one of the memcpy calls we are
modifying is already larger than our desired size, we shouldn't
replace it with a smaller alloca.

Full diff: https://github.com/llvm/llvm-project/pull/144641.diff

2 Files Affected:

(modified) llvm/lib/Transforms/IPO/GlobalOpt.cpp (+4)
(added) llvm/test/Transforms/GlobalOpt/ARM/arm-widen-large-alloca.ll (+27)

diff --git a/llvm/lib/Transforms/IPO/GlobalOpt.cpp b/llvm/lib/Transforms/IPO/GlobalOpt.cpp
index 7db0586386506..f0cd00df23959 100644
--- a/llvm/lib/Transforms/IPO/GlobalOpt.cpp
+++ b/llvm/lib/Transforms/IPO/GlobalOpt.cpp
@@ -2103,6 +2103,10 @@ static void widenDestArray(CallInst *CI, const unsigned NumBytesToPad,
     unsigned ElementByteWidth = SourceDataArray->getElementByteSize();
     unsigned int TotalBytes = NumBytesToCopy + NumBytesToPad;
     unsigned NumElementsToCopy = divideCeil(TotalBytes, ElementByteWidth);
+    // Don't change size if already wide enough.
+    if (Alloca->getAllocatedType()->getArrayNumElements() >= NumElementsToCopy)
+      return;
+
     // Update destination array to be word aligned (memcpy(X,...,...))
     IRBuilder<> BuildAlloca(Alloca);
     AllocaInst *NewAlloca = BuildAlloca.CreateAlloca(ArrayType::get(
diff --git a/llvm/test/Transforms/GlobalOpt/ARM/arm-widen-large-alloca.ll b/llvm/test/Transforms/GlobalOpt/ARM/arm-widen-large-alloca.ll
new file mode 100644
index 0000000000000..4fca1ffefdcaf
--- /dev/null
+++ b/llvm/test/Transforms/GlobalOpt/ARM/arm-widen-large-alloca.ll
@@ -0,0 +1,27 @@
+; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --version 5
+; RUN: opt < %s -mtriple=arm-none-eabi -passes=globalopt -S | FileCheck %s
+
+@.i8 = private unnamed_addr constant [3 x i8] [i8 1, i8 2, i8 3] , align 1
+
+define void @memcpy()  {
+; CHECK-LABEL: define void @memcpy() local_unnamed_addr {
+; CHECK-NEXT:  [[ENTRY:.*:]]
+; CHECK-NEXT:    [[ALLOCA1:%.*]] = alloca [4 x i8], align 1
+; CHECK-NEXT:    [[ALLOCA2:%.*]] = alloca [5 x i8], align 1
+; CHECK-NEXT:    [[CALL1:%.*]] = call i32 @bar(ptr nonnull [[ALLOCA1]])
+; CHECK-NEXT:    [[CALL2:%.*]] = call i32 @bar(ptr nonnull [[ALLOCA2]])
+; CHECK-NEXT:    call void @llvm.memcpy.p0.p0.i32(ptr noundef nonnull align 1 dereferenceable(3) [[ALLOCA1]], ptr noundef nonnull align 1 dereferenceable(3) @.i8, i32 4, i1 false)
+; CHECK-NEXT:    call void @llvm.memcpy.p0.p0.i32(ptr noundef nonnull align 1 dereferenceable(5) [[ALLOCA2]], ptr noundef nonnull align 1 dereferenceable(3) @.i8, i32 4, i1 false)
+; CHECK-NEXT:    ret void
+;
+entry:
+  %alloca1 = alloca [3 x i8], align 1
+  %alloca2 = alloca [5 x i8], align 1
+  %call1 = call i32 @bar(ptr nonnull %alloca1)
+  %call2 = call i32 @bar(ptr nonnull %alloca2)
+  call void @llvm.memcpy.p0.p0.i32(ptr noundef nonnull align 1 dereferenceable(3) %alloca1, ptr noundef nonnull align 1 dereferenceable(3) @.i8, i32 3, i1 false)
+  call void @llvm.memcpy.p0.p0.i32(ptr noundef nonnull align 1 dereferenceable(5) %alloca2, ptr noundef nonnull align 1 dereferenceable(3) @.i8, i32 3, i1 false)
+  ret void
+}
+
+declare i32 @bar(...)

nikic

The current implementation is also wrong in the case where one of the memcpys previously copied less than the full size of the global. It gets changed to copy the new size, which is larger and may overwrite other memory.

I believe you need to replicate all the checks from this condition (which is currently only checked for a single alloca, instead of all of them):

llvm-project/llvm/lib/Transforms/IPO/GlobalOpt.cpp

Lines 2185 to 2189 in 4f5b59f

    
    // For safety purposes lets add a constraint and only pad when
    // NumElementsToCopy == destination array size ==
    // source which is a constant
    if (NumElementsToCopy != DZSize || DZSize != SZSize)
      continue;
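To make the overwrite hazard concrete, here is a small standalone sketch, not taken from the LLVM sources. It assumes a 3-byte constant padded to 4 bytes and a 5-byte destination filled with a sentinel value, then counts how many destination bytes differ between the original copy length and the widened one. The function name, the buffer sizes, and the `0xAA` sentinel are all illustrative choices.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Hypothetical model: returns the number of destination bytes whose value
// differs between performing the original copy and the rewritten, longer copy.
inline std::size_t bytesClobberedByWidening(std::size_t OriginalLen,
                                            std::size_t WidenedLen) {
  // A [3 x i8] constant plus one byte of zero padding.
  const std::array<std::uint8_t, 4> PaddedSource = {{1, 2, 3, 0}};
  assert(OriginalLen <= PaddedSource.size() &&
         WidenedLen <= PaddedSource.size());

  // Two identical destinations. 0xAA stands in for data that was already
  // there and does not occur in the source, so every overwrite is visible.
  std::array<std::uint8_t, 5> Before;
  std::array<std::uint8_t, 5> After;
  Before.fill(0xAA);
  After.fill(0xAA);

  std::memcpy(Before.data(), PaddedSource.data(), OriginalLen);
  std::memcpy(After.data(), PaddedSource.data(), WidenedLen);

  std::size_t Clobbered = 0;
  for (std::size_t I = 0; I < Before.size(); ++I)
    if (Before[I] != After[I])
      ++Clobbered;
  return Clobbered;
}
```

In this model, a copy that originally wrote 2 bytes and is rewritten to write 4 changes 2 bytes that the original program never touched.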

nikic · 2025-06-18T08:27:49Z

Actually, that check is also incorrect, because its logic is based on element counts instead of byte sizes.
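One way element counts and byte sizes can disagree is sketched below in plain C++, with no LLVM types involved. `ArrayShape` and the helper functions are hypothetical names introduced only for this illustration; they model an array type as an element count plus an element width in bytes.

```cpp
#include <cstdint>

// Hypothetical model of an array type: element count and element width.
struct ArrayShape {
  std::uint64_t NumElements;
  std::uint64_t ElementBytes;
};

// Total size of the array in bytes.
inline std::uint64_t byteSize(ArrayShape S) {
  return S.NumElements * S.ElementBytes;
}

// Element-based comparison: ignores how wide each element is.
inline bool sameElementCount(ArrayShape A, ArrayShape B) {
  return A.NumElements == B.NumElements;
}

// Byte-based comparison: the quantity a memcpy length is measured in.
inline bool sameByteSize(ArrayShape A, ArrayShape B) {
  return byteSize(A) == byteSize(B);
}
```

For example, a 3-element array of 1-byte elements and a 3-element array of 2-byte elements compare equal by element count but occupy 3 and 6 bytes respectively, so an element-count check can accept a pair whose byte sizes differ.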

nikic · 2025-06-18T08:44:26Z

TBH, the more I look at this code, the more unhappy I become. I submitted a full revert of the GlobalOpt code at #144652.

Partially reverts e37d736. The transform has a number of correctness and code quality issues, and will benefit from a from-scratch re-review more than incremental fixes. The correctness issues are hinted at in #144641, but I think it needs a larger rework to stop working on ArrayTypes and the implementation could use some other improvements (like callInstIsMemcpy should just be `dyn_cast<MemCpyInst>`). I can comment in more detail on a resubmission of the patch.


topperc added 2 commits June 17, 2025 22:35

Pre-commit test

0eebbdc

[GlobalOpt] Preven widenDestArray from shrinking an alloca.

4f5b59f

If the destination alloca for one of the memcpy calls we are modifying is already larger than our desired size, we shouldn't replace it with a smaller alloca.

topperc requested review from nikic and nasherm June 18, 2025 05:54

llvmbot added the llvm:transforms label Jun 18, 2025

topperc changed the title ~~[GlobalOpt] Preven widenDestArray from shrinking an alloca.~~ [GlobalOpt] Prevent widenDestArray from shrinking an alloca. Jun 18, 2025

nikic requested changes Jun 18, 2025

View reviewed changes

nikic mentioned this pull request Jun 18, 2025

[GlobalOpt] Revert global widening transform #144652

Merged

topperc closed this Jun 18, 2025
