[AggressiveInstCombine] Ignore debug instructions when load combining #70200

mikaelholmen · 2023-10-25T12:06:38Z

Previously, debug instructions were counted against the instruction scan
limit when looking for loads to combine. As a result, the presence of debug
instructions could affect optimization, as shown in the updated testcase.

This fixes #69925.
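
The effect can be sketched with a small standalone model. This is only an illustration, not the LLVM code: ToyInst, canScan and checkToyExample are hypothetical names, and the sketch assumes the scanned range starts at the first load and stops just before the second one. With a budget of 1, a single debug instruction between the loads is enough to exhaust the budget unless it is skipped.

```cpp
#include <cassert>
#include <vector>

// Toy stand-in for an IR instruction; this is NOT llvm::Instruction.
struct ToyInst {
  bool IsDebug;          // models a dbg.value-style intrinsic
  bool ClobbersLocation; // models a write that may modify the loaded memory
};

// Returns true if the whole range can be scanned within the budget without
// meeting a clobbering write. SkipDebug = true models the behaviour after
// the fix, where debug instructions are not counted against the budget.
bool canScan(const std::vector<ToyInst> &Range, unsigned MaxInstrsToScan,
             bool SkipDebug) {
  unsigned NumScanned = 0;
  for (const ToyInst &Inst : Range) {
    if (Inst.ClobbersLocation)
      return false;
    if (SkipDebug && Inst.IsDebug)
      continue; // not counted against the budget
    if (++NumScanned > MaxInstrsToScan)
      return false;
  }
  return true;
}

// Usage example: the first load, optionally followed by one debug
// instruction, scanned with a budget of 1.
void checkToyExample() {
  const std::vector<ToyInst> WithDbg = {{false, false}, {true, false}};
  const std::vector<ToyInst> NoDbg = {{false, false}};

  assert(canScan(NoDbg, 1, /*SkipDebug=*/false));    // no debug info: fits
  assert(!canScan(WithDbg, 1, /*SkipDebug=*/false)); // old counting: gives up
  assert(canScan(WithDbg, 1, /*SkipDebug=*/true));   // new counting: fits
}
```

In this model, the old counting gives different answers for the same code with and without the debug instruction, which is the kind of difference the testcase checks for.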

llvmbot · 2023-10-25T12:07:51Z

@llvm/pr-subscribers-llvm-transforms

Author: None (mikaelholmen)

Changes

Full diff: https://github.com/llvm/llvm-project/pull/70200.diff

2 Files Affected:

(modified) llvm/lib/Transforms/AggressiveInstCombine/AggressiveInstCombine.cpp (+10-1)
(added) llvm/test/Transforms/AggressiveInstCombine/AArch64/combine_ignore_debug.ll (+51)

diff --git a/llvm/lib/Transforms/AggressiveInstCombine/AggressiveInstCombine.cpp b/llvm/lib/Transforms/AggressiveInstCombine/AggressiveInstCombine.cpp
index a55d01645f10eb8..72f55e237ca9151 100644
--- a/llvm/lib/Transforms/AggressiveInstCombine/AggressiveInstCombine.cpp
+++ b/llvm/lib/Transforms/AggressiveInstCombine/AggressiveInstCombine.cpp
@@ -701,12 +701,21 @@ static bool foldLoadsRecursive(Value *V, LoadOps &LOps, const DataLayout &DL,
       Loc = Loc.getWithNewSize(LOps.LoadSize);
   } else
     Loc = MemoryLocation::get(End);
+
+  // Ignore debug info (and other "AssumeLike" intrinsics) so that's not counted
+  // against MaxInstrsToScan. Otherwise debug info could affect codegen.
+  auto IsAssumeLikeIntr = [](const Instruction &I) {
+    if (auto *II = dyn_cast<IntrinsicInst>(&I))
+      return II->isAssumeLikeIntrinsic();
+    return false;
+  };
   unsigned NumScanned = 0;
   for (Instruction &Inst :
        make_range(Start->getIterator(), End->getIterator())) {
     if (Inst.mayWriteToMemory() && isModSet(AA.getModRefInfo(&Inst, Loc)))
       return false;
-    if (++NumScanned > MaxInstrsToScan)
+
+    if (!IsAssumeLikeIntr(Inst) && ++NumScanned > MaxInstrsToScan)
       return false;
   }
 
diff --git a/llvm/test/Transforms/AggressiveInstCombine/AArch64/combine_ignore_debug.ll b/llvm/test/Transforms/AggressiveInstCombine/AArch64/combine_ignore_debug.ll
new file mode 100644
index 000000000000000..68455a1f9074ecb
--- /dev/null
+++ b/llvm/test/Transforms/AggressiveInstCombine/AArch64/combine_ignore_debug.ll
@@ -0,0 +1,51 @@
+; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --version 3
+; RUN: opt -mtriple aarch64 -aggressive-instcombine-max-scan-instrs=1 -passes="aggressive-instcombine" -S < %s | FileCheck %s -check-prefix DBG
+; RUN: opt -strip-debug -mtriple aarch64 -aggressive-instcombine-max-scan-instrs=1 -passes="aggressive-instcombine" -S < %s | FileCheck %s -check-prefix NODBG
+
+; The DBG and NODBG cases should be the same. I.e. we should optimize the DBG
+; case too even if there is a dbg.value.
+
+target datalayout = "E"
+
+%s = type { i16, i16 }
+
+@e = global %s zeroinitializer, align 1
+@l = global %s zeroinitializer, align 1
+
+define void @test() {
+; DBG-LABEL: define void @test() {
+; DBG-NEXT:  entry:
+; DBG-NEXT:    [[L1:%.*]] = load i32, ptr @e, align 1
+; DBG-NEXT:    call void @llvm.dbg.value(metadata i32 undef, metadata [[META3:![0-9]+]], metadata !DIExpression()), !dbg [[DBG5:![0-9]+]]
+; DBG-NEXT:    store i32 [[L1]], ptr @l, align 1
+; DBG-NEXT:    ret void
+;
+; NODBG-LABEL: define void @test() {
+; NODBG-NEXT:  entry:
+; NODBG-NEXT:    [[L1:%.*]] = load i32, ptr @e, align 1
+; NODBG-NEXT:    store i32 [[L1]], ptr @l, align 1
+; NODBG-NEXT:    ret void
+;
+entry:
+  %l1 = load i16, ptr @e, align 1
+  call void @llvm.dbg.value(metadata i32 undef, metadata !3, metadata !DIExpression()), !dbg !5
+  %l2 = load i16, ptr getelementptr inbounds (%s, ptr @e, i16 0, i32 1), align 1
+  %e2 = zext i16 %l2 to i32
+  %e1 = zext i16 %l1 to i32
+  %s1 = shl nuw i32 %e1, 16
+  %o1 = or i32 %s1, %e2
+  store i32 %o1, ptr @l, align 1
+  ret void
+}
+
+declare void @llvm.dbg.value(metadata, metadata, metadata)
+
+!llvm.dbg.cu = !{!0}
+!llvm.module.flags = !{!2}
+
+!0 = distinct !DICompileUnit(language: DW_LANG_C11, file: !1)
+!1 = !DIFile(filename: "foo.c", directory: "/")
+!2 = !{i32 2, !"Debug Info Version", i32 3}
+!3 = !DILocalVariable(scope: !4)
+!4 = distinct !DISubprogram(unit: !0)
+!5 = !DILocation(scope: !4)

nikic · 2023-10-25T13:08:26Z

llvm/lib/Transforms/AggressiveInstCombine/AggressiveInstCombine.cpp

+  // against MaxInstrsToScan. Otherwise debug info could affect codegen.
+  auto IsAssumeLikeIntr = [](const Instruction &I) {
+    if (auto *II = dyn_cast<IntrinsicInst>(&I))
+      return II->isAssumeLikeIntrinsic();


Please check for DbgInfoIntrinsic only.

Sure, I can do that. I went for isAssumeLikeIntrinsic because, on a previous similar fix I made in SLPVectorizer, I initially checked only for debug intrinsics but was then asked to check for isAssumeLikeIntrinsic instead.
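
The difference between the two checks discussed here can be sketched with a toy classification. This is an assumption-laden model rather than LLVM's API: ToyKind, isAssumeLikeToy and isDebugOnlyToy are hypothetical names, and the real isAssumeLikeIntrinsic() covers more intrinsics than the few listed. The point is only that the assume-like set is a superset of the debug intrinsics, so the narrower check skips fewer instructions.

```cpp
#include <cassert>

// Hypothetical, simplified instruction kinds; LLVM's real set is larger.
enum class ToyKind {
  NotIntrinsic,
  DbgValue,
  DbgDeclare,
  Assume,
  LifetimeStart,
  MemCpy
};

// Models the narrower check requested in review: debug intrinsics only.
bool isDebugOnlyToy(ToyKind K) {
  return K == ToyKind::DbgValue || K == ToyKind::DbgDeclare;
}

// Models the broader check used in the first version of the patch:
// debug intrinsics plus other assume-like intrinsics.
bool isAssumeLikeToy(ToyKind K) {
  return isDebugOnlyToy(K) || K == ToyKind::Assume ||
         K == ToyKind::LifetimeStart;
}

// Usage example: both checks skip a debug intrinsic, but only the broader
// one skips an assume.
void checkToyPredicates() {
  assert(isDebugOnlyToy(ToyKind::DbgValue));
  assert(isAssumeLikeToy(ToyKind::DbgValue));
  assert(!isDebugOnlyToy(ToyKind::Assume));
  assert(isAssumeLikeToy(ToyKind::Assume));
}
```

Either check removes the dependence on debug info; the narrower one simply keeps counting non-debug intrinsics against the scan limit.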

nikic

LGTM

mikaelholmen · 2023-10-26T05:23:18Z

Thanks for the review.

A question: I'd prefer to submit the precommit testcase and the actual fix as two separate commits. How do I go about doing that now that I have both commits in the same pull request? Should I just go to my repo and git push the precommit, then rebase this PR (which would then contain just the fix), and then "Squash and merge"?
Or should I not bother and just "Squash and merge" so that everything lands in the same commit?

nikic · 2023-10-26T07:34:35Z

See https://discourse.llvm.org/t/how-to-enable-the-function-rebase-and-merge/73990/3?u=nikic for a way to land multi-commit PRs in a way that GitHub understands.

mikaelholmen · 2023-10-26T07:38:04Z

Thanks!

llvmbot added the llvm:transforms label Oct 25, 2023

mikaelholmen requested review from bipmis and davemgreen October 25, 2023 12:09

nikic reviewed Oct 25, 2023

nikic approved these changes Oct 25, 2023

mikaelholmen added 2 commits October 26, 2023 09:58

[test][AggressiveInstCombine] Precommit testcase for llvm#69925

34fe8be

We get different results with/without debug info present.

mikaelholmen force-pushed the aic_debug branch from c4f8989 to ce0a750 Compare October 26, 2023 08:00

mikaelholmen merged commit ce0a750 into llvm:main Oct 26, 2023
2 of 3 checks passed
