[MemoryBuiltins] Cache the result of ObjectOffsetSizeVisitor::visit. #64796 #65326

bevin-hansson · 2023-09-05T13:34:13Z

visit will skip visiting instructions it already has visited
to avoid issues with cycles in the data graph. However,
the result of this skipping behavior is that if we
encounter the same instruction twice, and that instruction
has a well defined result and isn't part of a cycle, we
will introduce unknowns into the analysis even though we
knew the size and offset of the instruction's result.

Instead of skipping such instructions, keep a cache of
the result of visiting them. This result is initialized
to unknown() before visiting, so if we happen to visit
it again recursively (perhaps as the result of a cycle
or a phi), we will get unknown as the cached result and
exit out.

…lvm#64796 visit will skip visiting instructions it already has visited to avoid issues with cycles in the data graph. However, the result of this skipping behavior is that if we encounter the same instruction twice, and that instruction has a well defined result and isn't part of a cycle, we will introduce unknowns into the analysis even though we knew the size and offset of the instruction's result. Instead of skipping such instructions, keep a cache of the result of visiting them. This result is initialized to unknown() before visiting, so if we happen to visit it again recursively (perhaps as the result of a cycle or a phi), we will get unknown as the cached result and exit out.

nikic

LGTM

nikic · 2023-09-15T05:12:31Z

llvm/test/Transforms/LowerConstantIntrinsics/builtin-object-size-phi.ll

@@ -61,3 +61,59 @@ if.end:
  %size = call i64 @llvm.objectsize.i64.p0(ptr %p, i1 true, i1 true, i1 false)
  ret i64 %size
 }
+
+define dso_local i64 @pick_max_same(i32 noundef %n) local_unnamed_addr {


Drop dso_local, noundef, local_unnamed_addr, they should not be relevant.

llvmbot · 2023-09-15T06:53:46Z

@llvm/pr-subscribers-llvm-analysis

@llvm/pr-subscribers-llvm-transforms

Changes

visit will skip visiting instructions it already has visited to avoid issues with cycles in the data graph. However, the result of this skipping behavior is that if we encounter the same instruction twice, and that instruction has a well defined result and isn't part of a cycle, we will introduce unknowns into the analysis even though we knew the size and offset of the instruction's result.

Instead of skipping such instructions, keep a cache of
the result of visiting them. This result is initialized
to unknown() before visiting, so if we happen to visit
it again recursively (perhaps as the result of a cycle
or a phi), we will get unknown as the cached result and
exit out.

--
Full diff: https://github.com/llvm/llvm-project/pull/65326.diff

3 Files Affected:

(modified) llvm/include/llvm/Analysis/MemoryBuiltins.h (+1-1)
(modified) llvm/lib/Analysis/MemoryBuiltins.cpp (+8-4)
(modified) llvm/test/Transforms/LowerConstantIntrinsics/builtin-object-size-phi.ll (+56)

diff --git a/llvm/include/llvm/Analysis/MemoryBuiltins.h b/llvm/include/llvm/Analysis/MemoryBuiltins.h
index 711bbf6a0afe5f6..66d1885b92a4905 100644
--- a/llvm/include/llvm/Analysis/MemoryBuiltins.h
+++ b/llvm/include/llvm/Analysis/MemoryBuiltins.h
@@ -198,7 +198,7 @@ class ObjectSizeOffsetVisitor
   ObjectSizeOpts Options;
   unsigned IntTyBits;
   APInt Zero;
-  SmallPtrSet&lt;Instruction *, 8&gt; SeenInsts;
+  DenseMap&lt;Instruction *, SizeOffsetType&gt; SeenInsts;
 
   APInt align(APInt Size, MaybeAlign Align);
 
diff --git a/llvm/lib/Analysis/MemoryBuiltins.cpp b/llvm/lib/Analysis/MemoryBuiltins.cpp
index 53e089ba1feae57..cacebd987f307f1 100644
--- a/llvm/lib/Analysis/MemoryBuiltins.cpp
+++ b/llvm/lib/Analysis/MemoryBuiltins.cpp
@@ -733,10 +733,14 @@ SizeOffsetType ObjectSizeOffsetVisitor::computeImpl(Value *V) {
   if (Instruction *I = dyn_cast&lt;Instruction&gt;(V)) {
     // If we have already seen this instruction, bail out. Cycles can happen in
     // unreachable code after constant propagation.
-    if (!SeenInsts.insert(I).second)
-      return unknown();
-
-    return visit(*I);
+    auto P = SeenInsts.try_emplace(I, unknown());
+    if (!P.second)
+      return P.first-&gt;second;
+    SizeOffsetType Res = visit(*I);
+    // Cache the result for later visits. If we happened to visit this during
+    // the above recursion, we would consider it unknown until now.
+    SeenInsts[I] = Res;
+    return Res;
   }
   if (Argument *A = dyn_cast&lt;Argument&gt;(V))
     return visitArgument(*A);
diff --git a/llvm/test/Transforms/LowerConstantIntrinsics/builtin-object-size-phi.ll b/llvm/test/Transforms/LowerConstantIntrinsics/builtin-object-size-phi.ll
index 7937265a69afe1e..4f4d6a88e1693be 100644
--- a/llvm/test/Transforms/LowerConstantIntrinsics/builtin-object-size-phi.ll
+++ b/llvm/test/Transforms/LowerConstantIntrinsics/builtin-object-size-phi.ll
@@ -61,3 +61,59 @@ if.end:
   %size = call i64 @llvm.objectsize.i64.p0(ptr %p, i1 true, i1 true, i1 false)
   ret i64 %size
 }
+
+define i64 @pick_max_same(i32 %n) {
+; CHECK-LABEL: @pick_max_same(
+; CHECK-NEXT:  entry:
+; CHECK-NEXT:    [[BUFFER:%.*]] = alloca i8, i64 20, align 1
+; CHECK-NEXT:    [[COND:%.*]] = icmp eq i32 [[N:%.*]], 0
+; CHECK-NEXT:    br i1 [[COND]], label [[IF_ELSE:%.*]], label [[IF_END:%.*]]
+; CHECK:       if.else:
+; CHECK-NEXT:    [[OFFSETED:%.*]] = getelementptr i8, ptr [[BUFFER]], i64 10
+; CHECK-NEXT:    br label [[IF_END]]
+; CHECK:       if.end:
+; CHECK-NEXT:    [[P:%.*]] = phi ptr [ [[OFFSETED]], [[IF_ELSE]] ], [ [[BUFFER]], [[ENTRY:%.*]] ]
+; CHECK-NEXT:    ret i64 20
+;
+entry:
+  %buffer = alloca i8, i64 20
+  %cond = icmp eq i32 %n, 0
+  br i1 %cond, label %if.else, label %if.end
+
+if.else:
+  %offseted = getelementptr i8, ptr %buffer, i64 10
+  br label %if.end
+
+if.end:
+  %p = phi ptr [ %offseted, %if.else ], [ %buffer, %entry ]
+  %size = call i64 @llvm.objectsize.i64.p0(ptr %p, i1 false, i1 true, i1 false)
+  ret i64 %size
+}
+
+define i64 @pick_min_same(i32 %n) {
+; CHECK-LABEL: @pick_min_same(
+; CHECK-NEXT:  entry:
+; CHECK-NEXT:    [[BUFFER:%.*]] = alloca i8, i64 20, align 1
+; CHECK-NEXT:    [[COND:%.*]] = icmp eq i32 [[N:%.*]], 0
+; CHECK-NEXT:    br i1 [[COND]], label [[IF_ELSE:%.*]], label [[IF_END:%.*]]
+; CHECK:       if.else:
+; CHECK-NEXT:    [[OFFSETED:%.*]] = getelementptr i8, ptr [[BUFFER]], i64 10
+; CHECK-NEXT:    br label [[IF_END]]
+; CHECK:       if.end:
+; CHECK-NEXT:    [[P:%.*]] = phi ptr [ [[OFFSETED]], [[IF_ELSE]] ], [ [[BUFFER]], [[ENTRY:%.*]] ]
+; CHECK-NEXT:    ret i64 10
+;
+entry:
+  %buffer = alloca i8, i64 20
+  %cond = icmp eq i32 %n, 0
+  br i1 %cond, label %if.else, label %if.end
+
+if.else:
+  %offseted = getelementptr i8, ptr %buffer, i64 10
+  br label %if.end
+
+if.end:
+  %p = phi ptr [ %offseted, %if.else ], [ %buffer, %entry ]
+  %size = call i64 @llvm.objectsize.i64.p0(ptr %p, i1 true, i1 true, i1 false)
+  ret i64 %size
+}

This partially recovers a major compile-time regression introduced by #65326.

nikic · 2023-09-15T12:27:54Z

This change caused a large compile-time regression. I've mostly mitigated this with c0a64ec, but there is still some residual regression: http://llvm-compile-time-tracker.com/compare.php?from=0a692b6b9632e1460f9e0e983196f2be5879acd1&to=0bf8763781fa68fa63ee8c1f0d9f6040df97483c&stat=instructions%3Au

bevin-hansson · 2023-09-15T12:44:25Z

That's pretty unfortunate. I'm not sure what more can be done about it, it's bound to iterate further since it's given the chance to.

nikic · 2023-09-15T12:50:53Z

getObjectSize() is almost always called on root instructions (identified objects like allocas, globals) -- actually using it with objectsize intrinsics is rare. I expect this is the additional overhead of the map, not the extra iteration.

bevin-hansson · 2023-09-15T12:56:55Z

Aha, alright. I suppose it makes sense, the tests are probably not built with something like _FORTIFY_SOURCE that would use the intrinsic.

Is the remaining regression acceptable?

…lvm#64796 (llvm#65326) visit will skip visiting instructions it already has visited to avoid issues with cycles in the data graph. However, the result of this skipping behavior is that if we encounter the same instruction twice, and that instruction has a well defined result and isn't part of a cycle, we will introduce unknowns into the analysis even though we knew the size and offset of the instruction's result. Instead of skipping such instructions, keep a cache of the result of visiting them. This result is initialized to unknown() before visiting, so if we happen to visit it again recursively (perhaps as the result of a cycle or a phi), we will get unknown as the cached result and exit out.

This partially recovers a major compile-time regression introduced by llvm#65326.

bevin-hansson requested a review from a team as a code owner September 5, 2023 13:34

bevin-hansson requested a review from nikic September 5, 2023 14:00

nikic approved these changes Sep 15, 2023

View reviewed changes

Remove unnecessary attributes.

9cb6e82

llvmbot added llvm:analysis llvm:transforms labels Sep 15, 2023

bevin-hansson merged commit e412697 into llvm:main Sep 15, 2023
4 checks passed

nikic added a commit that referenced this pull request Sep 15, 2023

[MemoryBuiltins] Use SmallDenseMap for visited map (NFC)

c0a64ec

This partially recovers a major compile-time regression introduced by #65326.

ZijunZhaoCCK pushed a commit to ZijunZhaoCCK/llvm-project that referenced this pull request Sep 19, 2023

[MemoryBuiltins] Use SmallDenseMap for visited map (NFC)

8efed72

This partially recovers a major compile-time regression introduced by llvm#65326.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MemoryBuiltins] Cache the result of ObjectOffsetSizeVisitor::visit. #64796 #65326

[MemoryBuiltins] Cache the result of ObjectOffsetSizeVisitor::visit. #64796 #65326

bevin-hansson commented Sep 5, 2023

nikic left a comment

nikic Sep 15, 2023

llvmbot commented Sep 15, 2023 •

edited

nikic commented Sep 15, 2023

bevin-hansson commented Sep 15, 2023

nikic commented Sep 15, 2023

bevin-hansson commented Sep 15, 2023

[MemoryBuiltins] Cache the result of ObjectOffsetSizeVisitor::visit. #64796 #65326

[MemoryBuiltins] Cache the result of ObjectOffsetSizeVisitor::visit. #64796 #65326

Conversation

bevin-hansson commented Sep 5, 2023

nikic left a comment

Choose a reason for hiding this comment

nikic Sep 15, 2023

Choose a reason for hiding this comment

llvmbot commented Sep 15, 2023 • edited

nikic commented Sep 15, 2023

bevin-hansson commented Sep 15, 2023

nikic commented Sep 15, 2023

bevin-hansson commented Sep 15, 2023

llvmbot commented Sep 15, 2023 •

edited