[LoopPredication] Fix division by zero in case of zero branch weights #66506

danilaml · 2023-09-15T12:47:45Z

Treat the case where all branch weights are zero as if there was no profile.
Fixes #66382

Should be in line with the approach taken by BPI:

llvm-project/llvm/lib/Analysis/BranchProbabilityInfo.cpp

Lines 430 to 434 in 7472490

    
           if (WeightSum == 0 || ReachableIdxs.size() == 0) { 
        
             for (unsigned I = 0, E = TI->getNumSuccessors(); I != E; ++I) 
        
               Weights[I] = 1; 
        
             WeightSum = TI->getNumSuccessors(); 
        
           }

llvmbot · 2023-09-15T12:48:46Z

@llvm/pr-subscribers-llvm-transforms

Changes

Treat the case where all branch weights are zero as if there was no profile. Fixes #66382

Should be in line with the approach taken by BPI:

llvm-project/llvm/lib/Analysis/BranchProbabilityInfo.cpp

Lines 430 to 434 in 7472490

if (WeightSum == 0 || ReachableIdxs.size() == 0) {

for (unsigned I = 0, E = TI->getNumSuccessors(); I != E; ++I)

Weights[I] = 1;

WeightSum = TI->getNumSuccessors();

}

Full diff: https://github.com/llvm/llvm-project/pull/66506.diff

2 Files Affected:

(modified) llvm/lib/Transforms/Scalar/LoopPredication.cpp (+3)
(modified) llvm/test/Transforms/LoopPredication/pr66382.ll (+20-1)

diff --git a/llvm/lib/Transforms/Scalar/LoopPredication.cpp b/llvm/lib/Transforms/Scalar/LoopPredication.cpp
index a58ab093a1f75d3..55079b4a42d2fae 100644
--- a/llvm/lib/Transforms/Scalar/LoopPredication.cpp
+++ b/llvm/lib/Transforms/Scalar/LoopPredication.cpp
@@ -967,6 +967,9 @@ bool LoopPredication::isLoopProfitableToPredicate() {
           Numerator += Weight;
         Denominator += Weight;
       }
+      // If all weights are zero act as if there was no profile data
+      if (Denominator == 0)
+        return BranchProbability::getBranchProbability(1, NumSucc);
       return BranchProbability::getBranchProbability(Numerator, Denominator);
     } else {
       assert(LatchBlock != ExitingBlock &amp;&amp;
diff --git a/llvm/test/Transforms/LoopPredication/pr66382.ll b/llvm/test/Transforms/LoopPredication/pr66382.ll
index 3ac4cac0615f464..f9a14d470453cf0 100644
--- a/llvm/test/Transforms/LoopPredication/pr66382.ll
+++ b/llvm/test/Transforms/LoopPredication/pr66382.ll
@@ -1,4 +1,4 @@
-; XFAIL: *
+; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --version 3
 ; RUN: opt -S -loop-predication-skip-profitability-checks=false -passes=&#x27;require&lt;scalar-evolution&gt;,loop-mssa(loop-predication)&#x27; %s | FileCheck %s
 
 target triple = &quot;x86_64-unknown-linux-gnu&quot;
@@ -6,7 +6,26 @@ target triple = &quot;x86_64-unknown-linux-gnu&quot;
 ; Function Attrs: nocallback nofree nosync willreturn
 declare void @llvm.experimental.guard(i1, ...) #0
 
+; Check that LoopPredication doesn&#x27;t crash on all-zero branch weights
 define void @foo() {
+; CHECK-LABEL: define void @foo() {
+; CHECK-NEXT:  entry:
+; CHECK-NEXT:    br label [[HEADER:%.*]]
+; CHECK:       Header:
+; CHECK-NEXT:    [[J2:%.*]] = phi i64 [ 0, [[ENTRY:%.*]] ], [ [[J_NEXT:%.*]], [[LATCH:%.*]] ]
+; CHECK-NEXT:    call void (i1, ...) @llvm.experimental.guard(i1 false, i32 0) [ &quot;deopt&quot;() ]
+; CHECK-NEXT:    [[J_NEXT]] = add i64 [[J2]], 1
+; CHECK-NEXT:    br i1 false, label [[LATCH]], label [[EXIT:%.*]]
+; CHECK:       Latch:
+; CHECK-NEXT:    [[SPECULATE_TRIP_COUNT:%.*]] = icmp ult i64 [[J2]], 0
+; CHECK-NEXT:    br i1 [[SPECULATE_TRIP_COUNT]], label [[HEADER]], label [[COMMON_RET_LOOPEXIT:%.*]], !prof [[PROF0:![0-9]+]]
+; CHECK:       common.ret.loopexit:
+; CHECK-NEXT:    br label [[COMMON_RET:%.*]]
+; CHECK:       common.ret:
+; CHECK-NEXT:    ret void
+; CHECK:       exit:
+; CHECK-NEXT:    br label [[COMMON_RET]]
+;
 entry:
   br label %Header

dexonsmith

It looks to me like the test case could probably be reduced. E.g., is the call to @llvm.experimental.guard necessary to trigger the bug? Do you need all of the attributes? Can you reduce the number of basic blocks?

I can't add comments to most of the file though because you've already committed an XFAILed version. (Since this is an assertion failure, does it even fail consistently when assertions are off? I could imagine it XPASS-ing on a non-assertion bot.)

dexonsmith · 2023-09-15T14:18:12Z

llvm/test/Transforms/LoopPredication/pr66382.ll

@@ -1,12 +1,31 @@
-; XFAIL: *


I don't think we usually land failing tests in tree (unless policies have changed? I'm not doing a ton of reviewing these days...). This makes it a bit harder to comment on them in the review.

I've seen it done before. Regarding assertions - without them the crash would be just division by zero. Otherwise, the buildbots would complain about XFAIL passing.

dexonsmith · 2023-09-15T14:43:50Z

llvm/test/Transforms/LoopPredication/pr66382.ll

+; CHECK-LABEL: define void @foo() {
+; CHECK-NEXT:  entry:
+; CHECK-NEXT:    br label [[HEADER:%.*]]
+; CHECK:       Header:
+; CHECK-NEXT:    [[J2:%.*]] = phi i64 [ 0, [[ENTRY:%.*]] ], [ [[J_NEXT:%.*]], [[LATCH:%.*]] ]
+; CHECK-NEXT:    call void (i1, ...) @llvm.experimental.guard(i1 false, i32 0) [ "deopt"() ]
+; CHECK-NEXT:    [[J_NEXT]] = add i64 [[J2]], 1
+; CHECK-NEXT:    br i1 false, label [[LATCH]], label [[EXIT:%.*]]
+; CHECK:       Latch:
+; CHECK-NEXT:    [[SPECULATE_TRIP_COUNT:%.*]] = icmp ult i64 [[J2]], 0
+; CHECK-NEXT:    br i1 [[SPECULATE_TRIP_COUNT]], label [[HEADER]], label [[COMMON_RET_LOOPEXIT:%.*]], !prof [[PROF0:![0-9]+]]
+; CHECK:       common.ret.loopexit:
+; CHECK-NEXT:    br label [[COMMON_RET:%.*]]
+; CHECK:       common.ret:
+; CHECK-NEXT:    ret void
+; CHECK:       exit:
+; CHECK-NEXT:    br label [[COMMON_RET]]
+;


It's not clear to me if this is checking anything relevant about the expected output of the pass, in the context of the interpretation of the branch weights as an even distribution. If it is, can you explain?

If not, is there a way to do that? Maybe you can observe somehow that the branch weights are correctly interpreted as "even" by looking at DEBUG output (the DEBUG_TYPE for this pass is loop-predication), or maybe STATISTIC?

For example, I see that there's a command-line option -loop-predication-latch-probability-scale, which is a scaling factor applied to the latch probability. This affects how the branch weights are used, ultimately changing the return of LoopPredication::isLoopProfitableToPredicate. Can you construct a test case where, if the branch probability is even (the correct interpretation of branch_weights of 0), then two RUN lines with different -loop-predication-latch-probability-scale will give you different output for STATISTIC and/or DEBUG? If so, then we could have see CHECK lines on the STATISTIC/DEBUG output.

Right now, it just checks that there are simply no crashes. experimental_guard is required otherwise the LoopPredication pass would not run

// There is nothing to do if the module doesn't use guards auto *GuardDecl = M->getFunction(Intrinsic::getName(Intrinsic::experimental_guard)); bool HasIntrinsicGuards = GuardDecl && !GuardDecl->use_empty(); auto *WCDecl = M->getFunction( Intrinsic::getName(Intrinsic::experimental_widenable_condition)); bool HasWidenableConditions = PredicateWidenableBranchGuards && WCDecl && !WCDecl->use_empty(); if (!HasIntrinsicGuards && !HasWidenableConditions) return false;

The test was reduced with bugpoint and llvm-reduce, so I don't know if it can be meaningfully reduced further (also why it doesn't really do much if there is no crash).

Trying to make it output something more meaningful would likely make the test bigger (and would require assertions).

Treat the case where all branch weights are zero as if there was no profile. Fixes llvm#66382

danilaml · 2023-09-18T15:35:17Z

@dexonsmith I've added a test that would test that zero weights would be treated as if there was no profile, and it also tests scale factor as a byproduct. I left the original test case since I think it's worth having a small reduced regression test for the original issue.

dexonsmith

LGTM!

…llvm#66506) Treat the case where all branch weights are zero as if there was no profile. Fixes llvm#66382

danilaml requested review from dexonsmith, annamthomas and aleks-tmb September 15, 2023 12:47

danilaml self-assigned this Sep 15, 2023

llvmbot added the llvm:transforms label Sep 15, 2023

llvm deleted a comment from llvmbot Sep 15, 2023

dexonsmith reviewed Sep 15, 2023

View reviewed changes

[LoopPredication] Fix division by zero in case of zero branch weights

9b7e1ba

Treat the case where all branch weights are zero as if there was no profile. Fixes llvm#66382

danilaml force-pushed the fix-zero-branchweights branch from 03614b0 to 9b7e1ba Compare September 18, 2023 15:31

dexonsmith self-requested a review September 19, 2023 00:15

dexonsmith approved these changes Sep 19, 2023

View reviewed changes

danilaml merged commit a668c0f into llvm:main Sep 19, 2023
2 checks passed

danilaml deleted the fix-zero-branchweights branch September 19, 2023 01:38

This was referenced Sep 19, 2023

[SelectionDAG] [NFC] Add pre-commit test for PR66701. srpande/llvm-project#2

Closed

[SelectionDAG] [NFC] Add pre-commit test for PR66701. srpande/llvm-project#3

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[LoopPredication] Fix division by zero in case of zero branch weights #66506

[LoopPredication] Fix division by zero in case of zero branch weights #66506

danilaml commented Sep 15, 2023

llvmbot commented Sep 15, 2023

dexonsmith left a comment

dexonsmith Sep 15, 2023

danilaml Sep 15, 2023

dexonsmith Sep 15, 2023

danilaml Sep 15, 2023

danilaml commented Sep 18, 2023

dexonsmith left a comment

	if (WeightSum == 0 \|\| ReachableIdxs.size() == 0) {
	for (unsigned I = 0, E = TI->getNumSuccessors(); I != E; ++I)
	Weights[I] = 1;
	WeightSum = TI->getNumSuccessors();
	}

[LoopPredication] Fix division by zero in case of zero branch weights #66506

[LoopPredication] Fix division by zero in case of zero branch weights #66506

Conversation

danilaml commented Sep 15, 2023

llvmbot commented Sep 15, 2023

dexonsmith left a comment

Choose a reason for hiding this comment

dexonsmith Sep 15, 2023

Choose a reason for hiding this comment

danilaml Sep 15, 2023

Choose a reason for hiding this comment

dexonsmith Sep 15, 2023

Choose a reason for hiding this comment

danilaml Sep 15, 2023

Choose a reason for hiding this comment

danilaml commented Sep 18, 2023

dexonsmith left a comment

Choose a reason for hiding this comment