[LV] Compute SCEV for memcheck before unlinking #160326

igogo-x86 · 2025-09-23T15:35:59Z

When generating runtime memory checks for outer loops, we split blocks and later unlink them, making the memcheck block unreachable. For instructions in unreachable blocks, ScalarEvolution returns an unknown/poison SCEV, which is treated as a constant and thus loop-invariant. The cost model then assumes the check can be hoisted and underestimates its cost. See this code in GeneratedRTChecks::getCost:

        const SCEV *Cond = SE->getSCEV(MemRuntimeCheckCond);
        if (SE->isLoopInvariant(Cond, OuterLoop)) {

Set OuterLoop early in GeneratedRTChecks::create and compute the SCEV for MemRuntimeCheckCond before unlinking, so getCost() sees the cached expression rather than a poison constant.

When generating runtime memory checks for outer loops, we split blocks and later unlink them, making the memcheck block unreachable. For instructions in unreachable blocks, ScalarEvolution returns an unknown/poison SCEV, which is treated as a constant and thus loop-invariant. The cost model then assumes the check can be hoisted and underestimates its cost. Set OuterLoop early in GeneratedRTChecks::create and compute the SCEV for MemRuntimeCheckCond before unlinking, so getCost() sees the cached expression rather than a poison constant.

llvmbot · 2025-09-23T15:36:36Z

@llvm/pr-subscribers-llvm-transforms

@llvm/pr-subscribers-vectorizers

Author: Igor Kirillov (igogo-x86)

Changes

When generating runtime memory checks for outer loops, we split blocks and later unlink them, making the memcheck block unreachable. For instructions in unreachable blocks, ScalarEvolution returns an unknown/poison SCEV, which is treated as a constant and thus loop-invariant. The cost model then assumes the check can be hoisted and underestimates its cost. See this code in GeneratedRTChecks::getCost:

        const SCEV *Cond = SE-&gt;getSCEV(MemRuntimeCheckCond);
        if (SE-&gt;isLoopInvariant(Cond, OuterLoop)) {

Set OuterLoop early in GeneratedRTChecks::create and compute the SCEV for MemRuntimeCheckCond before unlinking, so getCost() sees the cached expression rather than a poison constant.

Full diff: https://github.com/llvm/llvm-project/pull/160326.diff

1 Files Affected:

(modified) llvm/lib/Transforms/Vectorize/LoopVectorize.cpp (+9-2)

diff --git a/llvm/lib/Transforms/Vectorize/LoopVectorize.cpp b/llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
index ca092dcfcb492..575f45b051cf6 100644
--- a/llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
+++ b/llvm/lib/Transforms/Vectorize/LoopVectorize.cpp
@@ -1807,6 +1807,10 @@ class GeneratedRTChecks {
     BasicBlock *LoopHeader = L->getHeader();
     BasicBlock *Preheader = L->getLoopPreheader();
 
+    // Outer loop is used as part of later cost calculations (e.g. to
+    // determine if runtime checks are loop-invariant and can be hoisted).
+    OuterLoop = L->getParentLoop();
+
     // Use SplitBlock to create blocks for SCEV & memory runtime checks to
     // ensure the blocks are properly added to LoopInfo & DominatorTree. Those
     // may be used by SCEVExpander. The blocks will be un-linked from their
@@ -1850,6 +1854,11 @@ class GeneratedRTChecks {
       assert(MemRuntimeCheckCond &&
              "no RT checks generated although RtPtrChecking "
              "claimed checks are required");
+      // Compute SCEV while the block is reachable.
+      // After unlinking, SCEV returns unknown/poison (constant -> invariant),
+      // which makes getCost() wrongly discount hoisted checks.
+      if (OuterLoop)
+        PSE.getSE()->getSCEV(MemRuntimeCheckCond);
     }
 
     SCEVExp.eraseDeadInstructions(SCEVCheckCond);
@@ -1889,8 +1898,6 @@ class GeneratedRTChecks {
       LI->removeBlock(SCEVCheckBlock);
     }
 
-    // Outer loop is used as part of the later cost calculations.
-    OuterLoop = L->getParentLoop();
   }
 
   InstructionCost getCost() {

igogo-x86 · 2025-09-23T15:37:26Z

@david-arm Also worth checking if this patch doesn't open up the regression fixed by #160326

github-actions · 2025-09-23T15:39:59Z

✅ With the latest revision this PR passed the C/C++ code formatter.

david-arm · 2025-09-23T15:49:01Z

@david-arm Also worth checking if this patch doesn't open up the regression fixed by #160326

Isn't #160326 this PR, which hasn't landed yet?

david-arm · 2025-09-23T16:03:06Z

Can you add a test case please?

igogo-x86 · 2025-09-24T11:33:39Z

Ah, yes indeed - it was #76034. I added a test and it showed that my approach was also incorrect, so I had to go deeper and ask pointers for invariance

llvm/include/llvm/Transforms/Utils/LoopUtils.h

llvm/lib/Transforms/Utils/LoopUtils.cpp

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

david-arm · 2025-09-26T14:48:16Z

llvm/include/llvm/Transforms/Utils/LoopUtils.h


 /// Add code that checks at runtime if the accessed arrays in \p PointerChecks
 /// overlap. Returns the final comparator value or NULL if no check is needed.
+/// If \p HoistRuntimeChecks and \p TheLoop has a parent, sets \p


nit: Perhaps this should be something like:

/// If \p HoistRuntimeChecks is true and \p TheLoop has a parent, then \p HoistRuntimeChecks /// is set to true when all checks are outer-loop invariant, i.e. hoistable, /// or false otherwise.

What do you think?

david-arm · 2025-09-26T14:55:02Z

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

  Value *MemRuntimeCheckCond = nullptr;

+  /// True if memory checks are outer-loop invariant (hoistable).
+  /// Used to discount check cost for inner loops.


nit: Perhaps clearer written as 'Used to discount the cost of performing runtime checks for inner loops'?

david-arm · 2025-09-26T15:15:29Z

llvm/lib/Transforms/Utils/LoopUtils.cpp

      IsConflict = ChkBuilder.CreateOr(IsConflict, IsNegativeStride);
    }
+
+    if (AllChecksHoisted) {


I'm not sure all these checks are necessary. Can't you just do a single check at the very end before we return from the function, i.e.

auto *SE = Exp.getSE(); auto *OuterLoop = TheLoop->getParentLoop(); AllChecksHoisted = false; if (HoistRuntimeChecks && OuterLoop != nullptr) AllChecksHoisted = SE->isLoopInvariant(SE->getSCEV(MemoryRuntimeCheck)); Exp.eraseDeadInstructions(MemoryRuntimeCheck); return MemoryRuntimeCheck;

fhahn · 2025-09-29T08:45:34Z

llvm/include/llvm/Transforms/Utils/LoopUtils.h

                 const SmallVectorImpl<RuntimePointerCheck> &PointerChecks,
-                 SCEVExpander &Expander, bool HoistRuntimeChecks = false);
+                 SCEVExpander &Expander, bool HoistRuntimeChecks,
+                 bool &AllChecksHoisted);


Would it be possible/better to return this together with the runtime check using std::pair<> or somerhing like that?

fhahn · 2025-09-29T08:47:49Z

llvm/lib/Transforms/Utils/LoopUtils.cpp

    }
+
+    if (AllChecksHoisted) {
+      AllChecksHoisted &= SE->isLoopInvariant(SE->getSCEV(A.Start), OuterLoop);


Can we avoid going from IR value -> SCEV and just check the entries in PointerChecks for invariance?

Constructing SCEVs here for values that may be removed later may leave dangling IR value entries in the SCEV expression cache?

igogo-x86 requested review from david-arm and fhahn September 23, 2025 15:35

llvmbot added vectorizers llvm:transforms labels Sep 23, 2025

igogo-x86 added 2 commits September 24, 2025 10:24

Fix

a14a3a3

Add test, and also the patch didn't work

32ed96b

fhahn reviewed Sep 26, 2025

View reviewed changes

Address comments

6a482e8

david-arm reviewed Sep 26, 2025

View reviewed changes

fhahn reviewed Sep 29, 2025

View reviewed changes

[LV] Compute SCEV for memcheck before unlinking #160326

Are you sure you want to change the base?

[LV] Compute SCEV for memcheck before unlinking #160326

Conversation

igogo-x86 commented Sep 23, 2025

Uh oh!

llvmbot commented Sep 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

igogo-x86 commented Sep 23, 2025

Uh oh!

github-actions bot commented Sep 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

david-arm commented Sep 23, 2025

Uh oh!

david-arm commented Sep 23, 2025

Uh oh!

igogo-x86 commented Sep 24, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

david-arm Sep 26, 2025

Choose a reason for hiding this comment

Uh oh!

david-arm Sep 26, 2025

Choose a reason for hiding this comment

Uh oh!

david-arm Sep 26, 2025

Choose a reason for hiding this comment

Uh oh!

fhahn Sep 29, 2025

Choose a reason for hiding this comment

Uh oh!

fhahn Sep 29, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

llvmbot commented Sep 23, 2025 •

edited

Loading

github-actions bot commented Sep 23, 2025 •

edited

Loading