[flang][StackArrays] skip analysis of very large functions #71047

tblah · 2023-11-02T11:00:34Z

The stack arrays pass uses data flow analysis to determine whether heap allocations are freed on all paths out of the function.
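
As a rough illustration of what "freed on all paths" means, the sketch below models a single allocation with a simplified three-value state and a join applied where control flow merges. The names AllocState, join and freedOnAllPaths are hypothetical and chosen for this example; this is not the lattice or API used by the pass itself.

```cpp
#include <initializer_list>

// Hypothetical, simplified state of one heap allocation at a program point.
// Unknown means "cannot prove the allocation is either allocated or freed".
enum class AllocState { Allocated, Freed, Unknown };

// Join two states where control-flow paths merge: the result is only precise
// when both incoming paths agree.
AllocState join(AllocState a, AllocState b) {
  return a == b ? a : AllocState::Unknown;
}

// An allocation qualifies only if it is known to be Freed at every return.
bool freedOnAllPaths(std::initializer_list<AllocState> statesAtReturns) {
  for (AllocState s : statesAtReturns)
    if (s != AllocState::Freed)
      return false;
  return true;
}
```

For example, joining a branch that frees the allocation with one that does not yields Unknown, so the allocation would not be considered freed on all paths.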

interp_domain_em_part2 in spec2017 wrf generates over 120k operations, including almost 5k fir.if operations and over 200 fir.do_loop operations, all in the same function. The MLIR data flow analysis framework cannot provide reasonable performance for such cases because there is a combinatorial explosion in the number of control flow paths through the function, all of which must be checked to determine if the heap allocations will be freed.
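
To give a feel for the growth, here is a back-of-the-envelope sketch that counts paths under the simplifying assumption that the branches are sequential, independent two-way choices. The function countPaths is hypothetical; it does not describe how the MLIR framework walks the function, and real code with nesting and loops behaves differently.

```cpp
#include <cstdint>
#include <limits>

// Number of distinct paths through `numIfs` sequential, independent two-way
// branches (2^numIfs), saturating at UINT64_MAX instead of overflowing.
std::uint64_t countPaths(unsigned numIfs) {
  const std::uint64_t maxValue = std::numeric_limits<std::uint64_t>::max();
  std::uint64_t paths = 1;
  for (unsigned i = 0; i < numIfs; ++i) {
    if (paths > maxValue / 2)
      return maxValue; // doubling again would overflow
    paths *= 2;
  }
  return paths;
}
```

Under this assumption the count saturates a 64-bit integer after only 64 branches, far below the roughly 5k fir.if operations mentioned above.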

This patch skips the stack arrays pass for very large functions (defined as having more than 1000 fir.allocmem operations). The threshold is configurable with the hidden command line option stack-arrays-max-allocs; setting it to 0 disables the limit.

With this patch, compiling this file is more than 80% faster.
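
The skip decision can be sketched as a standalone predicate, assuming the same rules as the guard in the diff: no allocations means there is nothing to analyse, and a limit of 0 means "unlimited". The helper name shouldSkipAnalysis is hypothetical and does not appear in the patch.

```cpp
#include <cstddef>

// Returns true when the stack arrays analysis should be skipped for a
// function containing `nAllocs` heap allocations.
// `maxAllocs == 0` is treated as "no limit".
bool shouldSkipAnalysis(std::size_t nAllocs, std::size_t maxAllocs) {
  if (nAllocs == 0)
    return true; // no heap allocations, so nothing to move to the stack
  return maxAllocs != 0 && nAllocs > maxAllocs;
}
```

With the default limit of 1000, a function with exactly 1000 allocations is still analysed, while one with 1001 is skipped.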

tblah · 2023-11-02T11:00:59Z

@d-smirnov for some reason it didn't let me add you as a reviewer

llvmbot · 2023-11-02T11:01:11Z

@llvm/pr-subscribers-flang-fir-hlfir

Author: Tom Eccles (tblah)

Changes

The stack arrays pass uses data flow analysis to determine whether heap allocations are freed on all paths out of the function.

interp_domain_em_part2 in spec2017 wrf generates over 120k operations, including almost 5k fir.if operations and over 200 fir.do_loop operations, all in the same function. The MLIR data flow analysis framework cannot provide reasonable performance for such cases because there is a combinatorial explosion in the number of control flow paths through the function, all of which must be checked to determine if the heap allocations will be freed.

This patch skips the stack arrays pass for ridiculously large functions (defined as having more than 1000 fir.allocmem operations). This threshold is configurable at runtime with a command line argument.

With this patch, compiling this file is more than 80% faster.

Full diff: https://github.com/llvm/llvm-project/pull/71047.diff

1 Files Affected:

(modified) flang/lib/Optimizer/Transforms/StackArrays.cpp (+17)

diff --git a/flang/lib/Optimizer/Transforms/StackArrays.cpp b/flang/lib/Optimizer/Transforms/StackArrays.cpp
index 9b90aed5a17ae73..7b066ec7a2bfda6 100644
--- a/flang/lib/Optimizer/Transforms/StackArrays.cpp
+++ b/flang/lib/Optimizer/Transforms/StackArrays.cpp
@@ -42,6 +42,12 @@ namespace fir {
 
 #define DEBUG_TYPE "stack-arrays"
 
+static llvm::cl::opt<std::size_t> maxAllocsPerFunc(
+    "stack-arrays-max-allocs",
+    llvm::cl::desc("The maximum number of heap allocations to consider in one "
+                   "function before skipping (to save compilation time)"),
+    llvm::cl::init(1000), llvm::cl::Hidden);
+
 namespace {
 
 /// The state of an SSA value at each program point
@@ -411,6 +417,17 @@ void AllocationAnalysis::processOperation(mlir::Operation *op) {
 mlir::LogicalResult
 StackArraysAnalysisWrapper::analyseFunction(mlir::Operation *func) {
   assert(mlir::isa<mlir::func::FuncOp>(func));
+  size_t nAllocs = 0;
+  func->walk([&nAllocs](fir::AllocMemOp) { nAllocs++; });
+  // don't bother with the analysis if there are no heap allocations
+  if (nAllocs == 0)
+    return mlir::success();
+  if ((maxAllocsPerFunc != 0) && (nAllocs > maxAllocsPerFunc)) {
+    LLVM_DEBUG(llvm::dbgs() << "Skipping stack arrays for function with "
+                            << nAllocs << " heap allocations");
+    return mlir::success();
+  }
+
   mlir::DataFlowSolver solver;
   // constant propagation is required for dead code analysis, dead code analysis
   // is required to mark blocks live (required for mlir dense dfa)

kiranchandramohan · 2023-11-02T12:14:47Z

flang/lib/Optimizer/Transforms/StackArrays.cpp

+  // don't bother with the analysis if there are no heap allocations
+  if (nAllocs == 0)
+    return mlir::success();
+  if ((maxAllocsPerFunc != 0) && (nAllocs > maxAllocsPerFunc)) {


kiranchandramohan: If maxAllocsPerFunc is 0, should the pass run?

tblah: I was intending this to follow the idiom of "set the limit to zero for unlimited".

kiranchandramohan: I see. Maybe just document that.

kiranchandramohan

LG with comment added.

tblah requested a review from kiranchandramohan November 2, 2023 11:00

llvmbot added the flang and flang:fir-hlfir labels Nov 2, 2023

kiranchandramohan reviewed Nov 2, 2023

kiranchandramohan approved these changes Nov 2, 2023

Document set threshold to 0 for unlimited allocations

6b49d22

tblah merged commit e215324 into llvm:main Nov 3, 2023
3 checks passed
