[analyzer][NFC] Explain why operator new/delete should never be eval-called #161370

balazs-benics-sonarsource · 2025-09-30T13:10:19Z

Downstream, some change triggered an investigation if we could move a checker callback from check::PostCall to eval::Call. After a lengthy investigation that lead to ExprEngine::VisitCXXNewExpr we realized that CXXNewExprs only trigger a PreCall and PostCall, but never an EvalCall. It also had a FIXME that maybe it should trigger it.

Remember, it called defaultEvalCall which either inlines or conservatively evaluates aka. invalidates the call. But never probes the checker eval-calls to see if any would step in.

After implementing the changes to trigger the eval call for the checkers, I realized that it doesn't really make sense because we are eval-calling user-provided functions, that we can't be really sure about their semantics, thus there is no generic way to properly implement the eval call callback.
This touches on an important point. It only ever makes sense to eval call functions that has a clear spec. such as standard functions, as implementing the callback would prevent the inlining of that function, risking regressing analysis quality if the implemented model is not complete/correct enough.

As a conclusion, I opted for not exposing the eval call event to checkers, in other words, keep everything as-is, but document my journey.

CPP-6585

…called Downstream, some change triggered an investigation if we could move a checker callback from check::PostCall to eval::Call. After a lengthy investigation that lead to ExprEngine::VisitCXXNewExpr we realized that CXXNewExprs only trigger a PreCall and PostCall, but never an EvalCall. It also had a fixme that maybe it should trigger it. Remember, it called `defaultEvalCall` which either inlines or conservatively evaluates aka. invalidates the call. But never propes the checker evalCalls to see if any would step in. After implementing the changes to trigger the eval call for the checkers, I realized that it doesn't really make sense because we are eval-calling user-provided functions, that we can't be really sure about their semantics, thus there is no generic way to properly implement the eval call callback. This touches on an important point. It only ever makes sense to eval call functions that has a clear spec. such as standard functions, as implementing the callback would prevent the inlining of that function, risking regressing analysis quality if the implemented model is not complete/correct enough. As a conclusion, I opted for not exposing the eval call event to checkers, in other words, keep everything as-is, but document my journey. CPP-6585

llvmbot · 2025-09-30T13:10:44Z

@llvm/pr-subscribers-clang

@llvm/pr-subscribers-clang-static-analyzer-1

Author: Balázs Benics (balazs-benics-sonarsource)

Changes

Downstream, some change triggered an investigation if we could move a checker callback from check::PostCall to eval::Call. After a lengthy investigation that lead to ExprEngine::VisitCXXNewExpr we realized that CXXNewExprs only trigger a PreCall and PostCall, but never an EvalCall. It also had a FIXME that maybe it should trigger it.

Remember, it called defaultEvalCall which either inlines or conservatively evaluates aka. invalidates the call. But never probes the checker eval-calls to see if any would step in.

After implementing the changes to trigger the eval call for the checkers, I realized that it doesn't really make sense because we are eval-calling user-provided functions, that we can't be really sure about their semantics, thus there is no generic way to properly implement the eval call callback.
This touches on an important point. It only ever makes sense to eval call functions that has a clear spec. such as standard functions, as implementing the callback would prevent the inlining of that function, risking regressing analysis quality if the implemented model is not complete/correct enough.

As a conclusion, I opted for not exposing the eval call event to checkers, in other words, keep everything as-is, but document my journey.

CPP-6585

Full diff: https://github.com/llvm/llvm-project/pull/161370.diff

4 Files Affected:

(modified) clang/lib/StaticAnalyzer/Checkers/AnalysisOrderChecker.cpp (+2-1)
(modified) clang/lib/StaticAnalyzer/Checkers/CheckerDocumentation.cpp (+8)
(modified) clang/lib/StaticAnalyzer/Core/ExprEngineCXX.cpp (+11-1)
(modified) clang/test/Analysis/cxxctr-evalcall-analysis-order.cpp (+17)

diff --git a/clang/lib/StaticAnalyzer/Checkers/AnalysisOrderChecker.cpp b/clang/lib/StaticAnalyzer/Checkers/AnalysisOrderChecker.cpp
index e64153d53bbd6..309e3d250de06 100644
--- a/clang/lib/StaticAnalyzer/Checkers/AnalysisOrderChecker.cpp
+++ b/clang/lib/StaticAnalyzer/Checkers/AnalysisOrderChecker.cpp
@@ -129,7 +129,8 @@ class AnalysisOrderChecker
       llvm::errs() << " {argno: " << Call.getNumArgs() << '}';
       llvm::errs() << " [" << Call.getKindAsString() << ']';
       llvm::errs() << '\n';
-      return true;
+      // We can't return `true` from this callback without binding the return
+      // value. Let's just fallthrough here and return `false`.
     }
     return false;
   }
diff --git a/clang/lib/StaticAnalyzer/Checkers/CheckerDocumentation.cpp b/clang/lib/StaticAnalyzer/Checkers/CheckerDocumentation.cpp
index 392c7eeea234a..b08a81f2889fd 100644
--- a/clang/lib/StaticAnalyzer/Checkers/CheckerDocumentation.cpp
+++ b/clang/lib/StaticAnalyzer/Checkers/CheckerDocumentation.cpp
@@ -262,6 +262,14 @@ class CheckerDocumentation
   /// state. This callback allows a checker to provide domain specific knowledge
   /// about the particular functions it knows about.
   ///
+  /// Note that to evaluate a call, the handler MUST bind the return value if
+  /// its a non-void function. Invalidate the arguments if necessary.
+  ///
+  /// Note that in general, user-provided functions should not be eval-called
+  /// because the checker can't predict the exact semantics/contract of the
+  /// callee, and by having the eval::Call callback, we also prevent it from
+  /// getting inlined, potentially regressing analysis quality.
+  ///
   /// \returns true if the call has been successfully evaluated
   /// and false otherwise. Note, that only one checker can evaluate a call. If
   /// more than one checker claims that they can evaluate the same call the
diff --git a/clang/lib/StaticAnalyzer/Core/ExprEngineCXX.cpp b/clang/lib/StaticAnalyzer/Core/ExprEngineCXX.cpp
index dee34e3e9d6a5..7a9d097861f42 100644
--- a/clang/lib/StaticAnalyzer/Core/ExprEngineCXX.cpp
+++ b/clang/lib/StaticAnalyzer/Core/ExprEngineCXX.cpp
@@ -909,7 +909,13 @@ void ExprEngine::VisitCXXNewAllocatorCall(const CXXNewExpr *CNE,
   ExplodedNodeSet DstPostCall;
   StmtNodeBuilder CallBldr(DstPreCall, DstPostCall, *currBldrCtx);
   for (ExplodedNode *I : DstPreCall) {
-    // FIXME: Provide evalCall for checkers?
+    // Operator new calls (CXXNewExpr) are intentionally not eval-called,
+    // because it does not make sense to eval-call user-provided functions.
+    // 1) If the new operator can be inlined, then don't prevent it from
+    //    inlining by having an eval-call of that operator.
+    // 2) If it can't be inlined, then the default conservative modeling
+    //    is what we want anyway.
+    // So the best is to not allow eval-calling CXXNewExprs from checkers.
     defaultEvalCall(CallBldr, I, *Call);
   }
   // If the call is inlined, DstPostCall will be empty and we bail out now.
@@ -1110,6 +1116,10 @@ void ExprEngine::VisitCXXDeleteExpr(const CXXDeleteExpr *CDE,
   if (AMgr.getAnalyzerOptions().MayInlineCXXAllocator) {
     StmtNodeBuilder Bldr(DstPreCall, DstPostCall, *currBldrCtx);
     for (ExplodedNode *I : DstPreCall) {
+      // Intentionally either inline or conservative eval-call the operator
+      // delete, but triggering an eval-call event for checkers.
+      // As detailed at handling CXXNewExprs, in short, because it does not
+      // really make sense to eval-call user-provided functions.
       defaultEvalCall(Bldr, I, *Call);
     }
   } else {
diff --git a/clang/test/Analysis/cxxctr-evalcall-analysis-order.cpp b/clang/test/Analysis/cxxctr-evalcall-analysis-order.cpp
index 0e1ec2f9de566..743c5ad0fa8cd 100644
--- a/clang/test/Analysis/cxxctr-evalcall-analysis-order.cpp
+++ b/clang/test/Analysis/cxxctr-evalcall-analysis-order.cpp
@@ -18,16 +18,33 @@ void foo() {
   C C0;
   C C1(42);
   C *C2 = new C{2, 3};
+  delete C2;
 }
 
 // CHECK:  PreCall (C::C) [CXXConstructorCall]
 // CHECK-NEXT:  EvalCall (C::C) {argno: 0} [CXXConstructorCall]
 // CHECK-NEXT:  PostCall (C::C) [CXXConstructorCall]
+
 // CHECK-NEXT:  PreCall (C::C) [CXXConstructorCall]
 // CHECK-NEXT:  EvalCall (C::C) {argno: 1} [CXXConstructorCall]
 // CHECK-NEXT:  PostCall (C::C) [CXXConstructorCall]
+
 // CHECK-NEXT:  PreCall (operator new) [CXXAllocatorCall]
+//    COMMENT: Operator new calls (CXXNewExpr) are intentionally not eval-called,
+//    COMMENT: because it does not make sense to eval call user-provided functions.
+//    COMMENT: 1) If the new operator can be inlined, then don't prevent it from
+//    COMMENT:    inlining by having an eval-call of that operator.
+//    COMMENT: 2) If it can't be inlined, then the default conservative modeling
+//    COMMENT:    is what we anyways want anyway.
+//    COMMENT: So the EvalCall event will not be triggered for operator new calls.
+// CHECK-NOT:   EvalCall
 // CHECK-NEXT:  PostCall (operator new) [CXXAllocatorCall]
+
 // CHECK-NEXT:  PreCall (C::C) [CXXConstructorCall]
 // CHECK-NEXT:  EvalCall (C::C) {argno: 2} [CXXConstructorCall]
 // CHECK-NEXT:  PostCall (C::C) [CXXConstructorCall]
+
+// CHECK-NEXT: PreCall (operator delete) [CXXDeallocatorCall]
+//    COMMENT: Same reasoning as for CXXNewExprs above.
+// CHECK-NOT:  EvalCall
+// CHECK-NEXT: PostCall (operator delete) [CXXDeallocatorCall]

balazs-benics-sonarsource · 2025-09-30T13:10:49Z

This relates to past patches like https://reviews.llvm.org/D41266 and https://reviews.llvm.org/D124845

clang/lib/StaticAnalyzer/Core/ExprEngineCXX.cpp

clang/lib/StaticAnalyzer/Checkers/CheckerDocumentation.cpp

clang/lib/StaticAnalyzer/Core/ExprEngineCXX.cpp

Co-authored-by: Donát Nagy <donat.nagy@ericsson.com>

NagyDonat

Overall I agree with these comments.

However, note that in MallocChecker.cpp we already have precedent for code that performs EvalCall for user-defined functions if they are marked by ownership annotations:

llvm-project/clang/lib/StaticAnalyzer/Checkers/MallocChecker.cpp

Line 1709 in 98d43ef

if (isAllocatingOwnershipAttrCall(Call)) {

Once I though about introducing an "EvalCall, but if the function cannot be inlined" callback, but it was shelved for some reason that I don't recall. Perhaps it should be resurrected for situations like this?

balazs-benics-sonarsource · 2025-09-30T14:36:07Z

Overall I agree with these comments.

However, note that in MallocChecker.cpp we already have precedent for code that performs EvalCall for user-defined functions if they are marked by ownership annotations:

llvm-project/clang/lib/StaticAnalyzer/Checkers/MallocChecker.cpp

Line 1709 in 98d43ef

if (isAllocatingOwnershipAttrCall(Call)) {

I am aware. That case is justified because we have enough hints about the semantics. So it's fine.
In general, it's more borderline for all operator news.

Once I though about introducing an "EvalCall, but if the function cannot be inlined" callback, but it was shelved for some reason that I don't recall. Perhaps it should be resurrected for situations like this?

We need APIs for controlling what gets certainly inlined, and what gets definitely not inlined.
For exampe, std::sort should never be inlined, but any member functions of std::array should be always inlined.
As of today, we have no means to control this.
What you propose also makes sense, but I'd say a bit more niche use case.

…called (#161370) Downstream, some change triggered an investigation if we could move a checker callback from check::PostCall to eval::Call. After a lengthy investigation that lead to ExprEngine::VisitCXXNewExpr we realized that CXXNewExprs only trigger a PreCall and PostCall, but never an EvalCall. It also had a FIXME that maybe it should trigger it. Remember, it called `defaultEvalCall` which either inlines or conservatively evaluates aka. invalidates the call. But never probes the checker eval-calls to see if any would step in. After implementing the changes to trigger the eval call for the checkers, I realized that it doesn't really make sense because we are eval-calling user-provided functions, that we can't be really sure about their semantics, thus there is no generic way to properly implement the eval call callback. This touches on an important point. It only ever makes sense to eval call functions that has a clear spec. such as standard functions, as implementing the callback would prevent the inlining of that function, risking regressing analysis quality if the implemented model is not complete/correct enough. As a conclusion, I opted for not exposing the eval call event to checkers, in other words, keep everything as-is, but document my journey. CPP-6585

balazs-benics-sonarsource requested review from Xazax-hun, haoNoQ and NagyDonat September 30, 2025 13:10

balazs-benics-sonarsource added the clang:static analyzer label Sep 30, 2025

llvmbot added the clang Clang issues not falling into any other category label Sep 30, 2025

balazs-benics-sonarsource requested a review from necto September 30, 2025 13:11

necto approved these changes Sep 30, 2025

View reviewed changes

clang/lib/StaticAnalyzer/Core/ExprEngineCXX.cpp Show resolved Hide resolved

balazs-benics-sonarsource commented Sep 30, 2025

View reviewed changes

clang/lib/StaticAnalyzer/Checkers/CheckerDocumentation.cpp Show resolved Hide resolved

balazs-benics-sonarsource commented Sep 30, 2025

View reviewed changes

clang/lib/StaticAnalyzer/Core/ExprEngineCXX.cpp Show resolved Hide resolved

Mention PreCall/PostCall as an alternative.

57a083c

NagyDonat reviewed Sep 30, 2025

View reviewed changes

clang/lib/StaticAnalyzer/Core/ExprEngineCXX.cpp Outdated Show resolved Hide resolved

Add the missing avoid in comment

251e55d

Co-authored-by: Donát Nagy <donat.nagy@ericsson.com>

NagyDonat approved these changes Sep 30, 2025

View reviewed changes

balazs-benics-sonarsource enabled auto-merge (squash) September 30, 2025 14:36

balazs-benics-sonarsource merged commit afe73f4 into llvm:main Sep 30, 2025
9 checks passed

balazs-benics-sonarsource deleted the bb/explain-why-operator-new-shouldnt-be-eval-called branch September 30, 2025 14:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[analyzer][NFC] Explain why operator new/delete should never be eval-called #161370

[analyzer][NFC] Explain why operator new/delete should never be eval-called #161370

balazs-benics-sonarsource commented Sep 30, 2025

Uh oh!

llvmbot commented Sep 30, 2025 •

edited

Loading

Uh oh!

balazs-benics-sonarsource commented Sep 30, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

NagyDonat left a comment

Uh oh!

balazs-benics-sonarsource commented Sep 30, 2025

Uh oh!

Uh oh!

Uh oh!

[analyzer][NFC] Explain why operator new/delete should never be eval-called #161370

[analyzer][NFC] Explain why operator new/delete should never be eval-called #161370

Conversation

balazs-benics-sonarsource commented Sep 30, 2025

Uh oh!

llvmbot commented Sep 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

balazs-benics-sonarsource commented Sep 30, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

NagyDonat left a comment

Choose a reason for hiding this comment

Uh oh!

balazs-benics-sonarsource commented Sep 30, 2025

Uh oh!

Uh oh!

Uh oh!

llvmbot commented Sep 30, 2025 •

edited

Loading