refactor LibraryModelsHandler.onOverrideMayBeNullExpr #754

XN137 · 2023-03-31T17:09:53Z

this does a similar refactoring as in #747

i held back on putting this up for review because i was assuming the removal of analysis.nullnessFromDataflow(state, expr) could be controversial

we can use this review to figure our what kind of tests are missing that would make clear why nullnessFromDataflow was being called inside a handler

XN137 · 2023-03-31T17:15:28Z

nullaway/src/main/java/com/uber/nullaway/handlers/LibraryModelsHandler.java

+    boolean isMethodUnannotated =
+        getCodeAnnotationInfo(state.context).isSymbolUnannotated(methodSymbol, this.config);
+    if (exprMayBeNull) {
+      if (optLibraryModels.hasNonNullReturn(methodSymbol, state.getTypes(), isMethodUnannotated)) {
        return false;


this is the only handler check supporting a true -> false transition

XN137 · 2023-03-31T17:16:22Z

nullaway/src/main/java/com/uber/nullaway/handlers/LibraryModelsHandler.java

-          || !optLibraryModels.nullImpliesNullParameters(methodSymbol).isEmpty()) {
-        // These mean the method might be null, depending on dataflow and arguments. We force
-        // dataflow to run.
-        return analysis.nullnessFromDataflow(state, expr) || exprMayBeNull;


see the odd nullnessFromDataflow being run, even without consideration the current value of exprMayBeNull.

the comment above suggests this is intentional and important, however no test is failing without this dataflow call.

So, from the point of view of code calling this Handler extension point in NullAway.java, with this as the only relevant handler, the change is:

Before:

We ran dataflow here, and return true only if dataflow said true or if exprMayBeNull == true

If we get false out of the full handlers pipeline, we are done, the expression is non-null

Otherwise we call dataflow again.

After:

We return true from the handler unconditionally in this case (meaning if we have either of the above library models)

We call dataflow after getting true from the full handlers pipeline.

Either way, the expression is only consider @Nullable if both the if condition logic in this handler matches and dataflow returns @Nullable and non-null otherwise. So they seem equivalent from the point of view of NullAway.java.

There is a small subtlety here, though, which is that other handlers in the chain will observe the result of LibraryModelsHandler before any call to dataflow on NullAway.java. They will observe different values before/after this change.

I don't think we are relying on dataflow being run here for the correctness of other handlers implementing onOverrideMayBeNullExpr, so this is probably fine? But it's worth pointing that small change in semantics.

One thing I do see possibly happening is that after this change, this handler will set exprMayBeNull == true a lot more often for handlers further down the chain, which could cause some handlers that are checking for reasons to transition nullable -> non-null to do extra work. At the same time, the double call to dataflow is not particularly expensive, because we cache the results of the dataflow analysis.

That said, taking a look at the handlers we currently have, the above concern is theoretical for now. All other handlers implementing this method (RestrictiveAnnotationHandler, InferredJARModelsHandler, and OptionalEmptinessHandler) do strictly less work when exprMayBeNull == true.

Long digression, but basically convinced that this change is a good idea. Worth internal/performance testing, but I'd expect it to be mostly the same, just clearer code! (for which: thank you!)

I think this analysis is correct. Since the new code structure is significantly clearer and easier to understand, I think we would probably want benchmarks showing some measurable performance difference before (re-)introducing an early call to dataflow in a handler. As it stands, I don't expect this change to have a measurable impact

msridhar

This change also LGTM, but is it dependent on #753, or independent? Maybe it's safer to land after #753? Not that any tests are failing now, but I'd rather be confident in the overall logical structure and then get this in

XN137 · 2023-04-04T15:27:44Z

@msridhar i think it is independent (assuming there is no good reason why it was running dataflow early before...) but of course we should rebase and re-run CI after one of them got merged, to confirm all tests are still passing with the combination of the two

lazaroclapp

Minor suggestion and a long digression, but overall this refactor sounds great to me!

lazaroclapp · 2023-04-04T17:33:44Z

nullaway/src/main/java/com/uber/nullaway/handlers/LibraryModelsHandler.java

+    boolean isMethodUnannotated =
+        getCodeAnnotationInfo(state.context).isSymbolUnannotated(methodSymbol, this.config);
+    if (exprMayBeNull) {
+      if (optLibraryModels.hasNonNullReturn(methodSymbol, state.getTypes(), isMethodUnannotated)) {


Why not:

if (exprMayBeNull) { // This is the only case in which we may switch the result from @Nullable to @NonNull: return !optLibraryModels.hasNonNullReturn(methodSymbol, state.getTypes(), isMethodUnannotated); }

Guess there is some consistency in all returns being boolean literals, but not sure the extra if nesting is worth it.

i am aware of this simplification, i have mentioned it here before:

#747 (comment)

when there is only a single reason, we could of course use the condition in a single return statement if preferred?

guess back then there was no preference for the single return statement, but happy to adjust it now here

the suggested comment is true now, but will it stay true in the future as well?
or is this comment really just "local" to the LibraryModelsHandler logic?

let me know your preference and whether we should also adjust the other onOverrideMayBeNullExpr implementations that also only have a single if (for consistency?)

I just think in this case it's clearer to have the return with the call just to avoid extra if nesting and thus the issue of matching branch conditions to returns, but that's a personal preference. Don't feel super strongly either way. In the linked example, I agree that having each condition in its own if check is more readable than some complicated boolean expression with ||/&&, but this case here is a single call and everything else is nested just one level deep.

Either way: happy to keep the code as is, just a suggestion (cc: @msridhar , any thoughts/preference here?)

Edit: Actually, I see it was changed to the suggested case here. So the current code seems good to me.

lazaroclapp · 2023-04-04T17:47:00Z

nullaway/src/main/java/com/uber/nullaway/handlers/LibraryModelsHandler.java

-          || !optLibraryModels.nullImpliesNullParameters(methodSymbol).isEmpty()) {
-        // These mean the method might be null, depending on dataflow and arguments. We force
-        // dataflow to run.
-        return analysis.nullnessFromDataflow(state, expr) || exprMayBeNull;


So, from the point of view of code calling this Handler extension point in NullAway.java, with this as the only relevant handler, the change is:

Before:

We ran dataflow here, and return true only if dataflow said true or if exprMayBeNull == true

If we get false out of the full handlers pipeline, we are done, the expression is non-null

Otherwise we call dataflow again.

After:

We return true from the handler unconditionally in this case (meaning if we have either of the above library models)

We call dataflow after getting true from the full handlers pipeline.

Either way, the expression is only consider @Nullable if both the if condition logic in this handler matches and dataflow returns @Nullable and non-null otherwise. So they seem equivalent from the point of view of NullAway.java.

There is a small subtlety here, though, which is that other handlers in the chain will observe the result of LibraryModelsHandler before any call to dataflow on NullAway.java. They will observe different values before/after this change.

I don't think we are relying on dataflow being run here for the correctness of other handlers implementing onOverrideMayBeNullExpr, so this is probably fine? But it's worth pointing that small change in semantics.

One thing I do see possibly happening is that after this change, this handler will set exprMayBeNull == true a lot more often for handlers further down the chain, which could cause some handlers that are checking for reasons to transition nullable -> non-null to do extra work. At the same time, the double call to dataflow is not particularly expensive, because we cache the results of the dataflow analysis.

That said, taking a look at the handlers we currently have, the above concern is theoretical for now. All other handlers implementing this method (RestrictiveAnnotationHandler, InferredJARModelsHandler, and OptionalEmptinessHandler) do strictly less work when exprMayBeNull == true.

Long digression, but basically convinced that this change is a good idea. Worth internal/performance testing, but I'd expect it to be mostly the same, just clearer code! (for which: thank you!)

lazaroclapp

LGTM.

XN137 · 2023-04-05T16:25:49Z

thanks for the review, please let me know the results of internal testing and performance impact of all these changes if possible

msridhar · 2023-04-05T17:13:40Z

thanks for the review, please let me know the results of internal testing and performance impact of all these changes if possible

Thanks again for the contribution! For performance impact, the best tests we have for that are the jmh tests in open source. Usually it's hard to see a statistically significant performance win there, as there is a good amount of noise in the runs. If you're going to try it, you should try to run on a system with as little load / interference as possible

XN137 · 2023-04-06T07:05:36Z

thanks for the info

Long digression, but basically convinced that this change is a good idea. Worth internal/performance testing, but I'd expect it to be mostly the same, just clearer code! (for which: thank you!)

due to this comment i thought there might be some internal testing going on outside of the JMH benchmarks...

msridhar · 2023-04-06T14:22:43Z

thanks for the info

Long digression, but basically convinced that this change is a good idea. Worth internal/performance testing, but I'd expect it to be mostly the same, just clearer code! (for which: thank you!)

due to this comment i thought there might be some internal testing going on outside of the JMH benchmarks...

You're right that there is some internal performance testing that @lazaroclapp can do. But those numbers are even more noisy than the JMH benchmarks :-) So, outside of some unusual coding pattern or combination of flags that doesn't show up in the benchmarks, it probably won't surface any further perf insights (in my opinion).

lazaroclapp · 2023-04-06T20:34:33Z

thanks for the info

Long digression, but basically convinced that this change is a good idea. Worth internal/performance testing, but I'd expect it to be mostly the same, just clearer code! (for which: thank you!)

due to this comment i thought there might be some internal testing going on outside of the JMH benchmarks...

You're right that there is some internal performance testing that @lazaroclapp can do. But those numbers are even more noisy than the JMH benchmarks :-) So, outside of some unusual coding pattern or combination of flags that doesn't show up in the benchmarks, it probably won't surface any further perf insights (in my opinion).

To be fair, I was mostly thinking of internal conformance testing (i.e. any changes to the behavior of NullAway on our full codebase) + JMH for performance benchmarking. I am not sure anything that doesn't show up in JMH benchmarks will show up in internal builds, since those are a lot more noisy, but definitely will report if something does :)

msridhar · 2023-04-26T13:20:24Z

@XN137 FYI I sent an email to you a while back at the email address associated with your GH commits. Maybe you saw it, I'm just not sure. If you didn't see it, you can email me at the address on my web site from whatever address works best for you

…)" This reverts commit 8821567.

XN137 mentioned this pull request Mar 31, 2023

refactor: streamline mayBeNullExpr flow #753

Merged

XN137 commented Mar 31, 2023

View reviewed changes

XN137 marked this pull request as ready for review March 31, 2023 17:19

XN137 force-pushed the refactor-LibraryModelsHandler branch from 08c88ee to 51d2a7b Compare April 1, 2023 07:26

msridhar reviewed Apr 3, 2023

View reviewed changes

refactor LibraryModelsHandler.onOverrideMayBeNullExpr

e59cef7

XN137 force-pushed the refactor-LibraryModelsHandler branch from 51d2a7b to e59cef7 Compare April 4, 2023 17:35

lazaroclapp reviewed Apr 4, 2023

View reviewed changes

review: single return with comment

faf71d4

lazaroclapp approved these changes Apr 5, 2023

View reviewed changes

lazaroclapp merged commit 8821567 into uber:master Apr 5, 2023

XN137 deleted the refactor-LibraryModelsHandler branch April 5, 2023 16:25

msridhar added a commit to msridhar/NullAway that referenced this pull request Jul 18, 2023

Revert "refactor LibraryModelsHandler.onOverrideMayBeNullExpr (uber#754…

0a727f5

…)" This reverts commit 8821567.

msridhar added a commit to msridhar/NullAway that referenced this pull request Jul 18, 2023

Revert "refactor LibraryModelsHandler.onOverrideMayBeNullExpr (uber#754…

ae32f87

…)" This reverts commit 8821567.

msridhar added a commit to msridhar/NullAway that referenced this pull request Jul 19, 2023

Revert "refactor LibraryModelsHandler.onOverrideMayBeNullExpr (uber#754…

570ca6f

…)" This reverts commit 8821567.

msridhar added a commit to msridhar/NullAway that referenced this pull request Jul 19, 2023

Revert "refactor LibraryModelsHandler.onOverrideMayBeNullExpr (uber#754…

b25529f

…)" This reverts commit 8821567.

msridhar added a commit to msridhar/NullAway that referenced this pull request Jul 19, 2023

Revert "refactor LibraryModelsHandler.onOverrideMayBeNullExpr (uber#754…

274b8c5

…)" This reverts commit 8821567.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor LibraryModelsHandler.onOverrideMayBeNullExpr #754

refactor LibraryModelsHandler.onOverrideMayBeNullExpr #754

XN137 commented Mar 31, 2023

XN137 Mar 31, 2023

XN137 Mar 31, 2023

lazaroclapp Apr 4, 2023

msridhar Apr 4, 2023

msridhar left a comment

XN137 commented Apr 4, 2023

lazaroclapp left a comment

lazaroclapp Apr 4, 2023

XN137 Apr 4, 2023 •

edited

Loading

XN137 Apr 4, 2023

lazaroclapp Apr 5, 2023 •

edited

Loading

lazaroclapp Apr 4, 2023

lazaroclapp left a comment

XN137 commented Apr 5, 2023

msridhar commented Apr 5, 2023

XN137 commented Apr 6, 2023

msridhar commented Apr 6, 2023

lazaroclapp commented Apr 6, 2023

msridhar commented Apr 26, 2023

refactor LibraryModelsHandler.onOverrideMayBeNullExpr #754

refactor LibraryModelsHandler.onOverrideMayBeNullExpr #754

Conversation

XN137 commented Mar 31, 2023

XN137 Mar 31, 2023

Choose a reason for hiding this comment

XN137 Mar 31, 2023

Choose a reason for hiding this comment

lazaroclapp Apr 4, 2023

Choose a reason for hiding this comment

msridhar Apr 4, 2023

Choose a reason for hiding this comment

msridhar left a comment

Choose a reason for hiding this comment

XN137 commented Apr 4, 2023

lazaroclapp left a comment

Choose a reason for hiding this comment

lazaroclapp Apr 4, 2023

Choose a reason for hiding this comment

XN137 Apr 4, 2023 • edited Loading

Choose a reason for hiding this comment

XN137 Apr 4, 2023

Choose a reason for hiding this comment

lazaroclapp Apr 5, 2023 • edited Loading

Choose a reason for hiding this comment

lazaroclapp Apr 4, 2023

Choose a reason for hiding this comment

lazaroclapp left a comment

Choose a reason for hiding this comment

XN137 commented Apr 5, 2023

msridhar commented Apr 5, 2023

XN137 commented Apr 6, 2023

msridhar commented Apr 6, 2023

lazaroclapp commented Apr 6, 2023

msridhar commented Apr 26, 2023

XN137 Apr 4, 2023 •

edited

Loading

lazaroclapp Apr 5, 2023 •

edited

Loading