Better handling of iterating over a map's key set #535

msridhar · 2021-12-28T23:42:24Z

Example:

   void foo(Map<String, Object> m){
     for(String k: m.keySet()) {
       m.get(k).toString(); // NullAway thinks it's a null deref
     }
   }

NullAway currently warns on the above code, but it does not warn for the following code:

   void foo2(Map<String, Object> m){
     String k = "baz";
     for(m.containsKey(k)) {
       m.get(k).toString(); 
     }
   }

Ideally, we would generalize our handling of the second case to handle the first case as well. Developers may find it strange that the second case is handled but not the first.

msridhar · 2022-01-06T18:18:56Z

Some thoughts here. Consider the following code:

import java.util.*;
class MyClass {
    void test(Map m) {
	for (Object o: m.keySet()) {
	    m.get(o);
	}
    }
}

The Checker Framework generates the following control-flow graph for the test() method.
MyClass-test-Map.dot.pdf

This is the key part of it:

Based on this CFG, I think we could pattern-match to handle the common case of code like the above that uses an enhanced for loop, as follows. During dataflow analysis, at an assignment node:

Check if the RHS is of the form m.keySet().iterator() and the LHS is a Checker-generated temp var like iter#num0. If so, this is an iterator() call corresponding to an enhanced for loop. In the successor store, we could add an access path of the form m.get(iter#num0.next()) as @NonNull.
Check if the RHS is of the form iter#num0.next() and the LHS is a local variable like o. If m.get(iter#num0.next()) is in the predecessor store, we could add m.get(o) to the successor store. This is essentially a very limited form of local must-alias tracking.

Checking for temporary variables like iter#num0 is important since for such variables created for enhanced for loops, we know they will never be re-assigned, which makes step 2 a safe-ish operation. It is still possible that m gets re-assigned to a different Map between the two assignments matched above, leading to unsoundness. But we are already unsound for such cases; e.g., we do not report an error for the following case:

void mapReassign(Map m, Object o) {
  if (m.containsKey(o)) {
    m = new HashMap();
    m.get(o).toString(); // null dereference!
  }
}

So the proposed handling would not add a new source of unsoundness.

@lazaroclapp thoughts?

lazaroclapp · 2022-01-07T20:58:12Z

This makes perfect sense to me. I assume we don't know of any other case where CF uses iter#num[...] as a naming pattern (other than enhanced for loops)? Also, no variable in the code itself can have a name clashing with those temporaries, right?

In that case, this seems perfectly safe to me (where safe is "as safe as our existing handling of .containsKey(...) or better, as noted above).

#554) Fixes #535 Uses the strategy outlined in this comment: #535 (comment)

msridhar mentioned this issue Jan 6, 2022

Add test case for unsound map reassignment handling #541

Merged

msridhar mentioned this issue Jan 15, 2022

Reason about iterating over a map's key set using an enhanced for loop #554

Merged

msridhar closed this as completed in #554 Jan 21, 2022

msridhar added a commit that referenced this issue Jan 21, 2022

Reason about iterating over a map's key set using an enhanced for loop (

4ec519c

#554) Fixes #535 Uses the strategy outlined in this comment: #535 (comment)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Better handling of iterating over a map's key set #535

Better handling of iterating over a map's key set #535

msridhar commented Dec 28, 2021

msridhar commented Jan 6, 2022

lazaroclapp commented Jan 7, 2022

Better handling of iterating over a map's key set #535

Better handling of iterating over a map's key set #535

Comments

msridhar commented Dec 28, 2021

msridhar commented Jan 6, 2022

lazaroclapp commented Jan 7, 2022