Swift: collection/tuple content for dictionary flow #13947

Merged

MathiasVP merged 14 commits into github:main from rdmarsh2:rdmarsh2/swift/dictionary-flow-tuples

Sep 7, 2023

Contributor

rdmarsh2 commented Aug 10, 2023

This PR models dictionary content as being TupleContent nested within CollectionContent, and adds a new DictionarySubscriptNode to represent the intermediate step, where only the TupleContent has been stored, or only the CollectionContent has been read. This means models for the generic Collection protocol will work for dictionaries without modification.
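
A minimal Swift sketch of the kind of code this modeling is meant to cover. `source`, `sink`, and `sunk` are hypothetical stand-ins for a taint source and sink, not CodeQL APIs. The value is stored through a dictionary subscript and read back through generic `Collection` members, whose elements are `(key, value)` tuples.

```swift
// Hypothetical stand-ins for a taint source and a sink (assumptions for illustration).
func source() -> String { return "tainted" }
var sunk: [String] = []
func sink(_ value: String) { sunk.append(value) }

var dict: [String: String] = [:]
dict["k"] = source()        // store through the dictionary subscript

// Generic Collection APIs see each entry as a (key, value) tuple element.
for element in dict {
    sink(element.value)     // same position as element.1
}
if let first = dict.first {
    sink(first.1)
}
```

Under the approach described above, both reads would be expected to resolve as a collection read followed by a read of tuple index 1, with no dictionary-specific model for `first` or iteration.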

rdmarsh2 added 2 commits

August 10, 2023 20:52


          Swift: collection/tuple content for dictionary flow

70c2ef5


          Swift: Add Dictionary models

d3c68c7

rdmarsh2 requested a review from a team as a code owner

August 10, 2023 20:57

github-actions bot added the Swift label

Contributor

hvitved commented Aug 11, 2023

> This PR models dictionary content as being TupleContent nested within CollectionContent

FTR, this is also how it's done in C# 👍 (though instead of TupleContent it's a FieldContent representing a KeyValuePair)

Contributor Author

rdmarsh2 commented Aug 11, 2023

That's good to know. Swift has had first-class tuples from the start, so they just alias (Key, Value) as the element type.
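
As a small illustration of that aliasing, the sketch below uses only the standard library: `Dictionary.Element` is a typealias for the labeled tuple `(key: Key, value: Value)`, so the key sits at tuple index 0 and the value at index 1. The dictionary contents are made up for the example.

```swift
let ages: [String: Int] = ["ada": 36]

// Dictionary.Element is a typealias for (key: Key, value: Value).
let element: Dictionary<String, Int>.Element = ages.first!

// Labeled tuple elements can also be accessed by position.
let keyMatches = element.0 == element.key      // index 0 is the key
let valueMatches = element.1 == element.value  // index 1 is the value

print(element.key, element.value)
```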

rdmarsh2 added 3 commits

August 11, 2023 17:31


          Swift: autoformat

f5fac66


          Swift: QLDoc for Dicitonary.qll

653a229


          Swift: Change note for dictionary flow

f047161

github-actions bot added the documentation label


          Swift: Autoformat Dictionary.qll

3f0a249

MathiasVP reviewed

swift/ql/lib/codeql/swift/dataflow/internal/DataFlowPrivate.qll (resolved)

swift/ql/lib/codeql/swift/dataflow/internal/DataFlowPrivate.qll Outdated

Comment on lines 761 to 771

+                    subscript.getArgument(0).getExpr() = node1.asExpr() and
+                    node2.(DictionarySubscriptNode).getExpr() = subscript and
+                    c.isSingleton(any(Content::TupleContent tc | tc.getIndex() = 1))
+                    or
+                    assign.getSource() = node1.asExpr() and
+                    node2.(DictionarySubscriptNode).getExpr() = subscript and
+                    c.isSingleton(any(Content::TupleContent tc | tc.getIndex() = 1))
+                    or
+                    node1.(DictionarySubscriptNode) = node1 and
+                    node2.asExpr() = subscript and
+                    c.isSingleton(any(Content::CollectionContent cc))

Contributor

MathiasVP Aug 14, 2023

So now we do:

source -> TupleContent -> CollectionContent

in two steps (which makes sense), and I gather that the last of these three cases is the CollectionContent step. But why do we need two cases to cover the first step? I would have thought that only the middle case was needed, and that the first case would happen through reverse reads?

Contributor

geoffw0 Aug 15, 2023

I think there are two steps to assemble the tuple because one is for the key and one is for the value.

Contributor Author

rdmarsh2 Aug 15, 2023

The first case is the step from the key in dict[key] = value. I made a typo there: it should be tc.getIndex() = 0.
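
The thread above can be pictured with a toy model, sketched here under loud assumptions: `Content` and `AccessPath` are hypothetical types written for this illustration and are not the CodeQL library's classes. A store step wraps the tracked value in one more layer of content, and a read step only succeeds when the outermost layer matches.

```swift
// Toy model only: these types mimic content-labelled steps and are
// not part of the CodeQL libraries.
enum Content: Equatable {
    case tuple(index: Int)
    case collection
}

struct AccessPath: Equatable {
    var stack: [Content] = []   // last element is the outermost content

    // A store step adds one layer of content around the tracked value.
    func store(_ c: Content) -> AccessPath {
        return AccessPath(stack: stack + [c])
    }

    // A read step succeeds only if the outermost layer matches.
    func read(_ c: Content) -> AccessPath? {
        guard let top = stack.last, top == c else { return nil }
        return AccessPath(stack: Array(stack.dropLast()))
    }
}

// dict[key] = value: the key is stored at tuple index 0, the value at index 1,
// and each tuple is then stored into the collection.
let keyInDict = AccessPath().store(.tuple(index: 0)).store(.collection)
let valueInDict = AccessPath().store(.tuple(index: 1)).store(.collection)

// let v = dict[key]: read the collection layer, then tuple index 1.
let readValue = valueInDict.read(.collection)?.read(.tuple(index: 1))
let readKeyAsValue = keyInDict.read(.collection)?.read(.tuple(index: 1))

print(readValue != nil)       // true: the stored value reaches the read
print(readKeyAsValue != nil)  // false: the key sits at index 0, not 1
```

In this model, using index 1 for the key step (the typo noted above) would make a stored key indistinguishable from a stored value.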

geoffw0 reviewed

swift/ql/lib/codeql/swift/dataflow/internal/DataFlowPrivate.qll Outdated (resolved)

swift/ql/lib/codeql/swift/dataflow/internal/DataFlowPrivate.qll Outdated (resolved)

swift/ql/lib/codeql/swift/dataflow/internal/DataFlowPrivate.qll (resolved)

rdmarsh2 added 2 commits

August 15, 2023 17:58


          Swift: add tests for broken dictionary flow case

a9f5471


          Swift: fixes around DictionaryContent

79368c1

MathiasVP reviewed

swift/ql/lib/codeql/swift/dataflow/internal/DataFlowPrivate.qll (resolved)

swift/ql/lib/codeql/swift/dataflow/internal/DataFlowPrivate.qll Outdated

Comment on lines 878 to 881

+                (
+                  c.isSingleton(any(Content::FieldContent fc)) or
+                  c.isSingleton(any(Content::TupleContent tc))
+                )

Contributor

MathiasVP Aug 16, 2023

Why this restriction? Do we not want to clear CollectionContent and ArrayContent?

Contributor

geoffw0 Aug 16, 2023

... or EnumContent?

Contributor Author

rdmarsh2 Aug 16, 2023

We don't want to clear CollectionContent and ArrayContent at a post-update node because they may have been only partially overwritten - consider the case here. It shouldn't clear the content from line 779.

I think we do want to clear EnumContent, though. Good catch.
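
The distinction can be sketched in plain Swift. This is an illustration of the reasoning as I read it, not the test case linked in the comment, and the values are made up: writing one dictionary entry leaves the other entries in place, whereas assigning a tuple element (or a field) replaces that element entirely.

```swift
// Collection content: a write through the subscript updates one entry only.
var dict = ["a": "tainted", "b": "clean"]
dict["b"] = "replaced"
// The entry for "a" is untouched, so content stored earlier must survive the write.
let stillThere = dict["a"]

// Tuple content: assigning an element overwrites it completely,
// so the old content at that position can safely be cleared.
var pair = (first: "tainted", second: "clean")
pair.first = "replaced"

print(stillThere ?? "missing", pair.first)
```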

rdmarsh2 added 2 commits

August 16, 2023 17:52


          Swift: add EnumContent to clearsContent

3ee3eab


          Swift: add QLDoc for DictionarySubscriptNode

d3cc366

geoffw0 reviewed

Contributor

geoffw0 left a comment

My concerns have been addressed. How about you, @MathiasVP?

Contributor

MathiasVP commented Aug 17, 2023

> My concerns have been addressed, how about you @MathiasVP

I think so, yes. I'd still like to see a DCA run on this PR, though 🙏. In addition, it would be nice if we could have a test case that demonstrates the impact of 3ee3eab (although this can be done in a future PR if you prefer).

Contributor

geoffw0 commented Aug 22, 2023

Possible analysis performance regression showing in the DCA runs, particularly the second one. Not sure if it's real or wobble. A small real regression would be acceptable.

> In addition, it would be nice if we could have a testcase that demonstrated the impact of 3ee3eab (although this can be done in a future PR if you prefer).

I agree.

Contributor Author

rdmarsh2 commented Sep 5, 2023

I haven't been able to find any obvious performance issues locally, and the worst cases do look like wobble (very high standard deviations for the slower run). I wonder if we should implement type pruning and see if that fixes performance...

Contributor

MathiasVP commented Sep 5, 2023

If you can't reproduce it then it's probably fine. But let's do another DCA run after the merge conflict has been resolved, just to be safe 🤞.


          Merge branch 'main' into rdmarsh2/swift/dictionary-flow-tuples

5bdd959

Contributor Author

rdmarsh2 commented Sep 7, 2023

Timings in the latest DCA run look reasonable

Contributor

MathiasVP commented Sep 7, 2023

Indeed, this LGTM! Now we just need to:

- Do a round of autoformatting
- Accept the test improvements

and then I think this PR is good to go 🎉

rdmarsh2 added 3 commits

September 7, 2023 16:14


          Swift: autoformat

4f4491a


          Swift: update a test expectation for dictionary flow

0fff540


          Swift: fix test expectation properly

603f2cd

MathiasVP approved these changes

Contributor

MathiasVP left a comment

LGTM!

MathiasVP merged commit 49fee35 into github:main

Labels

documentation Swift