JS: Add precise data-flow steps for Promise.all #3478

erik-krogh · 2020-05-14T17:05:06Z

We can now precisely analyze something like the below in a DataFlow::Configuration.

var [clean, tainted] = await Promise.all(["clean", source]);
sink(clean); // OK - but flagged by taint-tracking
sink(tainted); // NOT OK

A TaintTracking::Configuration will still mix up the two properties.
This is caused both by the heap/directory taint-steps in TaintTracking.qll, and by the taint-steps in Promise.qll.
(You have to remove both to get rid of the imprecision).

esbena

LGTM, just a bit of polish required, and a change note for "promises" and bluebird.

The taint step precision can be addressed later - lets continue that discussion in https://github.com/github/codeql-javascript-team/issues/117.

esbena · 2020-05-15T07:06:17Z

javascript/ql/src/semmle/javascript/Arrays.qll

+      exists(int i |
+        element = this.getElement(i) and
+        obj = this and
+        prop = i.toString()


I think we should have the following instead of the ad-hoc conversion with .toString, and also in the name of hypothetical performance.

string arrayElement(int i) { i < 5 and result = i.toString() or result = arrayElement() }

javascript/ql/src/semmle/javascript/Promises.qll

Co-authored-by: Esben Sparre Andreasen <esbena@github.com>

esbena

Approved. See inline comment.

esbena · 2020-05-15T08:23:53Z

javascript/ql/src/semmle/javascript/Arrays.qll

      exists(int i |
        element = this.getElement(i) and
        obj = this and
-        prop = i.toString()
+        prop = arrayElement(i)


We can eliminate getAnElement() because we happen to always know the element order in an ArrayCreationNode?
I think that is forwards compatible in practice, but please indicate with a 👍 that this change was intentional.

asgerf · 2020-05-15T08:36:40Z

Hm, I'm a little sceptical of doing something that only works for pure data-flow, not taint-tracking.

I was thinking we could locally recognize tuple steps through Promise.all, add those as taint steps, and use them to filter out the usual heap steps. With pseudo-code, something like this:

predicate promiseAllStep(Node pred, Node succ) { ... }

predicate heapStep(Node pred, Node succ) {
  // ...
   exists(ArrayExpr array | array = succ.asExpr() |
     not promiseAllStep(pred, _)
     pred = array.getAnElement().flow()
    )
)

erik-krogh · 2020-05-15T09:01:19Z

I was thinking we could locally recognize tuple steps through Promise.all, add those as taint steps, and use them to filter out the usual heap steps. With pseudo-code, something like this:

I like the idea, I'll look at that afterwards.
For now I think this is good to merge after an evaluation.

asgerf · 2020-05-15T09:15:16Z

javascript/ql/test/library-tests/Promises/flow2.js

+
+	var [clean2, tainted2] = await Promise.resolve(Promise.all(["clean", source]));
+	sink(clean2); // OK - but flagged by taint-tracking
+	sink(tainted2); // NOT OK


Could you add a test case where the inputs to Promise.all are promises?

I can't quite picture how this would work. It seems to me we end up with the sequence of steps:

Store step into promise with promise-value pseudo-property

Store step into array object with array-index property

Load step from the array object to Promise.all with promise-value pseudo-property

Load step from Promise.all to array access with array-index property

The above sequence is not reducible to plan data flow because the innermost store/load pair have different property names.

The way I see it, Promise.all converts an "array of promises" to a "promise of an array", which in general requires us to swap the order of two properties. This requires two loads steps followed by two store steps (which we unfortunately can't do).

The way I see it, Promise.all converts an "array of promises" to a "promise of an array", which in general requires us to swap the order of two properties. This requires two loads steps followed by two store steps (which we unfortunately can't do).

Your analysis is correct, and we end up not tracking data-flow when the inputs to Promise.all are promises.

Yeah that's really unfortunate. This PR is a bit of a hard sell with that restriction. Do you mind withholding this PR for a bit while trying out the local heuristic?

Yeah that's really unfortunate. This PR is a bit of a hard sell with that restriction. Do you mind withholding this PR for a bit while trying out the local heuristic?

I think I got it working nicely.
The last test-case is still not flagged by a DataFlow::Configuration, but the TaintTracking::Configuration no longer has any FPs in the test.

erik-krogh · 2020-05-17T14:02:36Z

An evaluation came back with mixed results (redo of the 10 worst).

I tried to restrict the precise array-elements to Promise.all.
Evaluation on that looks good. (After I initially got this weird result).

…k-krogh/3478

asgerf

Thanks! LGTM now

…omise.all

…k-krogh/3478

erik-krogh · 2020-05-20T09:06:15Z

@asgerf can I get a re-approve?

erik-krogh · 2020-05-20T09:59:21Z

@esbena can I get a re-approve/merge?
QLucie won't let me merge without your approval.

erik-krogh added 2 commits May 14, 2020 18:55

implement precise data-flow steps for Promise.all

e98f794

add tests

5132e61

erik-krogh added JS Awaiting evaluation Do not merge yet, this PR is waiting for an evaluation to finish labels May 14, 2020

erik-krogh requested a review from a team as a code owner May 14, 2020 17:05

update expected output

6775294

erik-krogh force-pushed the PromiseAll branch from d94ea95 to 6775294 Compare May 14, 2020 20:26

esbena reviewed May 15, 2020

View reviewed changes

erik-krogh and others added 3 commits May 15, 2020 09:58

remove redundant instanceof check

cb96ee8

Co-authored-by: Esben Sparre Andreasen <esbena@github.com>

add change note for bluebird and "Promise"

4eb9684

restrict the number of stored array elements

dd3342b

esbena previously approved these changes May 15, 2020

View reviewed changes

asgerf requested changes May 15, 2020

View reviewed changes

add test for promise inside Promise.all

3138918

erik-krogh dismissed esbena’s stale review via 3138918 May 15, 2020 09:49

erik-krogh added 2 commits May 15, 2020 22:02

more precise taint-tracking for Promise.all

e2cd7e6

restrict precise array elements to Promise.all()

8717f7b

erik-krogh removed the Awaiting evaluation Do not merge yet, this PR is waiting for an evaluation to finish label May 17, 2020

Merge branch 'master' of https://github.com/github/codeql into pr/eri…

bd3c4d4

…k-krogh/3478

asgerf previously approved these changes May 18, 2020

View reviewed changes

update expected output after restricting precise array tracking to Pr…

c6276dd

…omise.all

erik-krogh dismissed asgerf’s stale review via c6276dd May 18, 2020 09:49

asgerf previously approved these changes May 18, 2020

View reviewed changes

Merge branch 'master' of https://github.com/github/codeql into pr/eri…

70a28f6

…k-krogh/3478

erik-krogh dismissed asgerf’s stale review via 70a28f6 May 18, 2020 14:06

Merge branch 'master' of https://github.com/github/codeql into pr/eri…

aa396a3

…k-krogh/3478

asgerf approved these changes May 20, 2020

View reviewed changes

esbena approved these changes May 20, 2020

View reviewed changes

semmle-qlci merged commit 2bbc1c2 into github:master May 20, 2020

erik-krogh mentioned this pull request Sep 9, 2021

JS: Support a taint tracking for arguments of .apply() function call #6559

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

JS: Add precise data-flow steps for Promise.all #3478

JS: Add precise data-flow steps for Promise.all #3478

Uh oh!

erik-krogh commented May 14, 2020

Uh oh!

esbena left a comment

Uh oh!

esbena May 15, 2020

Uh oh!

Uh oh!

esbena left a comment

Uh oh!

esbena May 15, 2020

Uh oh!

asgerf commented May 15, 2020

Uh oh!

erik-krogh commented May 15, 2020

Uh oh!

asgerf May 15, 2020

Uh oh!

erik-krogh May 15, 2020

Uh oh!

asgerf May 15, 2020

Uh oh!

erik-krogh May 15, 2020

Uh oh!

erik-krogh commented May 17, 2020 •

edited

Loading

Uh oh!

asgerf left a comment

Uh oh!

erik-krogh commented May 20, 2020

Uh oh!

erik-krogh commented May 20, 2020

Uh oh!

Uh oh!

JS: Add precise data-flow steps for Promise.all #3478

JS: Add precise data-flow steps for Promise.all #3478

Uh oh!

Conversation

erik-krogh commented May 14, 2020

Uh oh!

esbena left a comment

Choose a reason for hiding this comment

Uh oh!

esbena May 15, 2020

Choose a reason for hiding this comment

Uh oh!

Uh oh!

esbena left a comment

Choose a reason for hiding this comment

Uh oh!

esbena May 15, 2020

Choose a reason for hiding this comment

Uh oh!

asgerf commented May 15, 2020

Uh oh!

erik-krogh commented May 15, 2020

Uh oh!

asgerf May 15, 2020

Choose a reason for hiding this comment

Uh oh!

erik-krogh May 15, 2020

Choose a reason for hiding this comment

Uh oh!

asgerf May 15, 2020

Choose a reason for hiding this comment

Uh oh!

erik-krogh May 15, 2020

Choose a reason for hiding this comment

Uh oh!

erik-krogh commented May 17, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

asgerf left a comment

Choose a reason for hiding this comment

Uh oh!

erik-krogh commented May 20, 2020

Uh oh!

erik-krogh commented May 20, 2020

Uh oh!

Uh oh!

erik-krogh commented May 17, 2020 •

edited

Loading