Python: Modernise py/jinja2/autoescape-false #9135

tausbn · 2022-05-12T12:57:05Z

A simple rewrite to use API graphs instead.

The handling of falsy values is potentially a bit more restrictive now,
as it only accounts for local flow. We should probably figure out a
better way of capturing this pattern, but I felt that this was out of
scope for the present PR.
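
For concreteness, here is a minimal Python sketch of the kind of target code this distinction affects. It is illustrative only: `Environment` is a stand-in for `jinja2.Environment` so the snippet runs without jinja2 installed, and the function names are hypothetical rather than taken from the query's tests. As far as I understand local flow, the first two functions keep the `False` literal and the call in the same scope, whereas the third needs flow across a function boundary.

```python
class Environment:
    """Stand-in for jinja2.Environment (assumption); only records autoescape."""

    def __init__(self, autoescape):
        self.autoescape = autoescape


def direct():
    # The literal appears at the call site itself.
    return Environment(autoescape=False)


def via_local_variable():
    # The literal and the call are in the same function: local flow suffices.
    flag = False
    return Environment(autoescape=flag)


def make_env(escape):
    # The False literal lives in the caller, so connecting it to this
    # argument requires flow through a parameter (not local flow).
    return Environment(autoescape=escape)


def via_parameter():
    return make_env(False)
```

All three functions build an environment with escaping disabled at run time; they differ only in how far the literal is from the call.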

I don't think a change note is necessary for this, but I'll gladly add one if needed.

yoff

I agree that we should probably use global data flow to track the booleans. Ideally, when parameterised modules arrive in anger, we can have a data flow configuration for tracking booleans that can just be seamlessly referred to in situations such as this.

For now, though, I am fine with this rewrite provided we do not lose many alerts. Do we know that the experiment in progress does contain a bunch of alerts (it looks to be the nightly suite)?

tausbn · 2022-05-16T20:26:08Z

The experiment suite does contain results (e.g. for saltstack/salt), but there was something weird in the last run, so I restarted it a short while ago.

RasmusWL · 2022-05-17T11:38:12Z

python/ql/src/Security/CWE-079/Jinja2WithoutEscaping.ql

+    any(DataFlow::LocalSourceNode n | n.asExpr().(ImmutableLiteral).booleanValue() = false)
+        .flowsTo(getAutoEscapeParameter(call))


if call was an API::CallNode you could use the following code

Suggested change

-    any(DataFlow::LocalSourceNode n | n.asExpr().(ImmutableLiteral).booleanValue() = false)
-        .flowsTo(getAutoEscapeParameter(call))
+    call.getKeywordParameter("autoescape").getAValueReachingRhs().asExpr().(ImmutableLiteral).booleanValue() = false

to also get global flow -- that's how I've been tracking boolean values in other library modeling, so I think doing it here would also make sense 😊

doing this would require inlining getAutoEscapeParameter, which I think is fine

tausbn · 2022-05-17

I just tried this, and it introduced a false positive in

    def checked(cond=False):
        if cond:
            e = Environment(autoescape=cond)  # GOOD (but now has an alert)

So I'm inclined not to do it at the present time. I think this may have to do with flow coming out of default values not being quite right.

I'll switch from DataFlow::CallCfgNode to API::CallNode anyway, though, as that'll set us up better for the future.

yoff · 2022-05-20

> I think this may have to do with flow coming out of default values not being quite right.

Is it not that the if does not act as a barrier, since there is really not a concept of value involved in type tracking? It seems correct to take the default value into account.

tausbn · 2022-05-20

Ah, you're right. I think I had somehow inverted the logic in my head, thinking it was cond=True rather than cond=False, and that we were seeing the alert due to missing flow (to a call that was in fact safe).
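
To make the corrected reading concrete, here is a small runnable version of the snippet above. It is a sketch under assumptions: `Environment` is a stand-in for `jinja2.Environment`, and the `return` statements are added only so the behaviour can be observed. At run time the default `False` never reaches the call because the guard filters it out, but an analysis with no notion of values still sees the default flowing to the argument.

```python
class Environment:
    """Stand-in for jinja2.Environment (assumption); only records autoescape."""

    def __init__(self, autoescape):
        self.autoescape = autoescape


def checked(cond=False):
    if cond:
        # Only reached when cond is truthy, so autoescape is never False here.
        return Environment(autoescape=cond)
    # With the default (or any falsy value), no Environment is created.
    return None
```

Calling `checked()` creates no environment at all, while `checked(True)` creates one with escaping enabled, which is why the call is marked GOOD.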

This makes me wonder, though, if we couldn't incorporate these barriers into our local flow (and hence type tracking). Consider e.g. splitting the local flow relation into two forms, one for truthy values, and one for falsy values. We could then only allow flow into the body of an if like the above if it is of the correct truthiness. (The general local flow relation would then be the union of the truthy and falsy relations.)

It's probably not worth the effort, but it would be an interesting experiment.
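
A toy model of the idea, purely for illustration: this is not how CodeQL's local flow or type tracking is implemented, and the function names are hypothetical. It treats "flow" as a set of candidate values and splits it by truthiness, so that only truthy candidates may enter the body of an `if x:`.

```python
def truthy_flow(values):
    """Candidate values allowed to flow into the body of `if x:`."""
    return {v for v in values if v}


def falsy_flow(values):
    """Candidate values allowed to flow into the else-branch of `if x:`."""
    return {v for v in values if not v}


def general_flow(values):
    """The unrestricted relation is the union of the two halves."""
    return truthy_flow(values) | falsy_flow(values)


# Candidates for `cond` in `checked`: the default False plus a caller's True.
candidates = {False, True}
inside_if_body = truthy_flow(candidates)
```

Under this model the `False` default is not among the values reaching `Environment(autoescape=cond)` inside the guard, so the alert on `checked` would not be raised, while the union still recovers the full candidate set for code that is not guarded.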

yoff · 2022-05-23

> to also get global flow -- that's how I've been tracking boolean values in other library modeling, so I think doing it here would also make sense 😊

I am generally fine with rewriting pointsTo into getAValueReachingRhs as a quick way to convert our queries. I will submit, though, that this does not really get global flow, but a different global property, and in cases where global flow is important, we should use global data flow.

Introduces a false positive, but arguably that false positive should have been there with the local flow as well.

yoff

LGTM, if we get many FP reports, we can rewrite it or lower the precision as appropriate.

tausbn added the no-change-note-required label (This PR does not need a change note) May 12, 2022

github-actions bot added the Python label May 12, 2022

tausbn marked this pull request as ready for review May 12, 2022 13:50

tausbn requested a review from a team as a code owner May 12, 2022 13:50

yoff previously approved these changes May 16, 2022

RasmusWL requested changes May 17, 2022

Python: Use API::CallNode

ba8d73c

tausbn dismissed yoff’s stale review via ba8d73c May 17, 2022 12:00

tausbn requested a review from RasmusWL May 17, 2022 12:01

Python: Use API-graph flow for boolean tracking

ea32299

Introduces a false positive, but arguably that false positive should have been there with the local flow as well.

RasmusWL approved these changes May 17, 2022

yoff approved these changes May 23, 2022

yoff merged commit 23d64ff into github:main May 23, 2022
