Java: Delayed unsafe deserialization #8501

artem-smotrakov · 2022-03-19T20:39:36Z

Deserialization can sometimes be implemented in two steps. An untrusted serialized object can be stored in a field but actual deserialization happens only when the object is necessary. CVE-2016-6194 in RabbitMQ is an example of such scenario (GitHub issue). Untrusted data that comes from a response is stored in RMQObjectMessage.buf field. Then, deserialization happens when getObject() method is called. Currently, java/unsafe-deserialization query doesn't catch this.

I'd like to propose the following updates to the query:

Add a flow step to the query that propagates taint from a byte array to a field. I suppose this step can match quite a lot of expressions. I guess enabling this step by default for all configs may be too much, but I think it should be okay for java/unsafe-deserialization query because unsafe deserialization is a severe issue.
Add flow steps for methods in DataInput and ObjectInput that read byte arrays.
Add flow sources and steps for RabbitMQ.

With these updates, the query detects CVE-2016-6194. Let me know what you think.

Marcono1234

Hopefully these comments are useful. Feel free to consider them only suggestions since I am not a member of this project.

java/ql/lib/semmle/code/java/frameworks/JavaIo.qll

Marcono1234 · 2022-03-20T22:23:22Z

java/ql/lib/semmle/code/java/security/UnsafeDeserializationQuery.qll

+  exists(FieldAccess access, Field field | field = access.getField() |
+    field.getType().hasName("byte[]") and
+    access = fromNode.asExpr() and
+    field.getAnAccess() = toNode.asExpr()


Probably not an issue, but should this cast the access to FieldRead to explicitly exclude FieldWrite?

Suggested change

field.getAnAccess() = toNode.asExpr()

field.getAnAccess().(FieldRead) = toNode.asExpr()

Maybe. I'm going to move this to a separate PR, let's discuss it there. Thanks for the suggestion!

Marcono1234 · 2022-03-20T22:36:46Z

java/ql/test/query-tests/security/CWE-502/DelayedDeserialization.java

+        ObjectInputStream ois = new ObjectInputStream(bais);
+        return ois.readObject(); // $unsafeDeserialization
+    }
+}


Maybe it would be good to also add a test where an assignment to a byte[] field transfers taint, e.g.:
(assuming that is the reason why you chose FieldAccess instead of the more specific FieldRead for your byteArrayFieldFlowStep predicate above)

... class ObjectHolder { private byte[] buffer; public void init(byte[] tainted) throws Exception { buffer = tainted; } public Object getObject() throws Exception { ByteArrayInputStream bais = new ByteArrayInputStream(buffer); ObjectInputStream ois = new ObjectInputStream(bais); return ois.readObject(); // $unsafeDeserialization } }

Thanks for the idea!

smowton · 2022-03-21T12:10:31Z

I suggest splitting this PR into two parts: the uncontroversial addition of new sources, sinks and library-specific propagation steps, but then let's consider separately the difficult issue of when it's reasonable to treat a field write as a jump step (one that discards calling context, and supposes that probably at some point a read step takes place)

pwntester · 2022-03-24T08:34:34Z

@artem-smotrakov can you show us in which specific case you need the byteArrayFieldFlowStep? It seems like there may be some framework-specific lifecycle (the framework calls user-defined methods as part of an event loop or similar) which we could model (example)

Co-authored-by: Marcono1234 <Marcono1234@users.noreply.github.com>

artem-smotrakov · 2022-04-16T18:16:17Z

Thanks for the review and suggestions @Marcono1234 !

artem-smotrakov · 2022-04-16T18:20:42Z

I suggest splitting this PR into two parts: the uncontroversial addition of new sources, sinks and library-specific propagation steps, but then let's consider separately the difficult issue of when it's reasonable to treat a field write as a jump step (one that discards calling context, and supposes that probably at some point a read step takes place)

Hi @smowton Sure, let's split it. I'm going to add a model for JMS API as well. If you're not sure about byteArrayFieldFlowStep, how about adding an experimental query for this kind of delayed deserialization?

artem-smotrakov · 2022-04-16T18:42:47Z

hey @pwntester

can you show us in which specific case you need the byteArrayFieldFlowStep? It seems like there may be some framework-specific lifecycle (the framework calls user-defined methods as part of an event loop or similar) which we could model (example)

I got this idea while analysing a fix for CVE-2016-6194 in RabbitMQ. Here is the details.

RMQObjectMessage stores serialized data in buf field

https://github.com/rabbitmq/rabbitmq-jms-client/blob/3b907c4baea8ef9eb9f3fd73dd209d12948689fd/src/main/java/com/rabbitmq/jms/client/message/RMQObjectMessage.java#L26

Untrusted data can be written to buf by calling writeBody() method

https://github.com/rabbitmq/rabbitmq-jms-client/blob/3b907c4baea8ef9eb9f3fd73dd209d12948689fd/src/main/java/com/rabbitmq/jms/client/message/RMQObjectMessage.java#L74

But actuall deserialization of untrusted data happens later when getObject() is called

https://github.com/rabbitmq/rabbitmq-jms-client/blob/3b907c4baea8ef9eb9f3fd73dd209d12948689fd/src/main/java/com/rabbitmq/jms/client/message/RMQObjectMessage.java#L58

Honestly, it doesn't look framework-specific to me. Untrusted data flows via byte-arrays and standard Java I/O objects. That is why I thought we can cover this case in the query for unsafe deserialization. In CVE-2016-6194, untrusted data comes from an RabbitMQ message, therfore I think it makes sense to add a source for that.

As I suggested above, we could probably create an experimental query for this kind of delayed deserialization instaead of updating the existing one. What do you think?

artem-smotrakov · 2022-04-17T11:40:40Z

I suggest splitting this PR into two parts: the uncontroversial addition of new sources, sinks and library-specific propagation steps

Hi @smowton I've moved this part to #8765 . Also, I've added a model for JMS API as @pwntester suggested.

artem-smotrakov · 2022-04-17T13:19:05Z

let's consider separately the difficult issue of when it's reasonable to treat a field write as a jump step (one that discards calling context, and supposes that probably at some point a read step takes place)

@smowton I've moved this to a new experimental query so that this wouldn't affect the standard one.

#8766

Hopefully these comments are useful. Feel free to consider them only suggestions since I am not a member of this project.

@Marcono1234 I've applied your suggestions in the new pull request. Please have a look. Your commenta are as usual very useful, I am always happy to see them!

artem-smotrakov · 2022-04-17T13:23:36Z

Hi @pwntester

can you show us in which specific case you need the byteArrayFieldFlowStep? It seems like there may be some framework-specific lifecycle

Honestly, it doesn't look framework-specific to me. Untrusted data flows via byte-arrays and standard Java I/O objects.

As an example, please have a look at the test from the new PR

https://github.com/github/codeql/pull/8766/files#diff-e799d2006a36e76240a753169ed054410e13dbd379365629a410d212bc91d0e2

artem-smotrakov · 2022-04-17T13:23:56Z

This PR has been split to two

#8765 Adds new flow sources and steps for JMS API versions 1 and 2, RabbitMQ, and a few for Java I/O
#8766 Adds a new experimental query for delayed unsafe deserialization

Closing it.

artem-smotrakov added 4 commits March 19, 2022 17:45

Java: catch delayed unsafe deserialization

f76899d

Java: Added flow steps for DataInput and ObjectInput

8ba5db2

Java: Added sources and flow steps for RabbitMQ

07eb058

Added a change note

1bd1be1

artem-smotrakov requested a review from a team as a code owner March 19, 2022 20:39

github-actions bot added documentation Java labels Mar 19, 2022

artem-smotrakov mentioned this pull request Mar 19, 2022

[Java]: Delayed unsafe deserialization github/securitylab#556

Closed

2 tasks

Marcono1234 reviewed Mar 20, 2022

View reviewed changes

Fixed model for DataInput

5717926

Co-authored-by: Marcono1234 <Marcono1234@users.noreply.github.com>

artem-smotrakov mentioned this pull request Apr 17, 2022

Java: Add flow sources and steps for RabbitMQ and JMS #8765

Merged

artem-smotrakov mentioned this pull request Apr 17, 2022

Java: catch delayed unsafe deserialization #8766

Open

artem-smotrakov closed this Apr 17, 2022

artem-smotrakov deleted the rabbitmq-deserialization branch April 18, 2022 10:48

artem-smotrakov restored the rabbitmq-deserialization branch April 18, 2022 10:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Java: Delayed unsafe deserialization #8501

Java: Delayed unsafe deserialization #8501

Uh oh!

artem-smotrakov commented Mar 19, 2022

Uh oh!

Marcono1234 left a comment

Uh oh!

Uh oh!

Marcono1234 Mar 20, 2022

Uh oh!

artem-smotrakov Apr 16, 2022

Uh oh!

Marcono1234 Mar 20, 2022

Uh oh!

artem-smotrakov Apr 16, 2022

Uh oh!

smowton commented Mar 21, 2022

Uh oh!

pwntester commented Mar 24, 2022

Uh oh!

artem-smotrakov commented Apr 16, 2022

Uh oh!

artem-smotrakov commented Apr 16, 2022

Uh oh!

artem-smotrakov commented Apr 16, 2022

Uh oh!

artem-smotrakov commented Apr 17, 2022 •

edited

Loading

Uh oh!

artem-smotrakov commented Apr 17, 2022

Uh oh!

artem-smotrakov commented Apr 17, 2022

Uh oh!

artem-smotrakov commented Apr 17, 2022

Uh oh!

Uh oh!

	field.getAnAccess() = toNode.asExpr()
	field.getAnAccess().(FieldRead) = toNode.asExpr()

Java: Delayed unsafe deserialization #8501

Java: Delayed unsafe deserialization #8501

Uh oh!

Conversation

artem-smotrakov commented Mar 19, 2022

Uh oh!

Marcono1234 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Marcono1234 Mar 20, 2022

Choose a reason for hiding this comment

Uh oh!

artem-smotrakov Apr 16, 2022

Choose a reason for hiding this comment

Uh oh!

Marcono1234 Mar 20, 2022

Choose a reason for hiding this comment

Uh oh!

artem-smotrakov Apr 16, 2022

Choose a reason for hiding this comment

Uh oh!

smowton commented Mar 21, 2022

Uh oh!

pwntester commented Mar 24, 2022

Uh oh!

artem-smotrakov commented Apr 16, 2022

Uh oh!

artem-smotrakov commented Apr 16, 2022

Uh oh!

artem-smotrakov commented Apr 16, 2022

Uh oh!

artem-smotrakov commented Apr 17, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

artem-smotrakov commented Apr 17, 2022

Uh oh!

artem-smotrakov commented Apr 17, 2022

Uh oh!

artem-smotrakov commented Apr 17, 2022

Uh oh!

Uh oh!

artem-smotrakov commented Apr 17, 2022 •

edited

Loading