Additional security section on fragmentation reassembly attacks #444

huitema · 2017-04-19T17:25:08Z

Describe the equivalent of the Teardrop attack for QUIC, and propose mitigation.

I have lots of emotional things to say about such checks...

martinthomson

This doesn't talk about flow control. It needs to.

Like #443, I think that this is far more detailed than we need. The point of this is to make an implementer aware that a malicious peer might intentionally fragment the data on receive buffers in order to cause disproportionate memory commitment (either disproportionate to the number of bytes that were transmitted, or disproportionate to the flow control offset that was provided, in practice probably both are necessary to make the attack worthwhile). This can be said more concisely, I think.

The most interesting case for this attack is where receivers over-commit memory and advertise flow control offsets in the aggregate that exceed actual available memory. This strategy works in most cases given that most clients are not attempting denial of service. The very tail of a receive window is rarely needed in practice. Over-commitment fails badly when under this kind of attack.

martinthomson · 2017-04-20T00:38:18Z

draft-ietf-quic-transport.md

+An adversarial client may attempt to
+exhaust server memory resource by performing
+a stream fragmentation and reassembly attack, similar to the UDP/ICMP
+"Teardrop" fragmentation attacks. The adversarial client would open a stream,


Just dropping the name quoting. Could not find a good Teardrop reference.

martinthomson · 2017-04-20T00:38:57Z

draft-ietf-quic-transport.md

@@ -2697,6 +2697,43 @@ also be forward-secure encrypted.  Since the attacker will not have the forward
 secure key, the attacker will not be able to generate forward-secure encrypted
 packets with ACK frames.

+## Stream fragmentation and reassembly attacks


martinthomson · 2017-04-20T00:39:27Z

draft-ietf-quic-transport.md

@@ -2697,6 +2697,43 @@ also be forward-secure encrypted.  Since the attacker will not have the forward
 secure key, the attacker will not be able to generate forward-secure encrypted
 packets with ACK frames.

+## Stream fragmentation and reassembly attacks
+
+An adversarial client may attempt to


This is any endpoint, though I agree that it's (usually) not very interesting for a server to mount the attack.

OK. Rewriting to"endpoint" instead of client.

martinthomson · 2017-04-20T00:49:23Z

draft-ietf-quic-transport.md

+This attack can be mitigated by not
+committing memory for stream data reassembly,
+and simply keeping the STREAM DATA frames until enough fragments have been
+received and the data can be delivered to the application in proper sequence.


Saving STREAM frames only works if the data provided is sufficiently sparse, at some point the overhead of saving the frames exceeds the overheads of assembling the data into a buffer and tracking the holes.

The real mitigation is not to over-commit on flow control.

Saving frames is the only way the connection-level flow control window makes sense. Otherwise, you'd have to commit (number of streams)*(stream flow control window) memory.

Hmm, that's true. That suggests a different way to write this: assume that frames are saved (and maybe merged opportunistically). Then the attack is on the overhead associated with saved frames.

Depending on what is meant by using up system memory, an attack may focus on locking out other connections new, or existing by forcing low congenstion windows. Avoiding overcommits can make this situation worse if the attacker succesfully increases flow control budgets.

For practical large scale solutions, the implementation needs to overcommit very significantly. There can be 100K connections of which only a few hundred are active. Each of these need write capacity immediately. If not, it both impacts responsiveness, and it ties up resources by having more concurrent active work going on. The same applies by number of streams vs active streams in some use cases while other cases normally expect streams to be active or closed. If each connection classified as active gets a connection level budget, it doesn't really matter what the stream budget is - this is more for the application consumption management. If the connection budget is abused with holes, it just hurts throughput of the sender and could limit the ability to start new streams. The real problem is to decide which connections are active and which are sleeping without preventing fast rampup of new and sleeping connections, and how to throttle back when connections are no longer active. The attack is then to appear active while commiting the least possible sender resources. A heuristic could be the age of holes. If retransmission does not kick in timely, packets could be dropped deliberately on that connection despite having a reasonable connection level budget.

I should perhaps clarify that in the above, a connection level budget is not a linear function of flow control. There is a fixed amount of internal memory, and as that is released, the congestion window is expanded. So storing lots of fragments will use memory faster and release memory slower, and thus reduce the connection level congestion window. And, when the memory fills, packets starts to drop. In this way, the worst case is that the full budget is consumed with holes, whereas a friendly peer would fill the same budget with linear stream data. The adversary can only create so many holes before the cost of whole punching is more expensive than linear data. Of course, there a endless different ways this can be handled, and it depends on the use case, risk, degradation when not under attack, etc., therefore it is hard to provide general advise.

As you say, Mikkel, "it is hard to provide general advice". So maybe that's own we should rewrite the "advice" part of this PR. Something like "It is hard to provide general advice. QUIC deployments SHOULD provide mitigations against the stream fragmentation attack, which MAY be avoiding over-committing memory, delaying reassembly of STREAM DATA frames, or implementing heuristics based the age and duration of reassembly holes."

huitema · 2017-04-20T03:45:28Z

Shortened the text, added reference to flow control. The point is that (some) receivers will over-commit, and will need to mitigate the attack. This will require some kind of heuristic. I proposed one -- counting holes, and if they are not commensurate with the packet loss rate abort the connection. If you believe there is something smarter to do, please chime in.

Aron-Schats · 2017-04-20T17:47:46Z

On Wed, Apr 19, 2017 at 9:15 PM, Marten Seemann ***@***.***> wrote: Saving frames is the only way the connection-level flow control window makes sense. Otherwise, you'd have to commit (number of streams)*(stream flow control window) memory.

I don't see it: why would you have to commit more than the connection flow control window?

huitema · 2017-04-21T01:13:22Z

Aaron: "why would you have to commit more than the connection flow control window?" It happens if the sum of the per stream windows is larger than the congestion window. For example, when the endpoint cannot predict which of the streams the other endpoint will fill first.

Since there is no "one size fits all" mitigation, simplify the recommendations. The point is to draw attention to the problem, and trust developers to do the right thing.

huitema · 2017-04-21T01:22:42Z

Modified the mitigations part. Martin, I think that the new text addresses your review. Can you give it a look? Thanks.

martinthomson

Yep, looks fine. I'll give others a chance to poke at this a little before merging.

martinthomson · 2017-04-21T01:27:31Z

draft-ietf-quic-transport.md

+The attack is mitigated if flow control windows correspond to
+available memory. However, some receivers will over-commit memory and advertise
+flow control offsets in the aggregate that exceed actual available memory.
+The over-commitment strategy may leads to better performance when


"may leads" -> "can lead"

martinthomson · 2017-04-21T01:27:57Z

draft-ietf-quic-transport.md

+the stream fragmentation attack.
+
+QUIC deployments SHOULD provide mitigations against the stream fragmentation
+attack. Mitigations MAY consist of avoiding over-committing memory, delaying


Don't use "MAY" here, it's not permissive. "could" is fine.

martinthomson · 2017-04-21T01:28:09Z

draft-ietf-quic-transport.md

+
+QUIC deployments SHOULD provide mitigations against the stream fragmentation
+attack. Mitigations MAY consist of avoiding over-committing memory, delaying
+reassembly of STREAM DATA frames, implementing heuristics based on the


"STREAM frames"

OK, I believe I fixed all that...

huitema added 3 commits April 19, 2017 10:24

Additional security section on fragmentation reassembly attacks

cdbfb64

Describe the equivalent of the Teardrop attack for QUIC, and propose mitigation.

Update draft-ietf-quic-transport.md

dade254

Removing trailing spaces on some of changed lines

6a0337c

I have lots of emotional things to say about such checks...

martinthomson requested changes Apr 20, 2017

View reviewed changes

Simplifying the text, per Martin's review

d244484

Simplifying the mitigation text

46f293c

Since there is no "one size fits all" mitigation, simplify the recommendations. The point is to draw attention to the problem, and trust developers to do the right thing.

And fixing a typo.

040a5fe

martinthomson approved these changes Apr 21, 2017

View reviewed changes

huitema added 3 commits April 20, 2017 18:32

STREAM frames.

a168b3b

Can lead

31872b3

MAY -> could

b47c030

martinthomson merged commit b47c030 into quicwg:master Apr 24, 2017

martinthomson mentioned this pull request Apr 28, 2017

Known attacks #440

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Additional security section on fragmentation reassembly attacks #444

Additional security section on fragmentation reassembly attacks #444

huitema commented Apr 19, 2017

martinthomson left a comment

martinthomson Apr 20, 2017

huitema Apr 20, 2017

martinthomson Apr 20, 2017

martinthomson Apr 20, 2017

huitema Apr 20, 2017

martinthomson Apr 20, 2017

marten-seemann Apr 20, 2017

martinthomson Apr 20, 2017

mikkelfj Apr 20, 2017

mikkelfj Apr 20, 2017 •

edited

huitema Apr 20, 2017

huitema commented Apr 20, 2017

Aron-Schats commented Apr 20, 2017 via email

huitema commented Apr 21, 2017

huitema commented Apr 21, 2017

martinthomson left a comment

martinthomson Apr 21, 2017

martinthomson Apr 21, 2017

martinthomson Apr 21, 2017

huitema Apr 21, 2017

Additional security section on fragmentation reassembly attacks #444

Additional security section on fragmentation reassembly attacks #444

Conversation

huitema commented Apr 19, 2017

martinthomson left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mikkelfj Apr 20, 2017 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

huitema commented Apr 20, 2017

Aron-Schats commented Apr 20, 2017 via email

huitema commented Apr 21, 2017

huitema commented Apr 21, 2017

martinthomson left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mikkelfj Apr 20, 2017 •

edited