No RTT samples, no persistent congestion #3889

ianswett · 2020-07-09T23:09:35Z

Adds a SHOULD NOT and clarifies that persistent congestion is across PN spaces.

I was thinking that this is how it would be implemented, but I suspect this'll end up as a design change.

kazuho

I'm not sure if this is correct.

The problem being discussed in #3875 is that PTO could be significantly reduced when obtaining the first RTT sample, and that persistent congestion is declared because that RTT sample would be used.

I think we need to do something along the lies of forbidding declaration of persistent congestion using packets sent before RTT samples were obtained, or until the RTT estimation becomes stable.

martinthomson · 2020-07-10T01:02:56Z

So a slight rewording might help, but you would have to disqualify losses associated with packets sent prior to getting an RTT estimate, as noted. But then RTT estimates are poor to start with (the first packet to establish a NAT binding can add noticeable delays, for instance).

marten-seemann · 2020-07-10T01:17:03Z

Can we just forbid persistent congestion detection for the Initial and the Handshake PN space? That would be easier to reason about than the RTT estimate you had a while ago, when you sent the first packet.

kazuho · 2020-07-10T01:36:24Z

@marten-seemann I'm sot sure if that would be sufficient as a fix, because packets belonging to ApplicationData PN space are also sent before an RTT sample is obtained (e.g., 0-RTT packets, 0.5-RTT data).

marten-seemann · 2020-07-10T01:38:52Z

@marten-seemann I'm sot sure if that would be sufficient as a fix, because packets belonging to ApplicationData PN space are also sent before an RTT sample is obtained (e.g., 0-RTT packets, 0.5-RTT data).

Right, this is what #3831 is about. Maybe we should say that only packets sent after handshake completion should be considered for persistent congestion detection?

kazuho · 2020-07-10T01:46:31Z

@marten-seemann I agree that using handshake completion (or confirmation) would be better than status quo, though it is still questionable if that's good enough.

IIUC, there are two issues:

there is no continuity between initial RTT and real RTT
the quality of RTT estimate is bad during the early stages of a connection

Using handshake completion (or confirmation) as the signal concentrates on fixing the former, but does not address the second concern. If we are to make a design change, I think it would be beneficial to spend some time looking into addressing both of the issues.

PS. Note that PTO taking RTTVAR into account does not work as a counterargument here, because we are using RTTVAR in an exceptional way (i.e. to estimate PTO of the past).

janaiyengar · 2020-07-13T02:31:36Z

Yeah, this is tricky. Can we move the discussion to the issue please?

martinthomson · 2020-07-16T04:06:36Z

Based on the discussion we've had, don't we need a more comprehensive set of changes that deal with persistent congestion, sending probe packets, and calculating RTT?

kazuho · 2020-07-16T04:50:53Z

@martinthomson I think that depends on the following two aspects:

First, the accuracy we want to have in the definition of persistent congestion (see #3875 (comment)). If we think of persistent congestion as a ballpark figure that resembles a blackhole period of somewhere around 3 PTO, during which an endpoint would have sent at least 2 or 3 packets, then I think the change proposed in this PR is sufficient.

Second, if your concern is about endpoints not arming PTO for ApplicationData and that becoming an edge of persistent congestion, then I'd assume that the wort case that we might want to discuss is when all the following conditions are met:

the TLS handshake transcript send by the server does not fit in 3 datagrams (therefore the server would have RTT estimate when sending the first 0.5 RTT data)
client spends long time in verifying the certificate chain
both the 1-RTT packets that carrried 0.5 RTT data and the first 1-RTT data (i.e. the one containing HANDSHAKE_DONE) gets lost

Under such scenario, persistent congestion would be declared. But then, the condition matches the definition of the previous paragraph. Therefore, I think we can call this a non-issue, if we are fine with what's written in the previous paragraph.

ianswett · 2020-07-16T14:02:41Z

I think we might need to do something to avoid or at least caution against terrible RTT samples, but that's being discussed in #3821

janaiyengar · 2020-07-17T01:52:43Z

This is separable from #3821, and I wouldn't mix the two. I agree with @kazuho that the design issue here is about whether 3 x PTO is about time, about packets, or both. That needs consensus. This PR is good if consensus is on the issue is that PTO is (mostly) about time. We should discuss that on #3875.

marten-seemann

Do we need to add some pseudo code?

janaiyengar · 2020-07-18T01:46:00Z

@marten-seemann : That's probably a good idea.

martinthomson · 2020-07-19T23:49:32Z

draft-ietf-quic-recovery.md

+The persistent congestion period SHOULD NOT start until there is at
+least one RTT sample, both because the length of the period is unknown and
+the PTO may be excessively conservative.


Maybe a wordier version is clearer:

"Persistent congestion SHOULD NOT consider packets that were sent prior to obtaining an RTT sample. Correctly detecting persistent congestion can depend on packets being sent on PTO expiration. The initial RTT, which is used to set the PTO timer, might be too conservative. If the first RTT sample results in a much smaller RTT, the resulting persistent congestion interval might contain too few packets for their loss to be indicative of congestion."

I find the first sentence potentially confusing, but I can make this more verbose along the lines of the second two sentences.

@martinthomson

A variation on @martinthomson suggestion

Update persistent congestion pseudocode

ianswett · 2020-07-20T16:33:55Z

I added a few lines of pseudocode and a comment. Hopefully that's helpful.

draft-ietf-quic-recovery.md

kazuho · 2020-07-21T03:09:51Z

draft-ietf-quic-recovery.md

@@ -1581,6 +1588,10 @@ Invoked when DetectAndRemoveLostPackets deems packets lost.

 ~~~
   InPersistentCongestion(lost_packets):


IIUC, this function is expected to check for losses across all packet number spaces. Assuming that this understanding is correct, I think we should not pass lost_packets as an argument (to this function or to the AreAllPacketsLost that is being called from this function), because lost_packets is a per-PN value that is obtained by calling DetectAndRemoveLostPackets(pn_space).

And I think that this mitigates the concern raised in #3831.

Agreed, it needs to be across PN spaces. Done.

I disagree. While strictly speaking this change is not wrong, it makes things more complex than necessary: An ACK can only ever establish loss in its own packet number space, never across packet number spaces. So it's totally fine to pass lost_packets to InPersistentCongestion.

My point is that passing loss_packets in the pseudo-code gives the readers false impression that persistent congestion is something to be observed per-PN-space.

An endpoint can pass arguments whatever necessary (or beneficial), but that does not mean that those arguments should appear in pseudo-code.

The comment now says:

Determine if all packets in the time period before the largest newly lost packet, including the edges and across all packet number spaces, are marked lost.

My point is that this is unnecessarily complicated, as only packets within a single packet number space, namely the packet number space the ACK was received in, can be newly lost.

@marten-seemann It is impossible to rely on largest_acked_packet_send_time, because an ACK on another PN space might have arrived later than the persistent congestion period to be declared. See the example below.

As stated in #3889 (comment), I do not think it is possible can use loss_packets, unless you implement some lossy behavior to bound state.

@kazuho That makes sense to me. The algorithm I envisioned doesn't work. Not sure how one would correctly implement an InPersistentCongestion that yields correct results across packet number spaces then...

@marten-seemann IIUC, one trivial way of implementing InPersistentCongestion is as follows:

Retain the entries in sentmap (the list of packets being sent) for at least 3 PTO.

When receiving an acknowledgement, mark a hole in the sentmap for the packet being acked.

When a loss is detected in a packet number space, check if that loss spans across more than 3 PTO, by traversing the sentmap. Then, if did, check the sentmap of other packet number spaces (by traversing through those sentmaps) to see if any ACK-eliciting packet sent in that period has been acked.

This approach might sound too trivial, as it is O(N) where N is the number of packets being lost (or in case there are multiple PN spaces in action, the number of inflight packets in other PN spaces). But in practice, I think this approach is sufficient in terms of performance, because N would be like 3 for the current packet number space, and because we would not be sending that many packets on Initial and Handshake packet number spaces.

When receiving an acknowledgement, mark a hole in the sentmap for the packet being acked.

I don't think that's trivial. Creating a "hole" is extra state that has to be cleaned up at some point.

Moving the discussion to #3939, starting from #3939 (comment). @marten-seemann I will respond there.

Co-authored-by: Jana Iyengar <jri.ietf@gmail.com>

Don't pass all the lost packets in, since this is across PN spaces.

janaiyengar

I think this is good.

martinthomson

It took me quite a while to confirm that "is first rtt sample" is good. It's slightly more expansive than what I had implemented (which records

This has the effect of including packets sent prior to getting that sample. But as this is only packets sent within a single RTT, it's guaranteed to be less than a PTO.

draft-ietf-quic-recovery.md

Co-authored-by: Martin Thomson <mt@lowentropy.net>

marten-seemann · 2020-07-22T02:08:22Z

draft-ietf-quic-recovery.md

+     // Persistent congestion cannot be declared on the
+     // first RTT sample.
+     if (is first RTT sample):
+       return false


I'm not sure if that's correct. What matters is that the packet you use for the start of the persistent congestion period was sent after you already had an RTT sample.

I agree with @marten-seemann here.

Consider the case where an endpoint receives the first ACK 10 seconds after sending the first packet (with an RTT of 10ms), then receives the next ACK in 20ms (with RTT of 10ms).

When the second ACK is being processed, the execution would pass through this if, and invoke AreAllPacketsLost. Because all the packets that were sent during the first few seconds are not acked, persistent congestion would be declared.

I think we can simply remove these lines, and rely on the fact that the comment in the pseudo-code talking about "edges." We can clarify what "edge" means, if necessary.

I think we can simply remove these lines, and rely on the fact that the comment in the pseudo-code talking about "edges." We can clarify what "edge" means, if necessary.

I'm not sure if that's sufficient. I think I'd prefer to make the pseudo-code more closely resemble what you'd have to implement. Specifically, to implement this, you'll need a new global variable time_of_first_rtt_measurement, or alternative first_packet_sent_with_measured_rtt, and then you'll have to implement a comparison with the time of the packets in lost_packets.

Specifically, to implement this, you'll need a new global variable time_of_first_rtt_measurement, or alternative first_packet_sent_with_measured_rtt, and then you'll have to implement a comparison with the time of the packets in lost_packets.

Can you implement that way? Persistent congestion is declared across all packet number spaces. That means that you'd have to consult if any acks were received in other packet number spaces during the loss period observed in lost_packets. But you cannot remember all the moments when acks were received, because the state would be unbounded.

@kazuho can you explain your reasoning around "edges" being sufficient?

@mjoras If we define "edge" as a lost packet next to a packet that has been acked, having both edges means that a series of packets being lost are surrounded by acks. As there would have been RTT samples obtained for each of the surrounding acks, there is no need for a separate condition that checks if an RTT sample has been previously obtained.

That's very cute. I would be okay with that too if we cleaned up what "edge" means and also clarify the notion of which contiguous segment should be considered for the persistent period.

Discussed offline with @janaiyengar, it has turned out that the approach of tweaking the definion of edges does not work well, because the edges (from current time) can be any packet, while RTT can only be established with an ack-eliciting packet.

mjoras · 2020-07-22T05:30:20Z

FWIW this is what mvfst (and perhaps others?) has been doing since persistent congestion was implemented, since at the time it seemed silly to calculate the period using the initial RTT. However, at the time we still had the handshake timeout and so persistent congestion did not even apply to initial and handshake packets (see #2649).

More recently our deployment has relied on PTOs for the handshake and thus this check, and there are no obvious problems with it on the Internet. I am in favor of this change even if it is perhaps not exhaustive in its coverage of unlikely pathological cases.

janaiyengar · 2020-07-24T02:58:57Z

My reaction is the same as @mjoras. I had to think a few times over to make sure that this was adequate, and I believe it is. It covers an egregious case explicitly, which is important.

ianswett · 2020-07-29T13:04:36Z

I'm merging this so Jana can pull any conflicts into #3961

No RTT samples, no persistent congestion

2d92d2c

Fixes #3875

ianswett added editorial An issue that does not affect the design of the protocol; does not require consensus. -recovery labels Jul 9, 2020

ianswett requested review from kazuho and janaiyengar July 9, 2020 23:09

kazuho reviewed Jul 9, 2020

View reviewed changes

Update draft-ietf-quic-recovery.md

414498f

ianswett removed the editorial An issue that does not affect the design of the protocol; does not require consensus. label Jul 14, 2020

janaiyengar mentioned this pull request Jul 15, 2020

Persistent congestion threshold is unreliable during the early stages of a connection #3875

Closed

ianswett added the design An issue that affects the design of the protocol; resolution requires consensus. label Jul 16, 2020

kazuho approved these changes Jul 18, 2020

View reviewed changes

janaiyengar approved these changes Jul 18, 2020

View reviewed changes

marten-seemann reviewed Jul 18, 2020

View reviewed changes

martinthomson reviewed Jul 19, 2020

View reviewed changes

ianswett added 3 commits July 20, 2020 11:24

Update draft-ietf-quic-recovery.md

5a790e1

A variation on @martinthomson suggestion

Update draft-ietf-quic-recovery.md

eabdc5c

Update draft-ietf-quic-recovery.md

a5cfbdc

Update persistent congestion pseudocode

janaiyengar reviewed Jul 21, 2020

View reviewed changes

draft-ietf-quic-recovery.md Outdated Show resolved Hide resolved

draft-ietf-quic-recovery.md Outdated Show resolved Hide resolved

draft-ietf-quic-recovery.md Outdated Show resolved Hide resolved

kazuho reviewed Jul 21, 2020

View reviewed changes

ianswett and others added 4 commits July 21, 2020 17:23

Update draft-ietf-quic-recovery.md

89e99d1

Co-authored-by: Jana Iyengar <jri.ietf@gmail.com>

Update draft-ietf-quic-recovery.md

4139dce

Co-authored-by: Jana Iyengar <jri.ietf@gmail.com>

Update draft-ietf-quic-recovery.md

1acc648

Co-authored-by: Jana Iyengar <jri.ietf@gmail.com>

Kazuho's suggestion

3ddca85

Don't pass all the lost packets in, since this is across PN spaces.

janaiyengar approved these changes Jul 21, 2020

View reviewed changes

kazuho approved these changes Jul 21, 2020

View reviewed changes

martinthomson approved these changes Jul 22, 2020

View reviewed changes

draft-ietf-quic-recovery.md Outdated Show resolved Hide resolved

Update draft-ietf-quic-recovery.md

e4601fe

Co-authored-by: Martin Thomson <mt@lowentropy.net>

marten-seemann reviewed Jul 22, 2020

View reviewed changes

This was referenced Jul 23, 2020

clarify that persistent congestion is a contigous loss across all PN spaces #3940

Closed

It's unclear if persistent congestion is a per-PN-space property #3939

Closed

janaiyengar approved these changes Jul 24, 2020

View reviewed changes

ianswett merged commit c0c3f4a into master Jul 29, 2020

ianswett deleted the ianswett-no-rtt branch July 29, 2020 13:04

This was referenced Jul 29, 2020

Rework section on persistent congestion #3961

Merged

Persistent congestion period: text and pseudo-code don't agree #3972

Closed

		@@ -1581,6 +1588,10 @@ Invoked when DetectAndRemoveLostPackets deems packets lost.

		~~~
		InPersistentCongestion(lost_packets):

No RTT samples, no persistent congestion #3889

No RTT samples, no persistent congestion #3889

Conversation

ianswett commented Jul 9, 2020 • edited Loading

kazuho left a comment

Choose a reason for hiding this comment

martinthomson commented Jul 10, 2020

marten-seemann commented Jul 10, 2020

kazuho commented Jul 10, 2020 • edited Loading

marten-seemann commented Jul 10, 2020

kazuho commented Jul 10, 2020 • edited Loading

janaiyengar commented Jul 13, 2020

martinthomson commented Jul 16, 2020

kazuho commented Jul 16, 2020

ianswett commented Jul 16, 2020

janaiyengar commented Jul 17, 2020

marten-seemann left a comment

Choose a reason for hiding this comment

janaiyengar commented Jul 18, 2020

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ianswett commented Jul 20, 2020

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

marten-seemann Jul 22, 2020 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

janaiyengar left a comment

Choose a reason for hiding this comment

martinthomson left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kazuho Jul 22, 2020 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mjoras commented Jul 22, 2020 • edited Loading

janaiyengar commented Jul 24, 2020

ianswett commented Jul 29, 2020

ianswett commented Jul 9, 2020 •

edited

Loading

kazuho commented Jul 10, 2020 •

edited

Loading

kazuho commented Jul 10, 2020 •

edited

Loading

marten-seemann Jul 22, 2020 •

edited

Loading

kazuho Jul 22, 2020 •

edited

Loading

mjoras commented Jul 22, 2020 •

edited

Loading