
rough in server scheduling guidance #1266

Merged 13 commits into master from priority-server-scheduling on Sep 30, 2020

Conversation

LPardue
Contributor

@LPardue commented Sep 17, 2020

This tries to address two sets of comments on "what signals are subsumed by priority hints" and "how should a server implement things".

One aspect of this is setting client expectations straight; they should have a vague idea what will happen if a server plays ball but also expect that servers can and will do whatever they want.

The other aspect of this is describing what signals servers have at their disposal, and presenting some of the gotchas or tradeoffs that might arise if the extensible priority scheme is implemented too literally. Some people would like to see more explicit guidance for servers, and that's a fine request. However, I don't see how any single scheduling algorithm would work for the range of vendors and deployments that have shown an interest in Priorities, so I've focused on the common criteria.

While adding this section, the thought did cross my mind to move the sections on scheduling. We can always do that as a followup.

Closes #1216 and #1232.

cc @ekinnear, @guoye-zhang, @martinthomson

given urgency level can align well with clients usage of HTTP; such as user
agents that load document trees where ordering is important.

For non-incremental resources the total download time (time to first byte - time
Contributor

isn't it time from request to last byte delivered?

Contributor Author

This is probably a little open to interpretation, so it would be good to hammer it down. Looking at this again, I don't like how I presented it and will tweak it. I was basing it on my understanding of curl and Chrome as shown here https://blog.cloudflare.com/a-question-of-timing/. Do other clients have alternative views?

My thinking, based on other discussions, is that a non-incremental payload can only be used when the whole payload is received. Therefore, factoring in the delta between request and TTFB is not super helpful in this context. However, non-incremental objects probably do benefit from a shorter TTFB and "time to significant delineator", such as a progressive jpeg header.

Contributor

For things like JS, we often can't continue processing until we have all the bytes. That's what I meant by atomic below: the whole thing is needed, it can't be divided into smaller parts and be useful. But atomic probably has connotations that aren't helpful.

Comment on lines 550 to 551
to last byte) is important. For incremental resources chunk download times are
important, especially the first. A server that receives a mix of incremental and
Contributor

For incremental resources, the time to deliver every byte is important.

I wonder if the split here is between atomic and incremental.

Contributor Author

For some types of content not all bytes are important due to internal boundaries. I'd like to avoid falling into the trap of describing this in too much detail if possible. I tried to use "chunk" as the sub-unit that could be as small as 1 byte.

What do you mean by atomic vs incremental?

Contributor

See above.

This is a simplification: absent more specific information, the server must assume that for i=false the entire resource needs to be delivered to be of use, and that for i=true the more of the resource that is delivered, the more utility is obtained. Obviously this isn't smoothly linear and there are some bytes that don't allow any more value to be extracted than those preceding, but that is the model we are operating under.

Extensions can define cut points or chunking. For example, a stepped incremental extension that lists a number of byte offsets that contain significant value (which likely needs to come from the server...), or something that says "don't bother delivering anything less than the following chunk size, because MICE doesn't allow data to be used".
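As a rough sketch of the model just described (nothing below comes from the draft; the function and parameter names are hypothetical), utility could be modelled as zero for i=false until the whole resource arrives, as growing with each delivered byte for i=true, and as stepped if an extension defines cut points:

```python
def utility(incremental, delivered, total, cut_points=None):
    """Toy estimate, in [0, 1], of how useful 'delivered' bytes of a
    'total'-byte response are to the client under the model above.

    incremental=False: the response is only useful once it is complete.
    incremental=True:  utility grows with every byte delivered.
    cut_points: optional sorted byte offsets (a hypothetical extension)
                below which partial delivery yields no extra value.
    """
    if not incremental:
        return 1.0 if delivered >= total else 0.0
    if cut_points:
        reached = [c for c in cut_points if c <= delivered]
        return reached[-1] / total if reached else 0.0
    return delivered / total
```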

Comment on lines 553 to 555
factors. An unbalanced scheduler might prefer one type over another, leading to
sub-optimal loading and in the worst case starvation of one type. Servers are
RECOMMENDED to avoid starvation but no specific method of doing so is prescribed.
Contributor

I don't get this point about unbalanced scheduling. It seems to imply that there might be some other reason to balance between atomic and incremental resources, but I don't think that is the intent.

Contributor Author

From #1232 (comment)

Let's imagine two cases:

  1. At the same urgency level, a huge non-incremental file download has started, then a small incremental resource is requested.
  2. At the same urgency level, an incremental hanging GET is waiting for response, while a non-incremental file download is requested.

An unbalanced scheduler might be designed to completely flush one type of resource before moving on to the other. That is especially likely if resources are large in comparison to the BDP. This could starve the other type from ever getting a share. The text is attempting to say: don't do that. To avoid it, an implementation could somehow yield sending one type in a given time period. Guoye's suggestion on #1232 (comment) provides an example. I'd like to avoid recommending any specific solution.
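As a rough sketch of that kind of yielding (illustrative only; the response objects, their read(n) method, and the send_chunk callback are hypothetical and not part of the draft), a server could give the head-of-line non-incremental response one quantum per pass and round-robin a quantum across the incremental responses, so neither type locks the other out:

```python
from collections import deque

def serve_urgency_level(non_incremental, incremental, send_chunk, quantum=16 * 1024):
    """Interleave two deques of same-urgency responses so neither type starves.

    Each response object is assumed to expose read(n) -> bytes, returning
    b"" when it is finished; send_chunk(resp, data) writes to the connection.
    """
    while non_incremental or incremental:
        # One quantum for the head-of-line non-incremental response,
        # served in request order.
        if non_incremental:
            resp = non_incremental[0]
            data = resp.read(quantum)
            if data:
                send_chunk(resp, data)
            else:
                non_incremental.popleft()  # finished; move to the next one
        # One quantum for each incremental response, round-robin.
        for _ in range(len(incremental)):
            resp = incremental.popleft()
            data = resp.read(quantum)
            if data:
                send_chunk(resp, data)
                incremental.append(resp)  # rotate unfinished responses
```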

Contributor

I don't think that you want to use the word unbalanced because it carries negative connotations.

Those examples suggest a few questions:

Do you want to permit small responses to jump ahead of in-progress responses?
What if something that is nominally higher priority (by order, not urgency) can't start until a lower priority response has begun? Can it pre-empt? Can it pre-empt only based on size?
(There's another one I thought of: Do you permit non-incremental responses to start when incremental responses are in progress? Maybe you have an answer for that already.)

These imply judgment being exercised or the presence of "other inputs". It is probably sensible to allow that, but it is clearly not the intent conveyed by the signal. Maybe size-based discrimination is appropriate, but it's not something that this scheme supports, so you need to be careful.

What I would do then is to enumerate corner cases where we know that strict adherence to the scheme could end up with suboptimal results. These cases are exactly the ones that I'd include there.

This scheme can't address these cases. So you need a clear delineation between what following the scheme gets you and where you are using "special sauce". This paragraph went from describing how this scheme operates and the consequences of that straight to special sauce stuff with no pause for breath.

Contributor Author

I've reworked this paragraph to address the points here and in the other comment. See a215d40

(My force push seems to have broken GitHub's tracking; I apologize.)

Comment on lines 558 to 559
An HTTP/2 server that sends SETTINGS_DEPRECATE_HTTP2_PRIORITIES ({{disabling}})
SHOULD NOT act on HTTP/2 priority signals.
Contributor

What if that is all it gets? I think that you need to provide more exposition for this recommendation.

Contributor Author

@LPardue Sep 23, 2020

I added some more exposition in 3c7f8e6. PTAL.

Comment on lines 531 to 533
Clients can expect servers will make prioritization decisions, including
ignoring all signals. And they should expect that decisions might be based on
metadata or information beyond the scope of extensible priorities.
Contributor

This is a little odd. It says "Server's gonna do what server wanna do." in a somewhat roundabout way. How about:

Clients cannot depend on particular treatment based on priority signals. Servers can use other information to prioritize responses.

Comment on lines 544 to 545
receives concurrent requests at the same urgency level might serve the responses
one-by-one but it needs to pick an order. Serving the lowest Stream ID in a
Contributor

I think that the "might serve the responses one-by-one" is distracting, and the phrasing here implies far more discretion for servers. Maybe just

Prioritizing concurrent requests at the same urgency level based on the stream ID, which corresponds to the order in which clients make requests, ensures that clients can use request ordering to influence response order.
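A sketch of the ordering that wording describes (the response attributes below are illustrative, not from the draft): sort by urgency first, then break ties with the stream ID so that request order decides.

```python
def schedule_order(responses):
    """Order responses by urgency (lower value = more urgent), breaking
    ties by the stream ID the request arrived on; lower stream IDs
    correspond to earlier requests, so request order influences
    response order within an urgency level."""
    return sorted(responses, key=lambda r: (r.urgency, r.stream_id))
```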


Contributor

@kazuho left a comment

Thank you for all the work. Left some comments. PTAL.

to prioritization. Prioritizing concurrent requests at the same urgency level
based on the Stream ID, which corresponds to the order in which clients make
requests, ensures that clients can use request ordering to influence response
order.
Contributor

How about moving the contents of this paragraph to the one above that talks about urgency, and going like: When there are multiple responses with same urgency, a server SHOULD ...

Contributor Author

I tried this and didn't like what I came up with. The progression from urgency, to incremental, to request order feels more natural to me. Combining the text as you suggest also produced a strange implication that request order is only important for requests at the same urgency, which I don't agree with.

I'd happily review a suggestion that avoids these problems. Perhaps we can make a separate editorial PR?

Contributor

👍 I think I can work on a separate PR. That PR can point to this PR or master, depending on how we proceed.


An HTTP/2 server implementing the Extensible Priorities scheme instead of the
HTTP/2 priority sends SETTINGS_DEPRECATE_HTTP2_PRIORITIES; see {{disabling}}. It
SHOULD NOT act on priority signals belonging to the HTTP/2 scheme. The absence
Contributor

This sentence sounds like a server cannot respect the PRIORITY frames sent by a legacy HTTP/2 client.

I think that the intent is to state something like: When a client sends SETTINGS_DEPRECATE_HTTP2_PRIORITIES, a server SHOULD NOT act ...

Contributor Author

Ok I think we come to this from different viewpoints. I was trying to address the statement in {{disabling}}

The SETTINGS frame precedes any priority signal sent from a client in HTTP/2, so a server can determine if it should respect the HTTP/2 scheme before building state.

I've reworked the paragraph to accommodate either the client or the server sending the setting; PTAL.

instead of the HTTP/2 priority scheme by sending
SETTINGS_DEPRECATE_HTTP2_PRIORITIES; see {{disabling}}. A server that sends or
receives this setting SHOULD NOT act on priority signals belonging to the HTTP/2
scheme. The absence of a client Extensible Priority signal SHOULD be treated
Contributor

I'm not sure if this sentence is correct. I think that a server is expected to respect the H2 prioritization scheme unless the client sends SETTINGS_DEPRECATE_HTTP2_PRIORITIES.

I also think that we might want to move the suggestion to {{disabling}}, as it talks about how the client handles the existence (or absence) of the settings parameter? Doing so would be fine, as we refer to that section in the previous sentence.

Contributor Author

I agree this would fit in {{disabling}}. That section includes the sentence

The SETTINGS frame precedes any priority signal sent from a client in HTTP/2, so a server can determine if it should respect the HTTP/2 scheme before building state.

So I intended the new text to build on that. My mental model was that a server might just want to cut out a lot of H2 priorities code, leaving only the bits necessary for parsing. Such a server cannot act on the signal and declares so using the deprecate setting. It's unfortunate if the client wants to continue using the old scheme, but there would not be an interop failure.

I could live with downgrading things to only provide this guidance when a server receives the setting. Would that work for you?

Contributor

I could live with downgrading things to only provide this guidance when a server receives the setting. Would that work for you?

Thanks. I think my preference goes there. IMO, we do not need to recommend servers degrade performance of legacy HTTP/2 clients. So maybe something like: When receiving SETTINGS_DEPRECATE_HTTP2_PRIORITIES, a server MUST ignore the HTTP/2 PRIORITY frames received on that connection.

Contributor Author

done in f29bd50
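A sketch of how a server might apply that recommendation (a hypothetical connection-state object; none of these names come from an actual HTTP/2 library): after the client's SETTINGS_DEPRECATE_HTTP2_PRIORITIES is seen, PRIORITY frames are still parsed for framing correctness but no longer influence scheduling.

```python
class ConnectionState:
    """Hypothetical per-connection state in an HTTP/2 server."""

    def __init__(self):
        self.peer_deprecates_h2_priorities = False

    def on_settings(self, settings):
        # 'settings' is assumed to be a dict of received SETTINGS values.
        if settings.get("SETTINGS_DEPRECATE_HTTP2_PRIORITIES") == 1:
            self.peer_deprecates_h2_priorities = True

    def on_priority_frame(self, frame):
        if self.peer_deprecates_h2_priorities:
            return  # ignore the HTTP/2 priority signal entirely
        self.apply_h2_priority(frame)  # legacy client: honor the old scheme

    def apply_h2_priority(self, frame):
        ...  # existing HTTP/2 dependency-tree handling would go here
```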


@LPardue
Contributor Author

LPardue commented Sep 30, 2020

Thanks for the contribution @kazuho. I'm going to squash and merge this.

@LPardue merged commit 741e80c into master Sep 30, 2020
@LPardue deleted the priority-server-scheduling branch September 30, 2020 12:02
richanna pushed a commit to richanna/http-extensions that referenced this pull request Oct 20, 2020
* rough in server scheduling guidance

Co-authored-by: Martin Thomson <martin.thomson@gmail.com>
Co-authored-by: Kazuho Oku <kazuhooku@gmail.com>