Duplication can use different indexing #1128

martinthomson · 2018-02-21T03:18:42Z

A duplication instruction should never reference the static table. Currently, however, this is possible, and it's not a great idea.

Duplicating static table entries is pointless: it costs space and time, and references to the new entries have longer encodings than static table references. Worse, in the current design virtually every duplication instruction needs a two-octet encoding, because exactly one entry in the dynamic table can be duplicated in one octet. Duplicating that entry is completely useless, because it does nothing to improve the entry's position in the table.
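To make the octet counts concrete, here is a minimal sketch of the HPACK prefix integer encoding (RFC 7541, Section 5.1) applied to a duplication instruction. The 6-bit prefix and the helper names are assumptions for illustration, not taken from the draft; the 6-bit prefix is inferred from the numbers quoted in this thread. The sketch assumes the unified index space, with the 61 static entries at indexes 1..61 and the dynamic table starting at 62.

```python
STATIC_TABLE_SIZE = 61  # HPACK static table occupies indexes 1..61


def encode_prefix_int(value: int, prefix_bits: int) -> bytes:
    """Encode a non-negative integer with an N-bit prefix (RFC 7541, 5.1).

    Opcode bits that would share the first octet are left as zero.
    """
    if value < 0:
        raise ValueError("value must be non-negative")
    max_prefix = (1 << prefix_bits) - 1
    if value < max_prefix:
        return bytes([value])  # fits entirely in the prefix
    out = bytearray([max_prefix])  # prefix is all ones: continuation follows
    value -= max_prefix
    while value >= 128:
        out.append((value % 128) + 128)
        value //= 128
    out.append(value)
    return bytes(out)


def duplicate_length(dynamic_position: int, prefix_bits: int = 6) -> int:
    """Octets needed to duplicate the dynamic entry at a 0-based position
    (0 = newest) when the index is written in the unified index space.

    The 6-bit prefix is an assumption, not a value from the draft.
    """
    unified_index = STATIC_TABLE_SIZE + 1 + dynamic_position
    return len(encode_prefix_int(unified_index, prefix_bits))


print([duplicate_length(p) for p in range(4)])  # [1, 2, 2, 2]
```

Under these assumptions only the newest dynamic entry (unified index 62) fits in one octet; every older entry spills into a second octet.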

mikkelfj · 2018-02-21T03:23:17Z

Can you please refer to QCRAM or HTTP on this and similar issues? It's otherwise hard to decipher context.

afrind · 2018-02-21T17:20:39Z

One proposed solution comes from the QPACK draft, which is to use a bit in the instruction to indicate if a referenced index is static or dynamic. I found that this actually simplified parts of the implementation compared to HPACK/QCRAM's unified index space. If we separate the instruction space, then we can define duplication to always operate on the dynamic table index space. It also may compress slightly better, since more dynamic indexes can be encoded by a single byte.

afrind · 2018-02-21T18:20:33Z

@mikkelfj : is the -qcram tag insufficient to differentiate? Would you prefer something in the issue title?

LPardue · 2018-02-21T18:29:18Z

When I read these issues that are pushed in email there is no visibility of the tag. As a non-editor I cannot set tags for an issue, so it makes it hard to imply scope. This situation is not unliveable. However, some projects define a namespace to be used in the issue title in order to help such situations. "qcram" and "hq" might be enough to help to disambiguate from transport, security etc.

mikkelfj · 2018-02-21T18:55:45Z

@afrind yes QCRAM in the title is fine, or qcram:, just something. The issue is with email as @LPardue says.

martinthomson · 2018-02-21T23:42:26Z

@afrind, the extra bit is less efficient overall (see also why arithmetic coding is superior to Huffman), especially given that the static table is a fixed size. I don't find the indexing that onerous, so I don't think that the extra bit would help. That is, unless we get a proposal for a new, bigger static table: having references to the dynamic table start at (for example) 200 would make the bit a good investment.

afrind · 2018-02-22T17:52:16Z

If we leave the on-the-wire index space unified, is the proposal to use a different indexing scheme for this instruction only (eg: skip the + 62)? I'm not crazy about having two different indexing schemes in the design, but otherwise it would solve the issues you raise above.

martinthomson · 2018-02-23T00:36:36Z

Well, let's decide the other thing first, because that will drive any design here. (-62 is the obvious choice, assuming we change nothing else.)

MikeBishop · 2018-02-25T14:45:32Z

Actually, the optimal change for length's sake would be to use a totally different reference point. You typically want to duplicate the things that are about to fall off the end, so count from the oldest item that hasn't been dropped yet. Since you can only send this on the control stream, both sides have a synchronized view about what that is.

That's not great for comprehension, necessarily, but....
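As a rough illustration of counting from the oldest entry, the sketch below converts between a relative-to-head index (0 = newest, HPACK style) and a relative-to-end index (0 = oldest entry not yet evicted). The function names and the sample table contents are hypothetical.

```python
def head_to_end(head_index: int, table_len: int) -> int:
    """Map a relative-to-head index (0 = newest) to a relative-to-end
    index (0 = oldest entry still in the table)."""
    if not 0 <= head_index < table_len:
        raise IndexError("index outside the dynamic table")
    return table_len - 1 - head_index


def end_to_head(end_index: int, table_len: int) -> int:
    """Inverse mapping; the transformation is its own inverse."""
    return head_to_end(end_index, table_len)


# Hypothetical dynamic table, newest entry first.
table = ["accept", "cookie", "referer", "user-agent"]

# The entry closest to eviction gets the smallest relative-to-end index.
oldest = table[end_to_head(0, len(table))]
print(oldest)  # user-agent
```

Both endpoints have to agree on the table length for this mapping to be well defined, which is why it matters that the instruction is only sent on the control stream.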

martinthomson · 2018-02-26T00:25:38Z

Right. That was in my original write-up. The reverse indexing is the most efficient.

afrind · 2018-02-26T17:14:19Z

There are already 3 different ways indexes are written on the wire: Absolute Index, Hybrid (relative to an encoded base), and Relative (HPACK style, relative to head). An implementation may also have an internal indexing scheme (e.g., indexes into an array of headers). How many bytes are we going to save by introducing a Relative-to-end style? A few every time user-agent and Accept get near the end of the table? I'd prefer to burn the bytes and keep the draft a bit simpler.

MikeBishop · 2018-02-26T18:56:20Z

Depends on how the instruction space lands, but one byte per duplicate instruction seems a likely outcome.

afrind · 2018-02-26T21:19:39Z

My question was how often do we wrap around and need to duplicate, and how many entries will need duplication? I assume it takes several or dozens of requests of normal usage to wrap. How many headers will implementations want to duplicate when a wrap occurs? I can see a browser re-adding the handful that are sent on every request. Also note that this is all gravy over HPACK, which would just evict and re-insert.

afrind · 2018-03-09T19:51:24Z

In my simulation, duplication is a very rare instruction. With a 4 KB table, my 250-request HAR issued 4 Duplicate instructions. With an 8 KB table, it never happened.

If we forgo splitting the instruction space, but only change the control instructions so that Duplicate reuses Indexed (and keep Table Size Update the same), and we subtract 62, then we could duplicate 63 entries in a single byte, and I don't think there's a big advantage to reversing the index.
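A quick check of the arithmetic, as a sketch: with an N-bit prefix, values strictly below 2^N - 1 fit in the first byte, so the number of dynamic entries reachable in one byte depends on the offset added to the dynamic position before encoding. The 6-bit prefix and the function name are assumptions for illustration.

```python
def one_byte_capacity(prefix_bits: int, offset: int) -> int:
    """Count dynamic positions whose wire value (position + offset)
    fits in the prefix alone, i.e. in a single byte."""
    limit = (1 << prefix_bits) - 1  # the all-ones prefix signals continuation
    return max(0, limit - offset)


# Unified index space: dynamic entries start at wire value 62.
print(one_byte_capacity(6, 62))  # 1

# After subtracting 62: dynamic entries start at wire value 0.
print(one_byte_capacity(6, 0))  # 63
```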

MikeBishop · 2018-04-25T23:40:36Z

Discussed with Alan; absolute indices grow without bound, so they're not a good option. Most tables have fewer than 1k entries, so any index into them can be represented in at most three bytes. You could index from the end of the dynamic table and make it always one byte, but that introduces a fourth (!!!) way to reference entries in QPACK, which seems less than ideal.

Inclination is to leave this alone unless we have data that says Duplicate instructions are getting intractably large.

martinthomson added the design and -qpack labels Feb 21, 2018

MikeBishop mentioned this issue Feb 28, 2018

QCRAM opcodes with non-byte-aligned string literals #1144

Merged

mnot added this to Headers in HTTP Mar 6, 2018

MikeBishop mentioned this issue Apr 20, 2018

Better Indexing in QPACK #1314

Merged

MikeBishop closed this as completed Apr 25, 2018

mnot removed this from Headers in HTTP May 23, 2018

mnot added the has-consensus label Mar 5, 2019
