Sanitize all multibyte chars in HpackHuffmanEncoder by Lincong · Pull Request #13546 · netty/netty

Lincong · 2023-08-13T06:41:56Z

Motivation:

To fix the following problem: during encoding, Huffman encoded headers are sanitized differently compared to non-Huffman encoded headers in HpackEncoder. As a result, characters with code point values higher than 0xFF which could be decoded to an unexpected control chars instead of '?'.

Modification:

Change how each character is sanitized in HpackHuffmanEncoder. Specifically, use the new approach [1] to replace the old approach [2].

[1] AsciiString.c2b(aChar) & 0xFF
[2] aChar & 0xFF

Expected output is 0 if aChar > 0xFF. But with the old approach, if aChar == 0x4E01, 0x4E01 & 0xFF == 1 which is incorrect.

Result:
All characters with code point values higher than 0xFF are decoded to ?s regardless of whether Huffman encoding was used during encoding.

Fixes #13540

codec-http2/src/main/java/io/netty/handler/codec/http2/HpackHuffmanEncoder.java

codec-http2/src/main/java/io/netty/handler/codec/http2/HpackEncoder.java

codec-http2/src/test/java/io/netty/handler/codec/http2/HpackEncoderTest.java

bryce-anderson · 2023-08-15T18:14:24Z

Pr looks good to me although CodeQL is complaining, all but certainly about some extra whitespace. 😄

Can we tighten up the commit message a touch: I think the core problem was that headers that ended up Huffman encoded were sanitized differently, specifically chars with values higher than 0xFF which could result in unexpected control chars instead of the '?'. In both cases the headers are corrupted: one is just safer than the other.

As an aside and as your test demonstrates, we can still emit control chars we just have to be explicit about them.

bryce-anderson

Thank you @Lincong!

Lincong · 2023-08-16T17:55:15Z

Thanks @bryce-anderson for the suggestion to improve the commit message. I have updated it and PTAL before we merge this PR!

Thanks @normanmaurer for fixing style violation (here). I have not completely set up my dev environment yet so that some style violation cannot be caught locally. I will make sure I am able to catch and fix such issues in my future PRs.

Motivation: To fix the following problem: during encoding, Huffman encoded headers are sanitized differently compared to non-Huffman encoded headers in `HpackEncoder`. As a result, characters with code point values higher than 0xFF which could be decoded to an unexpected control chars instead of `'?'`. Modification: Change how each character is sanitized in `HpackHuffmanEncoder`. Specifically, use the new approach [1] to replace the old approach [2]. [1] `AsciiString.c2b(aChar) & 0xFF` [2] `aChar & 0xFF` Expected output is `0` if `aChar > 0xFF`. But with the old approach, if `aChar == 0x4E01`, `0x4E01 & 0xFF == 1` which is incorrect. Result: All characters with code point values higher than 0xFF are decoded to `?`s regardless of whether Huffman encoding was used during encoding. Fixes #13540 --------- Co-authored-by: Norman Maurer <norman_maurer@apple.com>

Lincong · 2023-08-17T16:20:55Z

Thanks @normanmaurer for merging this PR!

Do you know an ETA for 4.1.97.Final release (including this change)?

normanmaurer · 2023-08-17T16:27:37Z

@Lincong I think sometime next week

Lincong · 2023-08-17T22:35:45Z

@Lincong I think sometime next week

I am not sure if this is something appropriate to ask for, but it will be super nice if 4.1.97.Final release can land by the end of next Wed (8/23/2023). Thanks! @normanmaurer

Lincong added 2 commits August 12, 2023 23:23

wip 1

3de634e

wip 2

853780d

Lincong commented Aug 13, 2023

View reviewed changes

codec-http2/src/main/java/io/netty/handler/codec/http2/HpackHuffmanEncoder.java Outdated Show resolved Hide resolved

wip 3

5a1e6d2

vkostyukov approved these changes Aug 14, 2023

View reviewed changes

codec-http2/src/main/java/io/netty/handler/codec/http2/HpackEncoder.java Outdated Show resolved Hide resolved

bryce-anderson reviewed Aug 14, 2023

View reviewed changes

codec-http2/src/test/java/io/netty/handler/codec/http2/HpackEncoderTest.java Show resolved Hide resolved

wip 4

1c6e091

Lincong requested a review from bryce-anderson August 15, 2023 05:11

Fix check style

5bd128d

bryce-anderson approved these changes Aug 16, 2023

View reviewed changes

Lincong requested a review from bryce-anderson August 16, 2023 17:56

normanmaurer added this to the 4.1.97.Final milestone Aug 17, 2023

normanmaurer merged commit aa07be4 into netty:4.1 Aug 17, 2023

Lincong deleted the hpack_huffman_encoder_sanitization branch August 17, 2023 04:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Sanitize all multibyte chars in HpackHuffmanEncoder#13546

Sanitize all multibyte chars in HpackHuffmanEncoder#13546
normanmaurer merged 5 commits intonetty:4.1from
Lincong:hpack_huffman_encoder_sanitization

Lincong commented Aug 13, 2023 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

bryce-anderson commented Aug 15, 2023

Uh oh!

bryce-anderson left a comment

Uh oh!

Lincong commented Aug 16, 2023 •

edited

Loading

Uh oh!

Lincong commented Aug 17, 2023

Uh oh!

normanmaurer commented Aug 17, 2023

Uh oh!

Lincong commented Aug 17, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

Lincong commented Aug 13, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

bryce-anderson commented Aug 15, 2023

Uh oh!

bryce-anderson left a comment

Choose a reason for hiding this comment

Uh oh!

Lincong commented Aug 16, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Lincong commented Aug 17, 2023

Uh oh!

normanmaurer commented Aug 17, 2023

Uh oh!

Lincong commented Aug 17, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Lincong commented Aug 13, 2023 •

edited

Loading

Lincong commented Aug 16, 2023 •

edited

Loading