fix: fix the bytes encode/decode for redis cache by wangxinalex · Pull Request #153 · ContentSquare/chproxy

wangxinalex · 2022-03-29T13:50:37Z

Using string(data) to convert the byte array to string introduces error in json marshal/unmarshal,
hence causes error when returning cached response from redis.

The reason is Unmarshal function in
encode/json would replace invalid UTF-8 or invalid UTF-16 pairs with U+FFFD, therefore the
payload string in redisCachePayload will actually change after json marshal/unmarshal since the
byte array may contain invalid UTF-8/UTF-16 byte, the length of payload will thereby change,
resulting the http server to find the declared length in header Content-Length mismatches the
actual length of payload.

The fix is to base64-encode/decode the byte array to string, thereby
eliminates invalid UTF-8/UTF-16 bytes.

wangxinalex · 2022-03-30T02:09:44Z

A example to reproduce the json marshal/unmarshal bug.
https://gist.github.com/wangxinalex/8924e649192bd39527313cf49b4125a5

gontarzpawel · 2022-03-30T11:53:45Z

Hello @wangxinalex! Thank you for contribution.

I'd like to understand couple of aspects:

what was the payload returned from Clickhouse that made you find out the invalid utf-8/16 bytes? would it be possible to provide a failing test fixed by your change?
it seems that json encoder can be configured to not replace invalid bytes: SetEscapeHTML(false).

wangxinalex · 2022-03-31T03:29:29Z

Dear Gontarz: Thank you for replying. 1. Actually I found out this issue when I try to connect the chproxy with DataGrip. The request is quite simple `SELECT 1 FORMAT TabSeparatedWithNamesAndTypes`. The actual length of the response payload is 90, but the declared length is 62. And that causes the http write to report an error. 1. When the result is first put into the redis cache, the payload is `bb012032a452485b03ae25a2d507665582120000000800000080310a55496e74380ade79cf087fb635049db816df195b016b820c0000000200000020310a`, the actual length and declared length are both 62. 2. However, when the same result is retrieved from cache and unmarshaled, the payload becomes `efbfbd012032efbfbd52485b03efbfbd25efbfbdefbfbd076655efbfbd1200000008000000efbfbd310a55496e74380aefbfbd79efbfbd087fefbfbd3504efbfbdefbfbd16efbfbd195b016befbfbd0c0000000200000020310a`, the actual length becomes 90, thus the actual length and declared length differ, which causes the http write error. 3. More detailed debug code can be found here. https://github.com/wangxinalex/chproxy/blob/1f0a5e7a94ae8c2351937188e1b0c94d140847f8/cache/redis_cache.go#L149 2. In my opinion, the root cause of this issue is `string(bytes)` is not the canonical way to encode byte array. Especially when the payload is to be marshaled/unmarshaled. The root problem may be described in the comment of `encoding/json/decode.go:96`. // When unmarshaling quoted strings, invalid UTF-8 or // invalid UTF-16 surrogate pairs are not treated as an error. // Instead, they are replaced by the Unicode replacement // character U+FFFD. The original byte array is just an arbitrary byte stream, and `Unmarshal` function takes it with UTF-8/UTF-16 charset and replaces some bytes silently. Thus why the length of original bytes and unmarshaled bytes are different. As you can see, the frequent `efbfbd` in the retrieved result is actually `U+FFFD`. The `SetEscapeHtml` cannot solve this behavior. As shown in https://gist.github.com/wangxinalex/885f3d53047bae62c8a454de620c9717. Therefore I use base64 to encode/decode the byte array and avoid the UTF-8/UTF-16 problem. 在 2022年3月30日 +0800 19:54，Paweł Gontarz ***@***.***>，写道： Hello @wangxinalex! Thank you for contribution. I'd like to understand couple of aspects: • what was the payload returned from Clickhouse that made you find out the invalid utf-8/16 bytes? would it be possible to provide a failing test fixed by your change? • it seems that json encoder can be configured to not replace invalid bytes: SetEscapeHTML(false). — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you were mentioned.Message ID: ***@***.***>

cache/redis_cache.go

gontarzpawel

Thank you for your clear explanation!

I've launched tests locally to verify your change does not break anything and there's one test failing.

make test

truncated output

....
--- FAIL: TestServe (0.47s)
    --- FAIL: TestServe/http_requests_with_caching_in_redis_ (0.01s)
        main_test.go:369: result from cache query is wrong: {"l":4,"t":"text/plain; charset=utf-8","enc":"","payload":"T2suCg=="}

Could you adapt it to your change please?
It'd be also beneficial if we could have the failing example that you provided, added as unit test. Could you also do that?

FYI we're working on adding CI step to verify tests.

EDIT:
rebased your PR on master please to have CI (github actions) enabled

wangxinalex · 2022-03-31T15:45:49Z

Dear @Garnek20, the failing case is adapted and a new test case is added for the changed behavior.

Using `string(data)` to convert the byte array to string introduces error in json marshal/unmarshal, hence causes error when returning cached response from redis. The reason is `Unmarshal` function in `encode/json` would replace invalid UTF-8 or invalid UTF-16 pairs with `U+FFFD`, therefore the `payload` string in `redisCachePayload` will actually change after json marshal/unmarshal since the byte array may contain invalid UTF-8/UTF-16 byte, the length of payload will thereby change, resulting the http server to find the declared length in header `Content-Length` mismatches the actual length of payload. The fix is to base64-encode/decode the byte array to string, thereby eliminates invalid UTF-8/UTF-16 bytes.

add test cases for base64 encode/decode the cached value

wangxinalex · 2022-04-01T01:34:58Z

@Garnek20 It seems that --- FAIL: TestReverseProxy_ServeHTTP1/queue_overflow_for_user (0.12s)
case failed in ci workflow, but this case passes on my local machine, do you have any ideas?

wangxinalex · 2022-04-01T02:37:52Z

@Garnek20 My best guess is

chproxy/proxy_test.go

Line 254 in 5b23001

time.Sleep(time.Millisecond * 5)

the cpu of ci runner makes the time.Sleep(time.Millisecond * 5) to sleep longer than it should be, so as to suppress the queue_overflow_error. So my suggestion is maybe should minimize the sleep time and try again.

…s ci minimize the waiting time between two consecutive requests

gontarzpawel

Thank you for adding the test! One last comment and we will be ready to merge it 🙂

cache/redis_cache.go

gontarzpawel

Thank you @wangxinalex for you contribution!

pull-request-size bot added the size/S label Mar 29, 2022

gontarzpawel reviewed Mar 31, 2022

View reviewed changes

cache/redis_cache.go Outdated Show resolved Hide resolved

gontarzpawel requested changes Mar 31, 2022

View reviewed changes

pull-request-size bot added size/M and removed size/S labels Mar 31, 2022

wangxinalex added 2 commits March 31, 2022 23:55

fix: add test case about encode/decode the cached value

648df1b

add test cases for base64 encode/decode the cached value

wangxinalex force-pushed the fix/redis-cache-bytes-encode branch from 8226e08 to 648df1b Compare March 31, 2022 15:58

wangxinalex requested a review from gontarzpawel March 31, 2022 16:00

fix: adjust the waiting time of queue_overflow_for_user case to pas…

43ac559

…s ci minimize the waiting time between two consecutive requests

gontarzpawel reviewed Apr 2, 2022

View reviewed changes

cache/redis_cache.go Outdated Show resolved Hide resolved

fix: make the cache.redisCachePayload private again

4219d9e

wangxinalex requested a review from gontarzpawel April 3, 2022 06:56

gontarzpawel approved these changes Apr 3, 2022

View reviewed changes

gontarzpawel merged commit 6cfac12 into ContentSquare:master Apr 3, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: fix the bytes encode/decode for redis cache#153

fix: fix the bytes encode/decode for redis cache#153
gontarzpawel merged 4 commits intoContentSquare:masterfrom
wangxinalex:fix/redis-cache-bytes-encode

wangxinalex commented Mar 29, 2022

Uh oh!

wangxinalex commented Mar 30, 2022

Uh oh!

gontarzpawel commented Mar 30, 2022

Uh oh!

wangxinalex commented Mar 31, 2022 via email

Uh oh!

Uh oh!

gontarzpawel left a comment •

edited

Loading

Uh oh!

wangxinalex commented Mar 31, 2022

Uh oh!

wangxinalex commented Apr 1, 2022

Uh oh!

wangxinalex commented Apr 1, 2022

Uh oh!

gontarzpawel left a comment

Uh oh!

Uh oh!

gontarzpawel left a comment

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

2 participants

Conversation

wangxinalex commented Mar 29, 2022

Uh oh!

wangxinalex commented Mar 30, 2022

Uh oh!

gontarzpawel commented Mar 30, 2022

Uh oh!

wangxinalex commented Mar 31, 2022 via email

Uh oh!

Uh oh!

gontarzpawel left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

wangxinalex commented Mar 31, 2022

Uh oh!

wangxinalex commented Apr 1, 2022

Uh oh!

wangxinalex commented Apr 1, 2022

Uh oh!

gontarzpawel left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

gontarzpawel left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

2 participants

gontarzpawel left a comment •

edited

Loading