
[Feature] better work with large query responses #385

Closed
kasimtj opened this issue Dec 25, 2023 · 7 comments

Comments

@kasimtj
Contributor

kasimtj commented Dec 25, 2023

Is your feature request related to a problem? Please describe.
We ran into high memory consumption in chproxy and found two problems.

  1. Reading a large payload from Redis allocates a new 2 MiB buffer on every cycle, and the garbage collector is unable to free the memory promptly.
    [screenshot: heap profile, 2023-12-22]

  2. If a proxied request fails or is canceled, chproxy tries to extract the error reason, and a large tmp file containing the partial response is read into memory.
    [screenshot: heap profile, 2023-12-25]

Describe the solution you'd like

  1. We worked around the first problem with max_payload_size by simply not caching large responses (config sketch below). It would still be better to reuse buffers instead of allocating new ones, although that seems to be a limitation of the go-redis library.
  2. Setting wait_end_of_query=1 in ClickHouse might solve our problem, but it would not stop chproxy from reading large tmp files entirely, and it increases proxy request latency. Possible solutions (see the sketch after this list):
  • not reading the tmp file if the response code is 200 (no partial response is read)
  • limiting the size of the error reason, analogous to max_payload_size
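
For reference, this is roughly how the max_payload_size workaround looks in the cache config. A sketch only: the field layout follows our reading of the chproxy docs for a Redis cache, and the address and size values are illustrative:

```yaml
caches:
  - name: "redis_cache"
    mode: "redis"
    redis:
      addresses: ["127.0.0.1:6379"]  # illustrative address
    expire: 1h
    # Responses larger than this are never cached, so they are also never
    # read back from Redis in 2 MiB blocks.
    max_payload_size: 100MB
```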
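
And a minimal sketch of the two bullets above. Everything here is illustrative: fetchErrorReason and errorReasonLimit are hypothetical names, not chproxy's actual identifiers:

```go
package example

import (
	"io"
	"net/http"
)

// errorReasonLimit caps how much of a failed response is pulled back into
// memory, in the same spirit as max_payload_size (hypothetical constant).
const errorReasonLimit = 1 << 20 // 1 MiB

// fetchErrorReason skips the tmp file entirely on success and reads at most
// errorReasonLimit bytes of it on failure.
func fetchErrorReason(statusCode int, tmpFile io.Reader) (string, error) {
	if statusCode == http.StatusOK {
		// Success: the partial response stays on disk untouched.
		return "", nil
	}
	reason, err := io.ReadAll(io.LimitReader(tmpFile, errorReasonLimit))
	if err != nil {
		return "", err
	}
	return string(reason), nil
}
```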

Describe alternatives you've considered

Additional context

@kasimtj
Copy link
Contributor Author

kasimtj commented Dec 25, 2023

Example fix for the second problem: #386

@mga-chka
Collaborator

mga-chka commented Jan 3, 2024

Hi, I took a quick look at the code.
Indeed, the code fetches data from Redis in 2 MB blocks for large queries. Since go-redis doesn't provide an API to reuse a buffer, the fetch call allocates a new 2 MB buffer on each cycle.

That being said, those buffers should be garbage collected, and I don't see where in the code a memory leak could occur. Just to be sure, did you verify during your test that the garbage collector was actually triggered?
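
For illustration, the reuse you are asking for would be the standard sync.Pool pattern; this is a generic sketch, not something go-redis currently lets us plug in:

```go
package example

import "sync"

const chunkSize = 2 << 20 // 2 MiB, matching the block size above

// bufPool recycles fetch buffers so each cycle reuses memory instead of
// leaving a fresh 2 MiB allocation behind for the garbage collector.
var bufPool = sync.Pool{
	New: func() any { b := make([]byte, chunkSize); return &b },
}

// Usage per fetch cycle:
//
//	buf := bufPool.Get().(*[]byte)
//	n, _ := src.Read(*buf) // fill from Redis
//	// ... consume (*buf)[:n] ...
//	bufPool.Put(buf)
```

(Storing *[]byte rather than []byte avoids an extra allocation when the slice is boxed into the pool's interface value.)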

NB: I'll take a look at your fix for problem 2 this week.

@kasimtj
Contributor Author

kasimtj commented Jan 8, 2024

Hi, thank you for the response; fixing these bugs is really important for our team right now.

With Redis, we think the main problem is that the garbage collector is not triggered immediately, so the heap grows much faster than it gets cleaned up. We will run additional tests on it; it's possible the profiling showed total allocated memory rather than peak memory usage (see the commands below).
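
Concretely, we'll re-check which sample index the heap profile used; these are standard go tool pprof flags (the binary and profile names are just ours):

```sh
# alloc_space counts every byte ever allocated (cumulative total),
# while inuse_space shows only memory still live at profile time.
go tool pprof -sample_index=alloc_space ./chproxy heap.pprof
go tool pprof -sample_index=inuse_space ./chproxy heap.pprof
```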

If your team is OK with the general idea of the fix for the second problem, I can send a proper PR with tests etc.

@mga-chka
Collaborator

mga-chka commented Jan 8, 2024

Yes, we're OK with your second fix.
If you make the changes I asked for in your PR, we'll merge it.

@mga-chka
Collaborator

I'm closing this issue since you have a fix for 1 and your fix for 2 will soon be merged.
Feel free to reopen it if needed.

@Blokje5
Collaborator

Blokje5 commented Jan 16, 2024

@kasimtj 1.26.0 is released and includes your fix.

@kasimtj
Contributor Author

kasimtj commented Jan 17, 2024

> @kasimtj 1.26.0 is released and includes your fix.

Thanks a lot for the quick response!
