Performance improvement on nonblocking sockets #838

misalcedo · 2023-11-22T03:04:48Z

Closes #836

The goal of this change is to improve the performance of excon while maintaining a low number of allocations.

This change is similar in nature to what Net::HTTP does in https://github.com/ruby/ruby/blob/v3_2_2/lib/net/protocol.rb#L166 .

The following is a zipped archive of 3 ruby-prof profiles measuring allocations. From the profiles, it is evident the number of allocations in OpenSSL calls and Socket calls remains very similar (still significantly less than v0.71.0).

Excon.zip

Benchmark

Run on a 32-core Codespace with nothing else running. When run on a 4-core Intel MacBook Pro, Excon is fast enough to not be a bottleneck (the local docker container was my bottleneck).

Modified:      381.1 i/s
v0.104.0:      258.3 i/s
v0.71.0:       364.4 i/s

geemus · 2023-12-05T16:53:19Z

@misalcedo Thanks, I do hope to review this soon, but have been a bit behind with holiday travel and catching up on other things. So thanks for your patience also in the interim.

misalcedo · 2023-12-05T20:10:48Z

No worries, take as much time as you need.

lgo · 2023-12-10T04:23:01Z

@misalcedo, this is absolutely fantastic - thank you! I'll also try to take a read through and see what I can do to test this internally at Stripe, but I'm not sure I'll have time until closer to the holidays. Embarrassingly, we still have not bumped versions after having internal breakages from #796 (🤞 resolved by v0.95.0).

I recently made some observations that the current code, while much lower for allocs, did negatively impact total allocated memory. The header string would be re-allocated as each line was processed, getting smaller and smaller. (My naive understanding is that the former is much worse than the latter)

From a quick skim of your changes, I believe you may have also addressed that but I'll do a more thorough read later.

…of gems.

misalcedo · 2023-12-12T20:48:32Z

The more eyes on it the better. I haven't tried to do any memory profiling, but that should be fairly simple since I setup the benchmark with Ruby-prof. As long as you are not using Ruby 3 that is. Ruby-prof segfaults for me whenever I try to do memory profiling on Ruby 3.2.2. I think this is because the garbage collector is different, but not entirely sure.

I opened ruby-prof/ruby-prof#326 with ruby-prof

lgo · 2023-12-12T22:37:27Z

Yep, I've faced similar segfault issues. When I last measured this, I used https://github.com/Shopify/memory_profiler. You can wrap a block with the profiler and get pretty granular information on (1) alloc count, (2) source of allocs, (3) size and specific strings allocated. This is how we've spotted both the single-char alloc issues and the large line-by-line allocs.

When I get to testing this internally, I'll also spin up the memory profiler and check. It's been almost a year since I last walked through profiling Excon but I have a bunch of notes + hacked up PRs from the last time that I'll look back on.

geemus

Apologies for the delays, I wanted to make sure I could find a time when I was fresh and focused to give this a thorough look. Overall it seems great, thanks again for your work and help. I had a couple minor suggestions about naming/cleanup that I'd like to discuss, but other than that I think it's very nearly there. Thanks!

lib/excon/socket.rb

…buffer.length

geemus

Thanks, I think the readable_bytes change does help a lot in terms of quickly and easily understanding.

geemus · 2023-12-15T15:04:06Z

@misalcedo thanks again for the work you did here and your collaboration in polishing it.

@lgo I'll be curious to hear what you find, and would welcome any notes/code you have that might help us profile in the future (it's not something I have much experience with either). Also, just let me know if there is anything you need so that you all can get bumped to a newer version.

lgo · 2023-12-15T15:43:16Z

I'll be curious to hear what you find, and would welcome any notes/code you have that might help us profile in the future (it's not something I have much experience with either).

Absolutely, I'd be more than happy to share notes on the whole process and see what I can do to provide a simple/reproducible suite (maybe just tweaking @misalcedo's).

@misalcedo - thanks again for these awesome changes as well as the benchmarks to keep this a standard!

misalcedo · 2023-12-15T16:02:36Z

Contributing was actually pretty fun for me, so glad I could help. I played around with the memory profiler but struggled to process the vast amount of information it provided. Now I have something new to learn this weekend.

misalcedo added 2 commits November 21, 2023 19:06

Re-use the read_buffer when empty.

7162c91

Improve system call performance while still passing all tests.

2331a1b

misalcedo mentioned this pull request Nov 22, 2023

Performance regression since version 0.71 #836

Closed

misalcedo added 3 commits November 22, 2023 08:47

Add HTTPBin benchmark

b3a7203

Document how to run against a local endpoint

d9cc5a9

Add more comments and rename things a bit.

cfbdb0f

misalcedo marked this pull request as ready for review November 22, 2023 14:34

Update httpbin.rb to simplify the generation of the data

833f3e9

misalcedo added 2 commits December 12, 2023 20:12

Define a source to enable the inline script to run with it's own set …

012bbb1

…of gems.

Use version 1.5 of ruby-prof as 1.6.3 segfaults on ruby 3.2.2

0014ef9

geemus requested changes Dec 13, 2023

View reviewed changes

lib/excon/socket.rb Outdated Show resolved Hide resolved

lib/excon/socket.rb Show resolved Hide resolved

Rename buffer_length to readable_bytes to avoid confusion with @read_…

af5ebcd

…buffer.length

misalcedo requested a review from geemus December 15, 2023 13:33

geemus approved these changes Dec 15, 2023

View reviewed changes

geemus merged commit f95758f into excon:master Dec 15, 2023
6 checks passed

misalcedo deleted the misalcedo/reuse branch December 15, 2023 16:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance improvement on nonblocking sockets #838

Performance improvement on nonblocking sockets #838

misalcedo commented Nov 22, 2023 •

edited

geemus commented Dec 5, 2023

misalcedo commented Dec 5, 2023

lgo commented Dec 10, 2023

misalcedo commented Dec 12, 2023 •

edited

lgo commented Dec 12, 2023 •

edited

geemus left a comment

geemus left a comment

geemus commented Dec 15, 2023

lgo commented Dec 15, 2023 •

edited

misalcedo commented Dec 15, 2023

Performance improvement on nonblocking sockets #838

Performance improvement on nonblocking sockets #838

Conversation

misalcedo commented Nov 22, 2023 • edited

Benchmark

geemus commented Dec 5, 2023

misalcedo commented Dec 5, 2023

lgo commented Dec 10, 2023

misalcedo commented Dec 12, 2023 • edited

lgo commented Dec 12, 2023 • edited

geemus left a comment

Choose a reason for hiding this comment

geemus left a comment

Choose a reason for hiding this comment

geemus commented Dec 15, 2023

lgo commented Dec 15, 2023 • edited

misalcedo commented Dec 15, 2023

misalcedo commented Nov 22, 2023 •

edited

misalcedo commented Dec 12, 2023 •

edited

lgo commented Dec 12, 2023 •

edited

lgo commented Dec 15, 2023 •

edited