ARROW-11048: [Rust] Added bench to MutableBuffer #9032

jorgecarleitao · 2020-12-28T17:20:58Z

This bench compares the behavior of MutableBuffer vs allocating Vec<u8> and using extend_from_slice.

On my computer:

mutable                 time:   [579.24 us 580.21 us 581.34 us]                    
mutable prepared        time:   [614.98 us 616.15 us 617.42 us]                             
from_slice              time:   [1.2945 ms 1.3262 ms 1.3607 ms]                        
from_slice prepared     time:   [1.0161 ms 1.0472 ms 1.0881 ms]

I.e. growing a MutableBuffer seems to be 2x faster than creating a Vec and using Buffer::from to convert it to a Buffer.

It is odd that creating a buffer with the correct capacity takes longer, though. Any ideas @jhorstmann , @Dandandan ?

github-actions · 2020-12-28T18:04:28Z

https://issues.apache.org/jira/browse/ARROW-11048

Dandandan · 2020-12-28T19:44:05Z

I am not sure what is causing the difference, I would expect it to be faster just as from_slice prepared is faster.
The main difference is that initializing with capacity 0 uses memory::allocate_aligned while the other uses a few calls to memory::reallocate. I guess the first is (quite) a bit slower somehow?

jhorstmann · 2020-12-28T19:56:11Z

I think Buffer::from currently does another copy of the vec contents because it does not take ownership and because of the current alignment and padding requirements. We probably could add a zero-copy Buffer::from_vec method if we loosen those restrictions. Another interesting benchmark would be another version of from_slice that returns a Vec instead of Buffer, to see whether the extend/reallocate logic itself could be optimized.

jorgecarleitao · 2020-12-29T11:38:25Z

just FYI, I found one reason. alloc_zeroed is expensive. I am preparing a PR where this is addressed. There is some code atm that relies on MutableBuffer::reserve and MutableBuffer::new to allocate zeros (and not undefined), which I need to handle.

Dandandan

LGTM, nice to be able to measure this / remove inefficiencies 👍

alamb · 2020-12-31T13:18:36Z

The full set of Rust CI tests did not run on this PR :(

Can you please rebase this PR against apache/master to pick up the changes in #9056 so that they do?

I apologize for the inconvenience.

codecov-io · 2021-01-01T08:06:27Z

Codecov Report

Merging #9032 (7a0fa18) into master (51672b2) will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##           master    #9032   +/-   ##
=======================================
  Coverage   82.61%   82.61%           
=======================================
  Files         202      202           
  Lines       50048    50048           
=======================================
  Hits        41347    41347           
  Misses       8701     8701

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 51672b2...7a0fa18. Read the comment docs.

alamb · 2021-01-01T11:30:30Z

I believe these benches were already added as part of #8997 (which I found out while trying to merge this PR -- LOL)

jorgecarleitao · 2021-01-02T09:00:34Z

Closing as this code was merged as part of another PR

github-actions bot added the Component: Rust label Dec 28, 2020

Dandandan mentioned this pull request Dec 29, 2020

ARROW-11053: [Rust] [DataFusion] Optimize joins with dynamic capacity for output batches #9036

Closed

This was referenced Dec 30, 2020

ARROW-11045: [Rust] Fix performance issues of allocator #9044

Closed

ARROW-11037: [Rust] Optimized creation of string array from iterator. #9016

Closed

Dandandan approved these changes Dec 31, 2020

View reviewed changes

Added new bench.

7a0fa18

jorgecarleitao closed this Jan 2, 2021

jorgecarleitao deleted the bench_alloc branch January 2, 2021 09:00

asfimport mentioned this pull request Jan 2, 2021

[Rust] Add bench to MutableBuffer #26964

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ARROW-11048: [Rust] Added bench to MutableBuffer #9032

ARROW-11048: [Rust] Added bench to MutableBuffer #9032

jorgecarleitao commented Dec 28, 2020

github-actions bot commented Dec 28, 2020

Dandandan commented Dec 28, 2020

jhorstmann commented Dec 28, 2020

jorgecarleitao commented Dec 29, 2020 •

edited

Loading

Dandandan left a comment

alamb commented Dec 31, 2020

codecov-io commented Jan 1, 2021

alamb commented Jan 1, 2021

jorgecarleitao commented Jan 2, 2021

ARROW-11048: [Rust] Added bench to MutableBuffer #9032

ARROW-11048: [Rust] Added bench to MutableBuffer #9032

Conversation

jorgecarleitao commented Dec 28, 2020

github-actions bot commented Dec 28, 2020

Dandandan commented Dec 28, 2020

jhorstmann commented Dec 28, 2020

jorgecarleitao commented Dec 29, 2020 • edited Loading

Dandandan left a comment

Choose a reason for hiding this comment

alamb commented Dec 31, 2020

codecov-io commented Jan 1, 2021

Codecov Report

alamb commented Jan 1, 2021

jorgecarleitao commented Jan 2, 2021

jorgecarleitao commented Dec 29, 2020 •

edited

Loading