buffer: raw_combined allocations buffer and ref count together #7612

liewegas · 2016-02-11T17:50:38Z

Before:

[----------] 1 test from Buffer
[ RUN ] Buffer.BenchAlloc
1000000 alloc of size 16384 in 0.505902
1000000 alloc of size 4096 in 0.268716
1000000 alloc of size 1024 in 0.198454
1000000 alloc of size 256 in 0.127251
1000000 alloc of size 32 in 0.100286
1000000 alloc of size 4 in 0.099957
[ OK ] Buffer.BenchAlloc (1301 ms)
[----------] 1 test from Buffer (1301 ms total)

[----------] 1 test from BufferList
[ RUN ] BufferList.BenchAlloc
100000 alloc of size 32768 in 0.335541
100000 alloc of size 25000 in 0.328742
100000 alloc of size 16384 in 0.323622
100000 alloc of size 10000 in 0.302479
100000 alloc of size 8192 in 0.302486
100000 alloc of size 6000 in 0.304038
100000 alloc of size 4096 in 0.304013
100000 alloc of size 1024 in 0.295486
100000 alloc of size 256 in 0.199804
100000 alloc of size 32 in 0.163652
100000 alloc of size 4 in 0.165094
[ OK ] BufferList.BenchAlloc (3025 ms)
[----------] 1 test from BufferList (3025 ms total)

After:

[----------] 1 test from Buffer
[ RUN ] Buffer.BenchAlloc
1000000 alloc of size 16384 in 0.677293
1000000 alloc of size 4096 in 0.232403
1000000 alloc of size 1024 in 0.114915
1000000 alloc of size 256 in 0.108958
1000000 alloc of size 32 in 0.091280
1000000 alloc of size 4 in 0.078020
[ OK ] Buffer.BenchAlloc (1303 ms)
[----------] 1 test from Buffer (1303 ms total)

[----------] 1 test from BufferList
[ RUN ] BufferList.BenchAlloc
100000 alloc of size 32768 in 0.335772
100000 alloc of size 25000 in 0.331753
100000 alloc of size 16384 in 0.328051
100000 alloc of size 10000 in 0.309004
100000 alloc of size 8192 in 0.309229
100000 alloc of size 6000 in 0.233140
100000 alloc of size 4096 in 0.204561
100000 alloc of size 1024 in 0.205416
100000 alloc of size 256 in 0.149929
100000 alloc of size 32 in 0.164062
100000 alloc of size 4 in 0.125882
[ OK ] BufferList.BenchAlloc (2697 ms)
[----------] 1 test from BufferList (2697 ms total)

I can't figure out why the buffer::create(16384) calls are slower in
the Buffer.BenchAlloc. It goes from being a straight new raw_char(...)
to the new call, and if I switch it back on the new branch it is still
slower than the original branch... maybe the way the code was generated
vs instructino prefetch or something? Very confusing.

Anyway, aside from that regression, everything else is the same or
faster (in the 10-20% range). More so for small allocations.

cbodley · 2016-02-11T19:37:40Z

src/common/buffer.cc

+      if (!align)
+	align = sizeof(size_t);
+      size_t rawlen = ROUND_UP_TO(sizeof(buffer::raw_combined), sizeof(size_t));
+      size_t datalen = ROUND_UP_TO(len, sizeof(size_t));


does rawlen need to be padded?

for datalen, you might consider alignof(raw_combined) as a more explicit alternative to sizeof(size_t)

branch-predictor · 2016-02-15T07:40:05Z

This is how rados bench 120 write -t 128 went with and without patch, vertical axis is speed in MB/s, horizontal - time in seconds. Cluster was configured to use simple messenger.
This patch improved performance on small IO, but the difference is smaller due to more disk flushing occurring more frequently. For large I/O, the difference is close to none. I assume that with bluestore the performance difference will be even greater than that.
(Complete data here: http://ceph.predictor.org.pl/bufferlist_patch.xlsx, raw data at http://ceph.predictor.org.pl/chunktest_org.tar.gz and http://ceph.predictor.org.pl/chunktest_patch.tar.gz)

cbodley · 2016-02-26T16:24:02Z

looks good to me 👍

liewegas · 2016-02-27T20:49:49Z

needs rebase

Do not assume there is a trailing null the terminate the string. Signed-off-by: Sage Weil <sage@redhat.com>

- fix source - include larger sizes Signed-off-by: Sage Weil <sage@redhat.com>

Signed-off-by: Sage Weil <sage@redhat.com>

These eliminate most callers of buffers(), which exposes the internal list<ptr>. Signed-off-by: Sage Weil <sage@redhat.com>

Signed-off-by: Sage Weil <sage@redhat.com>

This will let us put policy create_aligned. Signed-off-by: Sage Weil <sage@redhat.com>

If the alignment is on a page boundary, or the allocation is big, a separate buffer::raw goes faster. The rest of the time, a raw_combined does. Signed-off-by: Sage Weil <sage@redhat.com>

This may as well fit the input; this doesn't relate to the append buffer. Signed-off-by: Sage Weil <sage@redhat.com>

Signed-off-by: Sage Weil <sage@redhat.com>

…ions We drop some unittest assertions about alloc buffer size. Sorry! Signed-off-by: Sage Weil <sage@redhat.com>

Signed-off-by: Sage Weil <sage@redhat.com>

buffer: raw_combined allocations buffer and ref count together Reviewed-by: Casey Bodley <cbodley@redhat.com>

liewegas added common performance labels Feb 11, 2016

cbodley reviewed Feb 11, 2016
View reviewed changes

liewegas added needs-qa wip-sage-testing and removed wip-sage-testing labels Feb 26, 2016

liewegas added 12 commits March 1, 2016 08:47

unittest_bufferlist: fix ptr move test

7723d29

Do not assume there is a trailing null the terminate the string. Signed-off-by: Sage Weil <sage@redhat.com>

unittest_bufferlist: fix append_bench

08c0d98

- fix source - include larger sizes Signed-off-by: Sage Weil <sage@redhat.com>

unittest_bufferlist: benchmark some allocations

69bcbe1

Signed-off-by: Sage Weil <sage@redhat.com>

buffer: add front(), back(), get_num_buffers() methods

04482ae

These eliminate most callers of buffers(), which exposes the internal list<ptr>. Signed-off-by: Sage Weil <sage@redhat.com>

buffer: combine data and buffer::raw into single allocation

724a493

Signed-off-by: Sage Weil <sage@redhat.com>

buffer: align unspecified allocations to a word

6be3b99

This will let us put policy create_aligned. Signed-off-by: Sage Weil <sage@redhat.com>

buffer: use raw_combined for certain allocations

73dcd26

If the alignment is on a page boundary, or the allocation is big, a separate buffer::raw goes faster. The rest of the time, a raw_combined does. Signed-off-by: Sage Weil <sage@redhat.com>

buffer: alloc right-sized buffer from read_fd

ce3e5a3

This may as well fit the input; this doesn't relate to the append buffer. Signed-off-by: Sage Weil <sage@redhat.com>

rbd-replay: s/CEPH_BUFFER_APPEND_SIZE/CEPH_PAGE_SIZE/

f2c0d5c

Signed-off-by: Sage Weil <sage@redhat.com>

buffer: size append_buffer so that it fits into page-multiple allocat…

b6ed4d3

…ions We drop some unittest assertions about alloc buffer size. Sorry! Signed-off-by: Sage Weil <sage@redhat.com>

buffer: clean up raw_combined construction

ef80690

Signed-off-by: Sage Weil <sage@redhat.com>

buffer: use alignof for raw_combined allocation arithmetic

aa2b891

Signed-off-by: Sage Weil <sage@redhat.com>

liewegas force-pushed the wip-buffer-combined branch from 58621dc to aa2b891 Compare March 1, 2016 14:14

liewegas added the wip-sage-testing label Mar 1, 2016

liewegas added this to the jewel milestone Mar 1, 2016

liewegas added a commit that referenced this pull request Mar 2, 2016

Merge pull request #7612 from liewegas/wip-buffer-combined

67696f0

buffer: raw_combined allocations buffer and ref count together Reviewed-by: Casey Bodley <cbodley@redhat.com>

liewegas merged commit 67696f0 into ceph:master Mar 2, 2016

liewegas deleted the wip-buffer-combined branch March 2, 2016 13:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

buffer: raw_combined allocations buffer and ref count together #7612

buffer: raw_combined allocations buffer and ref count together #7612

liewegas commented Feb 11, 2016

cbodley Feb 11, 2016

branch-predictor commented Feb 15, 2016

cbodley commented Feb 26, 2016

liewegas commented Feb 27, 2016

Navigation Menu

buffer: raw_combined allocations buffer and ref count together #7612

buffer: raw_combined allocations buffer and ref count together #7612

Conversation

liewegas commented Feb 11, 2016

cbodley Feb 11, 2016

Choose a reason for hiding this comment

branch-predictor commented Feb 15, 2016

cbodley commented Feb 26, 2016

liewegas commented Feb 27, 2016