Restore streamInput() performance over PagedBytesReference. #5589

hhoffstaette · 2014-03-28T08:04:16Z

The initial implementation of bulk reads on a streamInput() over PagedBytesReference was slow: it copied byte by byte even when the caller asked for a bulk copy.

Times in µs for bulk-reading a stream over plain vs. paged, averaged over 1000 runs:

MB	plain µs	paged µs	Ratio
1	72	2048	28.6
2	140	4127	29.4
3	218	6208	28.4
4	430	8396	19.5
5	700	10525	15.0
10	1739	21055	12.1
20	3481	42063	12.1
50	8701	105337	12.1
100	17409	210848	12.1

This changeset restores performance:

MB	plain µs	paged µs	Ratio
1	65	68	1.05
2	134	141	1.04
3	198	235	1.18
4	456	418	0.91
5	750	761	1.01
10	1736	1743	1.00
20	3514	3497	0.99
50	8706	8700	0.99
100	17608	17731	1.00

The numbers jitter slightly due to the usual HotSpot variance, OS scheduling, etc. For all practical purposes the performance is now back to what it was before.
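For readers who want to reproduce this kind of comparison, here is a minimal sketch of averaging a timed task over many runs and computing the paged/plain ratio. It is not the harness that produced the tables above; the class name `AvgTimingSketch`, the warm-up parameter, and both method names are assumptions made for illustration only.

```java
/** Hypothetical timing helper; not the benchmark used for the tables in this PR. */
final class AvgTimingSketch {

    /** Returns the mean wall-clock time of task in microseconds over the given number of runs. */
    static double averageMicros(Runnable task, int warmupRuns, int runs) {
        // Warm-up runs give the JIT a chance to compile the hot path before measuring.
        for (int i = 0; i < warmupRuns; i++) {
            task.run();
        }
        long totalNanos = 0;
        for (int i = 0; i < runs; i++) {
            long start = System.nanoTime();
            task.run();
            totalNanos += System.nanoTime() - start;
        }
        return totalNanos / 1000.0 / runs; // nanoseconds to microseconds, then average
    }

    /** Ratio as reported in the tables: paged time divided by plain time. */
    static double ratio(double pagedMicros, double plainMicros) {
        return pagedMicros / plainMicros;
    }
}
```

A hand-rolled loop like this is still subject to the jitter mentioned above, so small deviations around 1.0 should not be over-interpreted.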

s1monw · 2014-03-28T08:20:04Z

src/main/java/org/elasticsearch/common/bytes/PagedBytesReference.java

            while (written < len) {
-                b[bOffset + written] = bytearray.get(offset + written);
-                written++;
+                // how much can we bulk-read until hitting a multiple of PAGE_SIZE?


Could you put the // comments behind the actual line? This would be much easier to read, and we have at least 120 chars :)

Closes #5589

hhoffstaette · 2014-03-28T09:00:13Z

Made args final, reformatted a bit, added PR.

s1monw · 2014-03-28T09:06:16Z

src/main/java/org/elasticsearch/common/bytes/PagedBytesReference.java

+                long pagefragment = PAGE_SIZE - (bytearrayOffset % PAGE_SIZE); // how much can we read until hitting N*PAGE_SIZE?
+                int bulksize = (int)Math.min(pagefragment, todo - written); // we cannot copy more than a page fragment
+                boolean copied = bytearray.get(bytearrayOffset, bulksize, ref); // get the fragment
+                assert (copied == false); // we should never ever get back a materialized byte[]


This still confuses me: why do we return that boolean if it is always expected to be false?

I think your reply got busted... lemme reread.

The assert was just for testing. If you find it less confusing, I can remove the return value and the assert.

I am just wondering if we can make sure we never get a class that does that, but I guess the assert is OK.
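The page-fragment loop and the boolean discussed in this thread can be sketched in isolation. This is a minimal sketch under stated assumptions, not the Elasticsearch code: `SliceRef` and `PagedBytesSketch` are hypothetical stand-ins, `PAGE_SIZE` is shrunk to 16 for readability, and `get` is assumed to return false when the reference aliases a single page and true when it had to materialize a copy.

```java
/** Hypothetical slice holder for this sketch: a (bytes, offset, length) triple. */
final class SliceRef {
    byte[] bytes;
    int offset;
    int length;
}

/** Hypothetical paged byte store; NOT the Elasticsearch ByteArray or PagedBytesReference. */
final class PagedBytesSketch {
    static final int PAGE_SIZE = 16; // assumption: tiny pages so the example is easy to trace

    private final byte[][] pages;

    PagedBytesSketch(byte[] data) {
        int pageCount = (data.length + PAGE_SIZE - 1) / PAGE_SIZE;
        pages = new byte[pageCount][PAGE_SIZE];
        for (int p = 0; p < pageCount; p++) {
            int from = p * PAGE_SIZE;
            System.arraycopy(data, from, pages[p], 0, Math.min(PAGE_SIZE, data.length - from));
        }
    }

    /** Points ref at [index, index + len). Returns true only if a byte[] copy was materialized. */
    boolean get(long index, int len, SliceRef ref) {
        int inPage = (int) (index % PAGE_SIZE);
        if (inPage + len <= PAGE_SIZE) {
            // The range fits in one page: alias the page, no copy.
            ref.bytes = pages[(int) (index / PAGE_SIZE)];
            ref.offset = inPage;
            ref.length = len;
            return false;
        }
        // The range crosses a page boundary: a fresh array must be built.
        byte[] copy = new byte[len];
        for (int i = 0; i < len; i++) {
            long pos = index + i;
            copy[i] = pages[(int) (pos / PAGE_SIZE)][(int) (pos % PAGE_SIZE)];
        }
        ref.bytes = copy;
        ref.offset = 0;
        ref.length = len;
        return true;
    }

    /** Bulk-reads len bytes starting at offset into dst, one page fragment at a time. */
    void read(long offset, byte[] dst, int dstOffset, int len) {
        SliceRef ref = new SliceRef();
        int written = 0;
        while (written < len) {
            long pos = offset + written;
            long pagefragment = PAGE_SIZE - (pos % PAGE_SIZE);           // bytes left in this page
            int bulksize = (int) Math.min(pagefragment, len - written);  // never cross a boundary
            boolean copied = get(pos, bulksize, ref);
            if (copied) {
                throw new IllegalStateException("fragment unexpectedly materialized");
            }
            System.arraycopy(ref.bytes, ref.offset, dst, dstOffset + written, bulksize);
            written += bulksize;
        }
    }
}
```

Because each request is capped at the remaining bytes of the current page, `get` never has to cross a page boundary inside `read`, which is why the returned flag is expected to be false there. The sketch uses an explicit exception rather than `assert` so the check also runs without `-ea`.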

s1monw · 2014-03-28T09:22:05Z

LGTM

Restore streamInput() performance over PagedBytesReference.

c3e0803

Closes #5589

hhoffstaette added enhancement and removed enhancement labels Mar 28, 2014

hhoffstaette self-assigned this Mar 28, 2014

hhoffstaette closed this in 0c1b9a6 Mar 28, 2014

hhoffstaette pushed a commit that referenced this pull request Mar 28, 2014

Restore streamInput() performance over PagedBytesReference.

089d0e5

Closes #5589

hhoffstaette deleted the streamcopy branch March 28, 2014 10:09

hhoffstaette removed their assignment Mar 10, 2015

clintongormley added the :Internal label Jun 8, 2015
