
msg/simple: apply prefetch policy more precisely #10344

Merged
merged 1 commit into ceph:master from xiexingguo:xxg-wip-pipe-2016-07-19-02 on Aug 1, 2016

Conversation

@xiexingguo (Member) commented Jul 19, 2016

We should apply the prefetch policy based on the residual length after checking the cache, rather than on the original request length.

E.g., if the read sequence is 1K, 5K, 2K, the improved logic triggers another 4K prefetch on the second (5K) read, because 5K - 3K (served from recv_buf) == 2K, which is within the prefetch limit; we then have 8K of prefetched data in total. The old logic would not prefetch here. As a result, the last request, which asks for 2K of data, also benefits from this prefetch, which is good for performance.

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
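
To make the changed check concrete, here is a minimal, self-contained C++ sketch of the decision; the struct and member names below are illustrative assumptions, not the actual Pipe.cc code.

```cpp
// Minimal sketch of the prefetch decision; names are illustrative, not Pipe.cc.
#include <algorithm>
#include <cstddef>

struct PrefetchPolicy {
  std::size_t recv_max_prefetch;  // prefetch threshold, e.g. 4K
  std::size_t cached_bytes;       // data already sitting in recv_buf

  // Would a read of `len` bytes trigger a prefetch?
  bool should_prefetch(std::size_t len, bool use_residual) const {
    // Bytes still missing after draining whatever recv_buf holds.
    std::size_t residual = len - std::min(cached_bytes, len);
    if (residual == 0)
      return false;  // request fully served from the cache, nothing to fetch
    // Old policy compared the original request length against the threshold;
    // this change compares only the residual length.
    std::size_t decisive = use_residual ? residual : len;
    return decisive <= recv_max_prefetch;
  }
};
```

With `use_residual == false`, the second read in the 1K, 5K, 2K example (5K requested, 3K already cached) is treated as a large read and bypasses the cache; with `use_residual == true`, the 2K residual falls under the threshold and a prefetch is issued.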

@gregsfortytwo (Member)

Doesn't this change contradict the comment right there about not prefetching large reads?

@xiexingguo (Member, Author) commented Jul 20, 2016

The key difference is how we define large reads.
Previously we honoured only the original length of the read when applying the prefetch policy. With this change we instead consider the residual length of the read after reading from the cache.

E.g., suppose recv_max_prefetch is 4K and the read sequence is 1K, 5K, 2K.

Before this change:

  1. read 1K, as 1K < recv_max_prefetch, we prefetch 4K, and the 1K read itself is hit in the cache after the prefetch is done.
  2. read 5K, the first 3K is hit in the cache and the cache is now empty, as 5K > recv_max_prefetch, we don't prefetch and trigger a 2K read instead.
  3. read 2K, the cache is now empty; as 2K < recv_max_prefetch, we trigger another prefetch and get 2K from the cache after the prefetch is done.

After this change:

  1. read 1K, as 1K < recv_max_prefetch, we prefetch 4K, and the 1K read itself is hit in the cache after the prefetch is done.
  2. read 5K, the first 3K is hit in the cache, which is then empty, leaving 5K - 3K = 2K to read; as 2K < recv_max_prefetch, we prefetch again and serve the 2K from the cache once the prefetch is done, leaving 2K of data in the cache.
  3. read 2K, which is directly hit in the cache.

From the above example, we now need exactly 2 (prefetch) reads instead of the 3 reads we needed before.
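
The accounting above can be replayed with a small, self-contained simulation; the names and structure are illustrative and not the SimpleMessenger code itself.

```cpp
// Toy replay of the example above: 4K recv_max_prefetch, reads of 1K, 5K, 2K.
// All names are illustrative; this is not the SimpleMessenger code itself.
#include <algorithm>
#include <cstddef>
#include <cstdio>
#include <vector>

static int simulate(bool use_residual) {
  const std::size_t K = 1024, max_prefetch = 4 * K;
  std::size_t cached = 0;  // bytes left over in the prefetch buffer
  int socket_reads = 0;    // prefetches plus direct reads issued to the socket

  for (std::size_t len : std::vector<std::size_t>{1 * K, 5 * K, 2 * K}) {
    std::size_t from_cache = std::min(cached, len);
    cached -= from_cache;
    std::size_t residual = len - from_cache;
    if (residual == 0)
      continue;  // served entirely from the cache, no socket traffic
    std::size_t decisive = use_residual ? residual : len;
    ++socket_reads;
    if (decisive <= max_prefetch)
      cached = max_prefetch - residual;  // prefetch and keep the surplus cached
    // else: a direct read of exactly `residual` bytes, nothing gets cached
  }
  return socket_reads;
}

int main() {
  std::printf("old policy: %d socket reads\n", simulate(false));  // expected: 3
  std::printf("new policy: %d socket reads\n", simulate(true));   // expected: 2
  return 0;
}
```

Run as written, it should report 3 socket reads for the old policy and 2 for the new one, matching the walkthrough above.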

@xiexingguo (Member, Author)

@gregsfortytwo Thoughts?

@gregsfortytwo (Member)

Yeah, that makes sense. I was worried about the performance impact of the change but going down a few function frames it all looks good to me!
Reviewed-by:

@yuriw (Contributor) commented Aug 1, 2016

@jdurgin merged commit d5c12af into ceph:master on Aug 1, 2016
jdurgin added a commit that referenced this pull request Aug 1, 2016
msg/simple: apply prefetch policy more precisely

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
Conflicts:
	src/msg/simple/Pipe.cc (removed unneeded cast)
@xiexingguo deleted the xxg-wip-pipe-2016-07-19-02 branch on August 1, 2016 at 22:44
xiexingguo added a commit to xiexingguo/ceph that referenced this pull request Jan 3, 2019
We should apply the prefetch policy based on the residual length
instead of the originally requested length.

E.g., suppose recv_max_prefetch is 4K and the read sequence is 1K, 5K, 2K.

**Before this change:**
- read 1K, as 1K < recv_max_prefetch, we prefetch 4K, and the 1K read
  itself is hit in the cache after the prefetch is done.
- read 5K, the first 3K is hit in the cache and the cache is now empty,
  as 5K > recv_max_prefetch, we don't prefetch and trigger a 2K read instead.
- read 2K, the cache is now empty; as 2K < recv_max_prefetch, we trigger
  another prefetch and get 2K from the cache after the prefetch is done.

**After this change:**
- read 1K, as 1K < recv_max_prefetch, we prefetch 4K, and the 1K read
  itself is hit in the cache after the prefetch is done.
- read 5K, the first 3K is hit in the cache, which is then empty, leaving
  5K - 3K = 2K to read; as 2K < recv_max_prefetch, we prefetch again and
  serve the 2K from the cache once the prefetch is done, leaving 2K of
  data in the cache.
- read 2K, which is directly hit in the cache.

From the above example, we now need exactly 2 (prefetch) reads instead of
the 3 reads we needed before.

See-also: ceph#10344
Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>