
Feature: block: add support for efficient zero writes to ScaleIO volumes (scini devices) #9

Merged
gmmephisto merged 5 commits into master from pr-master-block-sio-driver on Apr 3, 2018

Conversation

gmmephisto
Contributor

No description provided.

Denis V. Lunev and others added 5 commits April 3, 2018 21:36
The patch introduces a new concept: a minimal memory alignment for bounce
buffers. The original so-called "optimal" value is actually the minimal
required alignment. It should be used to validate that an IOVec is properly
aligned and that a bounce buffer is not required.

From the performance point of view, however, it would be better if bounce
buffers and IOVecs allocated by QEMU were aligned more strictly.

The patch does not change any alignment value yet.

Signed-off-by: Denis V. Lunev <den@openvz.org>
Reviewed-by: Kevin Wolf <kwolf@redhat.com>
Message-id: 1431441056-26198-2-git-send-email-den@openvz.org
CC: Paolo Bonzini <pbonzini@redhat.com>
Reviewed-by: Kevin Wolf <kwolf@redhat.com>
CC: Stefan Hajnoczi <stefanha@redhat.com>
Signed-off-by: Stefan Hajnoczi <stefanha@redhat.com>
Signed-off-by: Mikhail Ushanov <MiUshanov@croc.ru>
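
To illustrate the distinction described in the commit message above, here is a minimal sketch of an alignment check over an I/O vector. The names iov_is_aligned and min_mem_align are illustrative, not QEMU's actual API: the minimal alignment only decides whether the caller's buffers can be submitted directly, while buffers QEMU allocates itself could still follow the stricter "optimal" alignment.

    /*
     * Hypothetical sketch (illustrative names, not QEMU's actual API):
     * check that every element of a scatter/gather list meets the minimal
     * memory alignment, so a bounce buffer is only used when the caller's
     * buffers really require it.
     */
    #include <stdbool.h>
    #include <stddef.h>
    #include <stdint.h>
    #include <sys/uio.h>

    static bool iov_is_aligned(const struct iovec *iov, int iovcnt,
                               size_t min_mem_align)
    {
        for (int i = 0; i < iovcnt; i++) {
            if ((uintptr_t)iov[i].iov_base % min_mem_align ||
                iov[i].iov_len % min_mem_align) {
                return false;   /* misaligned: a bounce buffer is needed */
            }
        }
        return true;            /* aligned: submit the request as is */
    }
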
The following sequence
    int fd = open(argv[1], O_RDWR | O_CREAT | O_DIRECT, 0644);
    for (i = 0; i < 100000; i++)
            write(fd, buf, 4096);
performs 5% better if buf is aligned to 4096 bytes.

The difference is quite reliable.

On the other hand, we do not currently want to enforce bounce buffering
when the guest request is only aligned to 512 bytes.

The patch changes the default bounce buffer optimal alignment to
MAX(page size, 4k). 4k is chosen as the largest known sector size of
real HDDs.

The justification for the performance improvement is quite interesting.
From the kernel's point of view, each request to the disk was split
in two. This can be seen with blktrace:
  9,0   11  1     0.000000000 11151  Q  WS 312737792 + 1023 [qemu-img]
  9,0   11  2     0.000007938 11151  Q  WS 312738815 + 8 [qemu-img]
  9,0   11  3     0.000030735 11151  Q  WS 312738823 + 1016 [qemu-img]
  9,0   11  4     0.000032482 11151  Q  WS 312739839 + 8 [qemu-img]
  9,0   11  5     0.000041379 11151  Q  WS 312739847 + 1016 [qemu-img]
  9,0   11  6     0.000042818 11151  Q  WS 312740863 + 8 [qemu-img]
  9,0   11  7     0.000051236 11151  Q  WS 312740871 + 1017 [qemu-img]
  9,0    5  1     0.169071519 11151  Q  WS 312741888 + 1023 [qemu-img]
After the patch the pattern becomes normal:
  9,0    6  1     0.000000000 12422  Q  WS 314834944 + 1024 [qemu-img]
  9,0    6  2     0.000038527 12422  Q  WS 314835968 + 1024 [qemu-img]
  9,0    6  3     0.000072849 12422  Q  WS 314836992 + 1024 [qemu-img]
  9,0    6  4     0.000106276 12422  Q  WS 314838016 + 1024 [qemu-img]
and the number of requests sent to the disk (which can be calculated by
counting the lines in the blktrace output) is reduced by roughly a factor of two.

Both qemu-img and qemu-io are affected, while qemu-kvm is not. The guest
does its job well, and its real requests arrive properly aligned (to a page).

Signed-off-by: Denis V. Lunev <den@openvz.org>
Reviewed-by: Kevin Wolf <kwolf@redhat.com>
Message-id: 1431441056-26198-3-git-send-email-den@openvz.org
CC: Paolo Bonzini <pbonzini@redhat.com>
CC: Kevin Wolf <kwolf@redhat.com>
CC: Stefan Hajnoczi <stefanha@redhat.com>
Signed-off-by: Stefan Hajnoczi <stefanha@redhat.com>
Signed-off-by: Mikhail Ushanov <MiUshanov@croc.ru>
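
For reference, the snippet in the commit message above can be expanded into a self-contained benchmark. This is a minimal sketch for a Linux host (O_DIRECT needs _GNU_SOURCE); the iteration count and 4096-byte writes follow the commit message, while the posix_memalign() allocation is an addition here, with its alignment argument varied (e.g. 512 vs 4096) to reproduce the reported difference.

    /*
     * Self-contained version of the benchmark from the commit message,
     * assuming a Linux host. The posix_memalign() allocation is an addition
     * here: its alignment argument (4096, matching MAX(page size, 4k) on
     * common x86 systems) can be lowered to 512 to compare the two cases.
     */
    #define _GNU_SOURCE             /* for O_DIRECT */
    #include <fcntl.h>
    #include <stdio.h>
    #include <stdlib.h>
    #include <string.h>
    #include <unistd.h>

    int main(int argc, char **argv)
    {
        if (argc < 2) {
            fprintf(stderr, "usage: %s <file>\n", argv[0]);
            return 1;
        }

        int fd = open(argv[1], O_RDWR | O_CREAT | O_DIRECT, 0644);
        if (fd < 0) {
            perror("open");
            return 1;
        }

        void *buf;
        if (posix_memalign(&buf, 4096, 4096) != 0) {
            fprintf(stderr, "posix_memalign failed\n");
            return 1;
        }
        memset(buf, 0, 4096);       /* fill the write pattern */

        for (int i = 0; i < 100000; i++) {
            if (write(fd, buf, 4096) != 4096) {
                perror("write");
                break;
            }
        }

        free(buf);
        close(fd);
        return 0;
    }
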
@gmmephisto gmmephisto merged commit 41d5c56 into master Apr 3, 2018
@gmmephisto gmmephisto deleted the pr-master-block-sio-driver branch April 4, 2018 10:58