rbd: fix thread_offsets calculation of rbd bench #20590

hitoshikamei · 2018-02-26T12:52:29Z

This patch fixes the way to calculate the thread_offset
vector for sequential I/O of rbd bench command.

The rbd bench command doesn't access whole image of rbd
in some cases, because the amount of accessed data is counted
up to the amount of total I/O.

For example, if options are set to below:

rbd image size : 20M
io-size : 4M
io-total : 20M
io-threads : 3
io-type : write (sequential)

In this case, the data chunk is 5 (20MB / 4MB).
Fist, Thread 1 (T1) writes data to chunk 1. Thread 2 (T2)
writes data to chunk 2. Thread 3 (T3) writes data to
chunk 3. And, the amount of written data sums up to the "off"
value.

After that, the write position of each thread moves next
chunk, and threads overwrite data to the chunks. And,
the amount of overwritten data sums up to the "off" value.
Consequently, the off value reaches rbd image size, and
the rbd bench ends.

The rbd bench command doesn't write whole image, and 8 MB of
image is unwritten.

The processing image is described below:

            0   4   8  12  16   20 MB
            ---------------------
 chunks     | 1 | 2 | 3 | 4 | 5 |
            ---------------------
 1st loop    T1  T2  T3          -> 12 MB written (add to off value)
 2nd loop        T1  T2          ->  8 MB written (add to off value)
                                                 20 MB -> rbd bench END.

Hitoshi Kamei (1):
rbd: fix thread_offsets calculation of rbd bench

src/tools/rbd/action/Bench.cc | 24 +++++++++++++++++-------
1 file changed, 17 insertions(+), 7 deletions(-)

--
2.15.1

dillaman · 2018-02-26T14:26:48Z

src/tools/rbd/action/Bench.cc

+        if (off < (io_size * unit_len * io_threads) ) {
+          thread_offset[i] += io_size;
+        } else {
+          // thread_offset is adjusted to the chunks unassgined to threads.


Nit: unassigned

dillaman · 2018-02-26T14:43:51Z

src/tools/rbd/action/Bench.cc

+          thread_offset[i] = off + (i * io_size);
+        }
+        if (thread_offset[i] + io_size > size)
+          thread_offset[i] = 0;


Nit: shouldn't this be something like thread_offset[i] = unit_len * i * io_size

hitoshikamei · 2018-02-27T08:06:14Z

Thank you for your comment. My explanation might be misleading, so I'll describe it more carefully with images.

Benchmark should not overwrite chunks because the write performance of allocated chunks is different from unallocated chunks. When an unallocated chunk is written, ceph allocates new rados object; meanwhile, when an allocated chunk is written, no need to allocate new rados object. So, the benchmark is affected by the overheads of the operation. Thus, the benchmark needs to avoid to overwrite chunks allocated by previous write.

These accesses are described below (assumption is same as PR message):

1. Current code: Overwrite just next chunk, chunk 4 and chunk 5 are not written
            0   4   8  12  16   20 MB
            ---------------------
 chunks     | 1 | 2 | 3 | 4 | 5 |
            ---------------------
 1st loop    T1  T2  T3          -> 12 MB written (add to off value)
 2nd loop        T1  T2          ->  8 MB written (add to off value)
                              Total 20 MB -> rbd bench END.

2. Your proposal code: Back to start position, chunk 4 and chunk 5 are not written
            0   4   8  12  16   20 MB
            ---------------------
 chunks     | 1 | 2 | 3 | 4 | 5 |
            ---------------------
 1st loop    T1  T2  T3          -> 12 MB written (add to off value)
 2nd loop    T1  T2              ->  8 MB written (add to off value)
                              Total 20 MB -> rbd bench END.

3. Proposed code: Write chunk 4 and chunk 5 in 2nd loop, all chunks are written
            0   4   8  12  16   20 MB
            ---------------------
 chunks     | 1 | 2 | 3 | 4 | 5 |
            ---------------------
 1st loop    T1  T2  T3          -> 12 MB written (add to off value)
 2nd loop                T1  T2  ->  8 MB written (add to off value)
                              Total 20 MB -> rbd bench END.

dillaman · 2018-02-27T21:05:18Z

@hitoshikamei That wasn't what I was proposing -- what I was proposing was to stop all threads restarting at offset zero after the image is written once.

This patch fixes the calculation of the thread_offset vector for sequential I/O of rbd bench command. The rbd bench command doesn't access whole image of rbd, because the some chunks are not assigned to threads. This patch changes the way to calculate the thread_offsets to assign all chunks to threads. Signed-off-by: Hitoshi Kamei <hitoshi.kamei.xm@hitachi.com> Cc: Mitsuo Hayasaka <mitsuo.hayasaka.hu@hitachi.com>

hitoshikamei · 2018-03-02T11:32:04Z

I'm sorry. I misunderstood your comment. You referred to line 313, not 310.
I agree that the result of your proposed code is the same as current code.
So, I revised the patch according to your review comment.

dillaman

lgtm

dillaman added cleanup rbd labels Feb 26, 2018

dillaman reviewed Feb 26, 2018

View reviewed changes

hitoshikamei force-pushed the rbd-bench branch from 478c9f3 to 75c5620 Compare March 2, 2018 11:31

dillaman approved these changes Mar 5, 2018

View reviewed changes

dillaman added needs-qa wip-jason-testing labels Mar 5, 2018

dillaman merged commit 2be034e into ceph:master Mar 8, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rbd: fix thread_offsets calculation of rbd bench #20590

rbd: fix thread_offsets calculation of rbd bench #20590

hitoshikamei commented Feb 26, 2018

dillaman Feb 26, 2018

dillaman Feb 26, 2018

hitoshikamei commented Feb 27, 2018

dillaman commented Feb 27, 2018

hitoshikamei commented Mar 2, 2018 •

edited

dillaman left a comment

rbd: fix thread_offsets calculation of rbd bench #20590

rbd: fix thread_offsets calculation of rbd bench #20590

Conversation

hitoshikamei commented Feb 26, 2018

dillaman Feb 26, 2018

Choose a reason for hiding this comment

dillaman Feb 26, 2018

Choose a reason for hiding this comment

hitoshikamei commented Feb 27, 2018

dillaman commented Feb 27, 2018

hitoshikamei commented Mar 2, 2018 • edited

dillaman left a comment

Choose a reason for hiding this comment

hitoshikamei commented Mar 2, 2018 •

edited