rgw/rgw_op.cc: support async md5 calculation #42435
Conversation
Signed-off-by: Yang Honggang <yanghonggang_yewu@cmss.chinamobile.com>
hi @yanghonggang! @mdw-at-linuxbox has thought about this a bit, I'm adding him as a reviewer
very cool, thanks!
@@ -3958,7 +3995,15 @@ void RGWPutObj::execute(optional_yield y)
  }

  if (need_calc_md5) {
    hash.Update((const unsigned char *)data.c_str(), data.length());
    int data_len = data.length();
    char* buf = new char[data_len];
no need to allocate and copy the buffer, you can just pass a copy of the bufferlist data through hash.Update() and its lambda
ok
void Update(char *data, size_t len) {
  ongoing_ops.get(1);

  last = std::async([&](std::future<void>&& last, char* data,
this pattern with std::async() and std::future looks simple and elegant, but i do have some concerns here

first is that rgw requests can run asynchronously as coroutines in a boost::asio::io_context, and we want to avoid blocking on a condition variable (either in Throttle::get() or std::future::get()) - instead, we should suspend the coroutine so this thread can resume work on something else until the result is ready. you can find some examples of this in RGWReshardWait::wait(optional_yield y) and RGWHTTPClient::wait(optional_yield y)

this overload of std::async() doesn't take a launch policy, and:

    Behaves as if called with policy being std::launch::async | std::launch::deferred. In other words, f might be executed in another thread or it might be run synchronously when the resulting std::future is queried for a value.

as far as i know, std::launch::async doesn't give us any guarantees about the number or lifetime of its background threads, or the order of execution for these tasks. if it does limit the number of threads and allows out-of-order execution, this pattern could lead to deadlocks because each task blocks on the result of the previous task with std::future::get()
you can find some examples of this in RGWReshardWait::wait(optional_yield y) and RGWHTTPClient::wait(optional_yield y)
ok, thank you.
if it does limit the number of threads and allows out-of-order execution, this pattern could lead to deadlocks because each task blocks on the result of the previous task with std::future::get()
@cbodley
I don't know under which conditions this will lead to deadlocks. Can you give an example?
thank you.
if it does limit the number of threads and allows out-of-order execution, this pattern could lead to deadlocks because each task blocks on the result of the previous task with std::future::get()
I don't know under which condition this will lead to deadlocks. Can you give an example?
taking this example to an extreme, consider an implementation that uses a thread pool with a single thread. we upload an object, and AsyncMD5 creates a sequence of tasks A->B->C->D. if this thread pool allows out-of-order execution, then it may execute B before A. task B will block waiting for the result of A, but task A can never run because there's only the single thread
as the number of threads increases, deadlock becomes far less likely. but i think we're better off handling the threading manually to guarantee that it won't. for example, with a scheduler that's aware of these dependencies, and doesn't schedule a task until it's ready to run. we should also be able to combine the two separate locks (Throttle and std::future) into one
ultimately i think we're either going to need SIMD or large batches to see real wins here, to make up for the added overhead of thread synchronization
ultimately i think we're either going to need SIMD or large batches to see real wins here, to make up for the added overhead of thread synchronization
I'm with you on that. Thank you for your suggestions.
@mkogan1 thank you for your response. It seems that the performance is s3 object size dependent (your -z is 4K).
This pull request can no longer be automatically merged: a rebase is needed and changes have to be manually resolved
This pull request has been automatically marked as stale because it has not had any activity for 60 days. It will be closed if no further activity occurs for another 30 days.
This pull request has been automatically closed because there has been no activity for 90 days. Please feel free to reopen this pull request (or open a new one) if the proposed change is still appropriate. Thank you for your contribution!
The current putobject procedure is:
If we change the md5 calculation from a sync op to an async op, the async write can start earlier, which will reduce put latency.
In my test environment, put latency decreased from 91ms to 82ms (4M s3 object; the test tool is hsbench). Of course, the rados cluster performance and the rgw node's CPU should not be a bottleneck.
In order to change the md5 calculation to an async op, one copy of the user data chunk is kept until the calculation is finished. I don't know if there is a smart way to handle this.
Any suggestions would be greatly appreciated.
Checklist
Show available Jenkins commands
jenkins retest this please
jenkins test classic perf
jenkins test crimson perf
jenkins test signed
jenkins test make check
jenkins test make check arm64
jenkins test submodules
jenkins test dashboard
jenkins test api
jenkins test docs
jenkins render docs
jenkins test ceph-volume all
jenkins test ceph-volume tox