
mds: Client syncfs is slow (waits for next MDS tick) #15544

Merged: 2 commits, Jun 19, 2017

Conversation

@taodd (Contributor) commented Jun 7, 2017

tracker url: http://tracker.ceph.com/issues/20129
Signed-off-by: dongdong tao <tdd21151186@gmail.com>

@taodd taodd changed the title Fix issue: Client syncfs is slow (waits for next MDS tick) mds: Client syncfs is slow (waits for next MDS tick) Jun 7, 2017
@jcsp jcsp added bug-fix cephfs Ceph File System labels Jun 7, 2017
@jcsp jcsp requested a review from ukernel June 7, 2017 10:24
@@ -5799,6 +5817,7 @@ void Client::unmount()

while (!mds_requests.empty()) {
ldout(cct, 10) << "waiting on " << mds_requests.size() << " requests" << dendl;
flush_mdlog_sync();

Contributor:

We should only call flush_mdlog_sync() once, when there are pending requests or flushing caps. mount_cond can get signaled prematurely, and calling flush_mdlog_sync() each time mount_cond gets signaled creates unnecessary overhead on the MDS. Besides, we should call flush_mdlog_sync() after flushing dirty caps.

Contributor Author:

1. We are inside the loop "while (!mds_requests.empty())", so I think this means there is at least 1 pending request?
2. client_lock is still held here until the next statement, so I don't understand why mount_cond would get signaled prematurely.
3. If we don't add flush_mdlog_sync() here in this while loop, then the next statement (mount_cond.Wait(client_lock)) will still have to wait for the next MDS tick, right?

4. You said flushing dirty caps; flush_caps_sync() is used to flush dirty caps, right? And it is called later in Client::unmount.
Maybe I misunderstand your comment?

Contributor:

handle_client_reply() signals mount_cond. If there are N pending requests, flush_mdlog_sync() gets called N times.

Contributor:

My point is to reduce the messages/requests that trigger mdlog->flush to as few as possible.

Contributor Author:

Yeah, you are right, flush_mdlog_sync() in the while loop would be triggered N times. I should call it before the while loop and add a check in flush_mdlog_sync().
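
A minimal standalone sketch of that plan, assuming the fix simply hoists the call ahead of the wait loop and adds an early-out guard inside flush_mdlog_sync(); FakeClient, the std::mutex/std::condition_variable stand-ins and the empty-check guard are simplifications for illustration, not the actual Ceph implementation:

// Sketch only: call flush_mdlog_sync() once before the wait loop instead of
// on every mount_cond wakeup. FakeClient and the std:: synchronization types
// stand in for Ceph's Client, Mutex and Cond.
#include <condition_variable>
#include <cstdint>
#include <iostream>
#include <map>
#include <mutex>

struct FakeClient {
  std::mutex client_lock;                  // stands in for Client::client_lock
  std::condition_variable mount_cond;      // stands in for Client::mount_cond
  std::map<uint64_t, int> mds_requests;    // pending MDS requests (simplified)

  // Ask the MDS to flush its journal; skip the message when there is nothing
  // outstanding, so callers can invoke it unconditionally (the "add check in
  // flush_mdlog_sync" part of the plan, assumed here to be an empty check).
  void flush_mdlog_sync() {
    if (mds_requests.empty())
      return;
    std::cout << "sending one flush-mdlog request per MDS session\n";
  }

  void unmount() {
    std::unique_lock<std::mutex> l(client_lock);

    // Hoisted out of the loop: a reply completing one of N requests still
    // signals mount_cond, but no longer re-sends the flush request N times.
    flush_mdlog_sync();

    while (!mds_requests.empty()) {
      std::cout << "waiting on " << mds_requests.size() << " requests\n";
      mount_cond.wait(l);                  // woken by handle_client_reply()
    }
  }
};

int main() {
  FakeClient c;
  c.unmount();   // with no pending requests this returns immediately
}

With the call hoisted out of the loop, each wakeup only re-checks mds_requests instead of sending another flush to the MDS.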

@jcsp (Contributor) commented Jun 19, 2017

This has passed tests and is ready to merge. @taodd, please could you clean up the commit messages? They should be something like this:

client: signal MDS to flush log when doing a syncfs

<...more description text...>

Fixes: http://tracker.ceph.com/issues/20129
Signed-off-by: dongdong tao <tdd21151186@gmail.com>

Have a look at other recent commits in the repository for examples.

@taodd (Contributor Author) commented Jun 19, 2017

updated the commit message

@jcsp jcsp merged commit 18de794 into ceph:master Jun 19, 2017
@jcsp (Contributor) commented Jun 19, 2017

@ukernel @taodd does anyone have thoughts about how to handle upgrades? I should have asked this before merging, really :-), but we definitely need an answer before we release luminous -- MDSs currently abort if they see an unexpected CEPH_SESSION_* op in an MClientSession, so the client probably needs a way to avoid sending it to older MDSs.

@taodd (Contributor Author) commented Jun 19, 2017

Yes, we should. How do we identify older MDSs?

@jcsp (Contributor) commented Jun 21, 2017

Turns out we already had a handy luminous feature bit to check; I've created a PR to handle upgrades: #15805.
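
For illustration, a sketch of the kind of client-side guard being discussed, assuming the client checks a peer feature bit before sending the new flush-mdlog session op; FakeSession, has_feature(), FEATURE_SERVER_LUMINOUS and its bit value are hypothetical stand-ins rather than the real Ceph API, and the actual upgrade handling is in #15805:

// Sketch only: send the new session op only to MDSs that advertise a
// new-enough feature bit; otherwise fall back to waiting for the MDS tick.
#include <cstdint>
#include <iostream>

enum class SessionOp { REQUEST_FLUSH_MDLOG };   // models CEPH_SESSION_REQUEST_FLUSH_MDLOG

struct FakeSession {
  uint64_t peer_features = 0;                   // feature bits advertised by the MDS

  bool has_feature(uint64_t bit) const { return (peer_features & bit) != 0; }

  void send_session_msg(SessionOp) {
    std::cout << "sent flush-mdlog request\n";
  }
};

// Hypothetical stand-in for the luminous feature bit; the value is arbitrary.
constexpr uint64_t FEATURE_SERVER_LUMINOUS = 1ull << 38;

void flush_mdlog(FakeSession &s) {
  if (s.has_feature(FEATURE_SERVER_LUMINOUS)) {
    // New MDS: understands the session op, ask it to flush its log now.
    s.send_session_msg(SessionOp::REQUEST_FLUSH_MDLOG);
  } else {
    // An old MDS would abort on an unknown CEPH_SESSION_* op, so send nothing
    // and let the periodic MDS tick flush the log as before.
    std::cout << "old MDS, skipping flush-mdlog request\n";
  }
}

int main() {
  FakeSession old_mds, new_mds;
  new_mds.peer_features = FEATURE_SERVER_LUMINOUS;
  flush_mdlog(old_mds);   // falls back to the old behaviour
  flush_mdlog(new_mds);   // sends the new session request
}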
