MDS: make popular counter decay at proper rate #18776

JianyuLi · 2017-11-07T04:15:34Z

Signed-off-by: Jianyu Li <joannyli@tencent.com>

Currently the pop_auth_subtree counter does not decay at the right speed: it uses the default DecayRate (which is zero) instead of the mds->mdcache->decayrate used by pop_auth_subtree_nested. As a result, the pop_auth_subtree_nested of a top-level dir decreases more rapidly than the pop_auth_subtree of a low-level dir. After a while, the whole mds_load taken from the / directory becomes much smaller than that of a low-level directory, which makes it hard for MDBalancer to find a proper subtree to export.
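To make the effect concrete, here is a minimal Python sketch of an exponentially decaying counter. It is a simplified model and not the Ceph C++ code: the class layout, the `simulate` helper, and the 5-second half-life are assumptions chosen for illustration. It shows that a counter read with a zero rate keeps its full value, while the same hits read with a configured half-life shrink over time.

```python
import math


class DecayRate:
    """Simplified model: k = ln(0.5) / half_life. k == 0 means no decay."""

    def __init__(self, half_life=None):
        # A rate built without a half-life models the "default" (zero) rate.
        self.k = 0.0 if half_life is None else math.log(0.5) / half_life


class DecayCounter:
    """Counter whose value is scaled by exp(elapsed * k) on every access."""

    def __init__(self):
        self.value = 0.0
        self.last = 0.0

    def _decay(self, now, rate):
        elapsed = now - self.last
        if elapsed > 0:
            self.value *= math.exp(elapsed * rate.k)
            self.last = now

    def hit(self, now, rate, amount=1.0):
        self._decay(now, rate)
        self.value += amount

    def get(self, now, rate):
        self._decay(now, rate)
        return self.value


def simulate(half_life=5.0, hits=1000, idle=20.0):
    """Record the same hits on two counters, then read both after `idle` seconds."""
    configured = DecayRate(half_life)  # stands in for mds->mdcache->decayrate
    default = DecayRate()              # stands in for the default, zero rate
    nested = DecayCounter()            # models pop_auth_subtree_nested
    subtree = DecayCounter()           # models pop_auth_subtree before the fix
    for _ in range(hits):
        nested.hit(0.0, configured)
        subtree.hit(0.0, default)
    return nested.get(idle, configured), subtree.get(idle, default)


if __name__ == "__main__":
    nested_value, subtree_value = simulate()
    print("nested (decays):", nested_value)
    print("subtree (zero rate):", subtree_value)
```

With these illustrative parameters the idle period spans four half-lives, so the nested value falls to one sixteenth of the hits recorded while the zero-rate counter does not move at all.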

For example, start 12 mdtest processes, each creating/deleting 50000 files under /mnt/cephfs/test[20-30] separately. Below is a log snippet that illustrates the problem:

2017-11-02 20:43:26.349331 7fe7cff81700 0 mds.0.bal mds.0 mdsload<[8640.6,3290.68 15221.9]/[108733,24090.1 156913], req 1.72793e+06, hr 0, qlen 1, cpu 16.66> = 1.7715e+06 ~ 15221.9 <--- the pop_auth_subtree_nested of / is [8640.6,3290.68 15221.9]
2017-11-02 20:43:26.349357 7fe7cff81700 0 mds.0.bal mds.1 mdsload<[0,0 0]/[0,0 0], req 881655, hr 0, qlen 5, cpu 16.37> = 884380 ~ 7599.19
2017-11-02 20:43:26.349380 7fe7cff81700 5 mds.0.bal prep_rebalance: my load 15221.9 target 11410.6 total 22821.1
2017-11-02 20:43:26.349388 7fe7cff81700 5 mds.0.bal i am sufficiently overloaded
2017-11-02 20:43:26.349400 7fe7cff81700 5 mds.0.bal - mds.0 exports 3811.38 to mds.1
2017-11-02 20:43:26.349414 7fe7cff81700 5 mds.0.bal want to send 3811.38 to mds.1 -> 3811.38
...
2017-11-02 20:43:26.349646 7fe7cff81700 7 mds.0.bal find_exports in 40428 [dir 0x10002d05443 /test31/#test-dir.0/ [2,head] auth v=687 cv=634/634 ap=0+58+59 state=1610612738|complete f(v0 m2017-11-02 18:00:45.117433 1=0+1) n(v76 rc2017-11-02 20:43:25.847469 35384=0+35384) hs=1+0,ss=0+0 dirty=1 | child=1 replicated=0 dirty=1 authpin=0 0x7fe8004bf500] need 3811.38 (3049.1 - 4573.65)
2017-11-02 20:43:26.389502 7fe7cff81700 7 mds.0.bal find_exports in 40490.8 [dir 0x10002d05442 /test28/#test-dir.0/ [2,head] auth v=674 cv=621/621 ap=0+58+59 state=1610612738|complete f(v0 m2017-11-02 18:00:45.107886 1=0+1) n(v74 rc2017-11-02 20:43:25.846067 35356=0+35356) hs=1+0,ss=0+0 dirty=1 | child=1 replicated=0 dirty=1 authpin=0 0x7fe7f7b33a80] need 3811.38 (3049.1 - 4573.65)
<--- but the pop_auth_subtree counters of the /test* dirs are much greater than that, so this mds cannot choose any subtree to migrate even though it is already overloaded
...
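The numbers in the log above can be checked with a short Python sketch. The 0.8 and 1.2 factors are inferred from the logged range (3049.1 - 4573.65) around the needed amount 3811.38 and are not taken from the balancer source; the function names are hypothetical.

```python
# Values copied from the log snippet above.
MY_LOAD = 15221.9   # load reported for mds.0
NEED = 3811.38      # amount mds.0 wants to export to mds.1
CANDIDATES = {
    "/test31/#test-dir.0/": 40428.0,
    "/test28/#test-dir.0/": 40490.8,
}


def export_window(need, low=0.8, high=1.2):
    """Acceptable popularity range for a subtree (factors inferred from the log)."""
    return need * low, need * high


def pickable(candidates, need):
    """Return the candidate dirs whose popularity falls inside the window."""
    lo, hi = export_window(need)
    return [path for path, pop in candidates.items() if lo <= pop <= hi]


if __name__ == "__main__":
    print("window:", export_window(NEED))
    print("pickable:", pickable(CANDIDATES, NEED))
    # Each candidate alone reports more load than the whole MDS does.
    print("above MDS load:", [p for p, pop in CANDIDATES.items() if pop > MY_LOAD])
```

Neither directory fits the window, and each one alone reports more popularity than the load computed for the whole MDS, which is the inconsistency the mismatched decay rates produce.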


ukernel · 2017-11-08T11:27:34Z

Thank you for the patch. I need more time to understand the code in this area.

ukernel

looks good to me. I wonder if we should initialize default DecayRate to mds->mdcache->decayrate

JianyuLi · 2017-11-10T06:31:46Z

Yes, I think it would be better to set the right rate at initialization too.

ukernel · 2017-11-19T12:47:54Z

http://pulpito.ceph.com/zyan-2017-11-17_09:07:11-fs-wip-zyan-testing-20171117-testing-basic-mira/

make popular counter decay at proper rate

6d4709e

Signed-off-by: Jianyu Li <joannyli@tencent.com>

JianyuLi changed the title ~~make popular counter decay at proper rate~~ MDS: make popular counter decay at proper rate Nov 7, 2017

ukernel added bug-fix cephfs Ceph File System labels Nov 7, 2017

ukernel approved these changes Nov 10, 2017

ukernel merged commit 035a588 into ceph:master Nov 19, 2017
