mds: trim null dentries proactively #10606

jcsp · 2016-08-07T19:29:36Z

Instead of leaving null dentries (e.g. left
behind from unlinks) in the cache until they
fall out of the LRU, actively push them
to the bottom of the LRU and then consume
all nulls at the bottom in trim() even if
the cache is not oversized yet.

This fixes the case where standby replay daemons
would otherwise accumulate a cache full of
null dentries resulting from unlinks, and it
makes the behaviour of active daemons more
deterministic.

Fixes: http://tracker.ceph.com/issues/16919
Signed-off-by: John Spray john.spray@redhat.com

gregsfortytwo · 2016-08-09T04:07:45Z

Okay, so this is really only pushing them to the bottom on replay, right? The commit message concerned me (as obviously we sometimes want to keep them around for fast responses on incomplete dirs).

jcsp · 2016-08-09T12:43:52Z

Tested by ceph/ceph-qa-suite#1111

jcsp · 2016-08-09T12:49:22Z

Right, the extra touch_dentry_bottom calls are only in replay. There are other places that we do that already, like unlinks and renames, so those will also get those stray dentries thrown away immediately instead of waiting for the cache fill up. The (important) case of null dentries that cache lookup misses should be unaffected though.

gregsfortytwo · 2016-08-09T14:02:15Z

LGTM

…o greg-fs-testing #10606 Reviewed-by: Greg Farnum <gfarnum@redhat.com>

gregsfortytwo · 2016-08-29T03:40:47Z

http://pulpito.ceph.com/gregf-2016-08-25_19:08:49-fs-greg-fs-testing-825---basic-mira/385454/ may be this PR's fault; testing without it.

gregsfortytwo · 2016-08-29T04:33:42Z

http://pulpito.ceph.com/gregf-2016-08-29_04:22:27-fs-greg-fs-testing-828---basic-mira/ has the same branch minus this PR running that test job.

gregsfortytwo · 2016-08-29T13:56:35Z

Yeah, looks like it's busted for some reason. I unzipped the MDS logs and looked a little bit and I'm not quite sure why they're stuck, but it does indeed involve stray inodes and trimming on the export.

You may just have lucked into revealing a bug that was being disguised by timing and isn't any more; not sure.

Instead of leaving null dentries (e.g. left behind from unlinks) in the cache until they fall out of the LRU, actively push them to the bottom of the LRU and then consume all nulls at the bottom in trim() even if the cache is not oversized yet. This fixes the case where standby replay daemons would otherwise accumulate a cache full of null dentries resulting from unlinks, and it makes the behaviour of active daemons more deterministic. Fixes: http://tracker.ceph.com/issues/16919 Signed-off-by: John Spray <john.spray@redhat.com>

jcsp · 2016-09-08T10:37:37Z

The offending test (TestStrays.test_migration_on_shutdown) is passing with latest update to this patch:
http://pulpito.ceph.com/jspray-2016-09-07_23:07:40-fs-wip-jcsp-testing-20160907-testing-basic-mira/405046/

I suspect there was an underlying bug here that was triggered by the extra trimming, so I'm going to create a branch that sets cache size to zero to see if it triggers it.

jcsp · 2016-09-08T10:37:52Z

@gregsfortytwo quick re-review?

jcsp · 2016-09-08T10:39:19Z

(the bit that changed was the extra if (lru.lru_get_size() + unexpirable <= (unsigned)max) { check inside the loop. Previously it was always going through the loop at least once so each tick was trimming one item (which should have still been legal but wasn't the intent of this patch)

gregsfortytwo · 2016-09-08T23:21:29Z

Reviewed-by:

jcsp added the cephfs Ceph File System label Aug 7, 2016

jcsp mentioned this pull request Aug 9, 2016

tasks/cephfs: add TestStraysStandby ceph/ceph-qa-suite#1111

Closed

jcsp force-pushed the wip-mds-standby-trim branch from e5eae60 to c419878 Compare August 9, 2016 12:46

gregsfortytwo added the needs-qa label Aug 9, 2016

gregsfortytwo added the wip-greg-testing label Aug 24, 2016

gregsfortytwo added a commit that referenced this pull request Aug 24, 2016

Merge branch 'wip-mds-standby-trim' of git://github.com/jcsp/ceph int…

8786515

…o greg-fs-testing #10606 Reviewed-by: Greg Farnum <gfarnum@redhat.com>

gregsfortytwo removed the wip-greg-testing label Aug 29, 2016

jcsp force-pushed the wip-mds-standby-trim branch from c419878 to 86f6522 Compare September 7, 2016 11:52

jcsp assigned gregsfortytwo Sep 8, 2016

gregsfortytwo removed their assignment Sep 8, 2016

jcsp merged commit 9faf778 into ceph:master Sep 9, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mds: trim null dentries proactively #10606

mds: trim null dentries proactively #10606

jcsp commented Aug 7, 2016

gregsfortytwo commented Aug 9, 2016

jcsp commented Aug 9, 2016

jcsp commented Aug 9, 2016

gregsfortytwo commented Aug 9, 2016

gregsfortytwo commented Aug 29, 2016

gregsfortytwo commented Aug 29, 2016

gregsfortytwo commented Aug 29, 2016

jcsp commented Sep 8, 2016

jcsp commented Sep 8, 2016

jcsp commented Sep 8, 2016

gregsfortytwo commented Sep 8, 2016

mds: trim null dentries proactively #10606

mds: trim null dentries proactively #10606

Conversation

jcsp commented Aug 7, 2016

gregsfortytwo commented Aug 9, 2016

jcsp commented Aug 9, 2016

jcsp commented Aug 9, 2016

gregsfortytwo commented Aug 9, 2016

gregsfortytwo commented Aug 29, 2016

gregsfortytwo commented Aug 29, 2016

gregsfortytwo commented Aug 29, 2016

jcsp commented Sep 8, 2016

jcsp commented Sep 8, 2016

jcsp commented Sep 8, 2016

gregsfortytwo commented Sep 8, 2016