
mds: miscellaneous fixes #12974

Merged
merged 14 commits into ceph:master on Feb 1, 2017

Conversation

@ukernel (Contributor) commented Jan 18, 2017

No description provided.

A request whose retry_attempt > 0 can also be in the clientreply queue.

Signed-off-by: Yan, Zheng <zyan@redhat.com>
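The commit message is terse, so here is a minimal, self-contained sketch of the idea; the types and names (Session, replay_queue, clientreply_queue, find) are hypothetical stand-ins, not the actual MDS code:

```cpp
#include <cstdint>
#include <list>

struct ClientRequest {
  uint64_t tid;
  int retry_attempt;
};

struct Session {
  std::list<ClientRequest> replay_queue;       // requests waiting to be replayed
  std::list<ClientRequest> clientreply_queue;  // replies waiting to be resent

  // Look a request up by tid in *both* queues.
  const ClientRequest* find(uint64_t tid) const {
    for (const auto& r : replay_queue)
      if (r.tid == tid) return &r;
    // The fix: a request with retry_attempt > 0 can sit here too, so the
    // clientreply queue must be searched as well.
    for (const auto& r : clientreply_queue)
      if (r.tid == tid) return &r;
    return nullptr;
  }
};
```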
The function checks whether all recovering mds have reached the resolve state.
It needs to take the damaged mds set into account.

Signed-off-by: Yan, Zheng <zyan@redhat.com>
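A minimal sketch of the check, assuming a simplified state model; the names (all_resolved, State, recovery_set) are hypothetical, not the real MDCache code:

```cpp
#include <map>
#include <set>

using mds_rank_t = int;
enum class State { replay, resolve, reconnect, rejoin, active };

// Returns true when every rank we are waiting on has reached resolve.
bool all_resolved(const std::set<mds_rank_t>& recovery_set,
                  const std::map<mds_rank_t, State>& states,
                  const std::set<mds_rank_t>& damaged) {
  for (mds_rank_t r : recovery_set) {
    if (damaged.count(r))
      continue;  // the fix: a damaged rank will never reach resolve
    auto it = states.find(r);
    if (it == states.end() || it->second < State::resolve)
      return false;
  }
  return true;
}
```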
The function gets the set of mds ranks that exist. It needs to take
the damaged mds set into account.

Signed-off-by: Yan, Zheng <zyan@redhat.com>
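A compact sketch of the set construction, again with hypothetical names and an assumed layout rather than the real MDSMap:

```cpp
#include <set>

using mds_rank_t = int;

struct MapLike {
  std::set<mds_rank_t> up;       // ranks with a running daemon
  std::set<mds_rank_t> failed;   // ranks whose daemon died
  std::set<mds_rank_t> damaged;  // ranks marked damaged

  // All ranks that exist, whatever their health.
  std::set<mds_rank_t> get_mds_set() const {
    std::set<mds_rank_t> s(up.begin(), up.end());
    s.insert(failed.begin(), failed.end());
    s.insert(damaged.begin(), damaged.end());  // the fix: previously omitted
    return s;
  }
};
```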
An mds failure does not affect the contents of recovery_set, so there is
no need to re-calculate recovery_set in MDCache::handle_mds_failure.

Signed-off-by: Yan, Zheng <zyan@redhat.com>
@ukernel ukernel added bug-fix cephfs Ceph File System labels Jan 18, 2017
@liewegas liewegas changed the title miscellaneous mds fixes mds: miscellaneous fixes Jan 18, 2017
If a subtree exporter fails, the importer may send abort notifications
to the bystanders in import_state_t::bystanders. If both the exporter and
a bystander fail in the same mdsmap epoch, the migrator first sends an abort
notification to the failed bystander, then removes the bystander from
import_state_t::bystanders. This causes the mds to assert on an unexpected
MExportDirNotifyAck message.

Signed-off-by: Yan, Zheng <zyan@redhat.com>
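A self-contained sketch of the ordering fix; the types (ImportState, send_abort_notify, handle_failure) are hypothetical stand-ins for the Migrator's state:

```cpp
#include <set>

using mds_rank_t = int;

struct ImportState {
  std::set<mds_rank_t> bystanders;
};

void send_abort_notify(mds_rank_t) { /* stub: queue an MExportDirNotify */ }

void handle_failure(ImportState& st, mds_rank_t failed) {
  st.bystanders.erase(failed);   // drop the failed bystander *first*
  for (mds_rank_t b : st.bystanders)
    send_abort_notify(b);        // only live bystanders are notified, so no
                                 // unexpected MExportDirNotifyAck can arrive
}
```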
This reverts commit 99c9147.

Sending slave requests to all replica mds simplifies the subtree resolve
process of mds recovery. It guarantees that all mds have a consistent
subtree map. Otherwise, a replica mds can fail to receive the
MDentryUnlink message if the master mds fails; when the master mds
recovers, its subtree map differs from the replica mds's.

Signed-off-by: Yan, Zheng <zyan@redhat.com>
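A toy sketch of the invariant the revert restores, using hypothetical types: if the witness request reaches every replica, every cache converges on the same state even if the master dies before resending:

```cpp
#include <set>
#include <vector>

struct Replica {
  std::set<int> dentries{1, 2, 3};            // toy stand-in for cached state
  void witness_unlink(int dn) { dentries.erase(dn); }
};

void master_unlink(std::vector<Replica>& replicas, int dn) {
  for (auto& r : replicas)   // broadcast to *all* replicas, not a subset
    r.witness_unlink(dn);    // every cache converges on the same state
}
```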
Signed-off-by: Yan, Zheng <zyan@redhat.com>
EMetaBlob::add_primary_dentry() updates the inode's last_journaled.
There is no need to do the same job in Migrator::decode_import_inode().

Signed-off-by: Yan, Zheng <zyan@redhat.com>
MDCache::create_subtree_map() uses MDCache::my_ambiguous_imports
and Migrator::is_ambiguous_import() to decide whether a subtree is an
ambiguous import. Submitting a log event can start a new segment
and submit an extra ESubtreeMap. So before submitting the EImportFinish
event, we need to clean up MDCache::my_ambiguous_imports and
Migrator::import_state.

Signed-off-by: Yan, Zheng <zyan@redhat.com>
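A minimal sketch of the ordering fix with hypothetical names; the point is only that the bookkeeping must be cleared before the event is journaled:

```cpp
#include <cstdint>
#include <map>

using dirfrag_t = uint64_t;

struct CacheLike {
  std::map<dirfrag_t, int> my_ambiguous_imports;  // read by create_subtree_map()

  void journal_event(dirfrag_t) { /* stub: may start a new segment and
                                     write a fresh ESubtreeMap */ }

  void finish_import(dirfrag_t df) {
    my_ambiguous_imports.erase(df);  // clean up *before* journaling, so a new
                                     // ESubtreeMap no longer sees the subtree
                                     // as an ambiguous import
    journal_event(df);               // submit EImportFinish
  }
};
```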
MDCache::try_trim_non_auth_subtree() frees the CDir, so the caller must not
touch the CDir afterwards.

Signed-off-by: Yan, Zheng <zyan@redhat.com>
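A self-contained sketch of the use-after-free pattern being fixed, modelling the ownership transfer with unique_ptr (the real code is not written this way; names are hypothetical):

```cpp
#include <iostream>
#include <memory>
#include <string>

struct CDir {
  std::string path;
};

// Trimming consumes (frees) the directory; modelled here by taking ownership.
bool try_trim_non_auth_subtree(std::unique_ptr<CDir> dir) {
  dir.reset();  // the CDir is gone after this call
  return true;
}

void trim_and_log(std::unique_ptr<CDir> dir) {
  std::string path = dir->path;  // copy anything we still need *first*
  if (try_trim_non_auth_subtree(std::move(dir)))
    std::cout << "trimmed " << path << "\n";  // safe: never touches the freed CDir
}
```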
While an mds recovers, most slave request replies are useless. If the
corresponding master request was not committed, the slave requests will be
rolled back. If the corresponding master request was committed, it is only
possible to receive a slave request reply of type OP_COMMITTED.
The OP_COMMITTED reply is useful only if it was triggered
by the recovering mds's resolve ack message.

Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
EMetaBlob::add_dir_context() skips adding inodes that have already
been journaled in the last ESubtreeMap. The log replay code only
replays the first ESubtreeMap; for the remaining ESubtreeMaps, it just
verifies that the subtree map in the cache matches the ESubtreeMap. If
unnecessary inodes were included in a non-first ESubtreeMap, those
inodes would not get added to the cache, and the log replay code could
find them missing when replaying the rest of the events in the log
segment.

Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
@@ -4287,9 +4287,13 @@ void MDCache::handle_cache_rejoin_weak(MMDSCacheRejoin *weak)
dout(10) << " claiming cap import " << p->first << " client." << q->first << " on " << *in << dendl;
Capability *cap = rejoin_import_cap(in, q->first, q->second, from);
Member

I think rejoin_import_cap can fail here if there is no session with the client?

Contributor Author

rejoin_import_cap returns NULL if it fails

Member

Hrm, I think I was looking at master instead of this commit. Disregard!

Make sure the base object is in the cache when handling an MDiscoverReply.

Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
@jcsp jcsp merged commit 13a52e9 into ceph:master Feb 1, 2017