mds: force client flush snap data before truncating objects #11994

ukernel · 2016-11-15T13:58:53Z

Snapshot data get lost if following sequence of events happen

client writes data to a file
make a snapshot
truncate the file
mds truncate file objects using the newest snap context
client flushes snap data using the old snap context

OSD first handles MDS's truncate request, it updates object's snap
context. When handling client's write request, OSD finds that
object's snap context is newer than request's snap context. So
it uses the newer one and treats the data as if they were
written after the snapshot.

The fix is avoid touching file objects while clients may have
unflushed snap data. Before truncating file objects, MDS checks
if clients may have unflushed snap data. If client have, MDS
set filelock to a special unstable state, the state revokes Fb
capability. MDS starts truncating file objects after the Fb
capability get revoked.

Fixes: http://tracker.ceph.com/issues/17193
Signed-off-by: Yan, Zheng zyan@redhat.com

Snapshot data get lost if following sequence of events happen - client writes data to a file - make a snapshot - truncate the file - mds truncate file objects using the newest snap context - client flushes snap data using the old snap context OSD first handles MDS's truncate request, it updates object's snap context. When handling client's write request, OSD finds that object's snap context is newer than request's snap context. So it uses the newer one and treats the data as if they were written after the snapshot. The fix is avoid touching file objects while clients may have unflushed snap data. Before truncating file objects, MDS checks if clients may have unflushed snap data. If client have, MDS set filelock to a special unstable state, the state revokes Fb capability. MDS starts truncating file objects after the Fb capability get revoked. Fixes: http://tracker.ceph.com/issues/17193 Signed-off-by: Yan, Zheng <zyan@redhat.com>

gregsfortytwo · 2016-11-15T18:30:18Z

/subscribe

ghost · 2016-11-16T16:38:47Z

jenkins test this please (eio, now ignored in master)

ghost · 2016-11-16T18:39:48Z

jenkins test this please (tox bug, now fixed in master)

jcsp · 2016-11-21T12:46:40Z

My test branch had a failure in a snapshot test:
http://pulpito.ceph.com/jspray-2016-11-18_13:57:54-fs-wip-jcsp-testing-20161118-distro-basic-smithi/559675

Failure: "2016-11-18 14:23:07.939915 mds.0 172.21.15.77:6812/4225298329 2 : cluster [ERR] dir 100000045a1 object missing on disk; some files may be lost (/client.0/tmp/k/coreutils-8.5/tests/pr)" in cluster log

ukernel · 2016-11-22T09:50:20Z

unrelated bug http://tracker.ceph.com/issues/17990

gregsfortytwo

Mostly looks good.

gregsfortytwo · 2016-11-22T17:03:52Z

src/mds/MDCache.cc

+      if (in->filelock.is_stable()) {
+	in->auth_pin(&in->filelock);
+      } else {
+	assert(in->filelock.get_state() == LOCK_XLOCKSNAP);


Don't we still need to auth_pin() the file here?

LOCK_XLOCKSNAP is unstable state, it was already auth pinned

gregsfortytwo

Reviewed-by: Greg Farnum gfarnum@redhat.com

gregsfortytwo · 2016-11-23T15:32:21Z

src/mds/MDCache.cc

+      if (in->filelock.is_stable()) {
+	in->auth_pin(&in->filelock);
+      } else {
+	assert(in->filelock.get_state() == LOCK_XLOCKSNAP);


gregsfortytwo · 2016-11-23T15:33:27Z

Should build a test for this too @ukernel — we don't have much white box testing around recovery or snapshots.

ukernel added bug-fix cephfs Ceph File System labels Nov 15, 2016

gregsfortytwo self-assigned this Nov 17, 2016

gregsfortytwo reviewed Nov 22, 2016

View reviewed changes

gregsfortytwo assigned ukernel and unassigned gregsfortytwo Nov 22, 2016

gregsfortytwo approved these changes Nov 23, 2016

View reviewed changes

gregsfortytwo added the needs-qa label Nov 23, 2016

jcsp merged commit 59666c9 into ceph:master Nov 23, 2016

ukernel deleted the wip-17193 branch January 12, 2017 01:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mds: force client flush snap data before truncating objects #11994

mds: force client flush snap data before truncating objects #11994

ukernel commented Nov 15, 2016

gregsfortytwo commented Nov 15, 2016

ghost commented Nov 16, 2016

ghost commented Nov 16, 2016

jcsp commented Nov 21, 2016

ukernel commented Nov 22, 2016

gregsfortytwo left a comment

gregsfortytwo Nov 22, 2016

ukernel Nov 22, 2016

gregsfortytwo Nov 23, 2016

gregsfortytwo left a comment

gregsfortytwo Nov 23, 2016

gregsfortytwo commented Nov 23, 2016

mds: force client flush snap data before truncating objects #11994

mds: force client flush snap data before truncating objects #11994

Conversation

ukernel commented Nov 15, 2016

gregsfortytwo commented Nov 15, 2016

ghost commented Nov 16, 2016

ghost commented Nov 16, 2016

jcsp commented Nov 21, 2016

ukernel commented Nov 22, 2016

gregsfortytwo left a comment

Choose a reason for hiding this comment

gregsfortytwo Nov 22, 2016

Choose a reason for hiding this comment

ukernel Nov 22, 2016

Choose a reason for hiding this comment

gregsfortytwo Nov 23, 2016

Choose a reason for hiding this comment

gregsfortytwo left a comment

Choose a reason for hiding this comment

gregsfortytwo Nov 23, 2016

Choose a reason for hiding this comment

gregsfortytwo commented Nov 23, 2016