rbd-nbd: fix kernel deadlock during teuthology testing #10985

dillaman · 2016-09-05T18:37:03Z

Fixes: http://tracker.ceph.com/issues/16921
Signed-off-by: Jason Dillaman dillaman@redhat.com

dillaman · 2016-09-05T18:37:34Z

http://pulpito.ceph.com/jdillaman-2016-09-05_14:09:59-rbd-wip-16921---basic-smithi/

trociny · 2016-09-07T08:31:28Z

src/tools/rbd_nbd/rbd-nbd.cc

@@ -614,6 +614,8 @@ static int do_map()

      server.start();
      ioctl(nbd, NBD_DO_IT);
+      ioctl(nbd, NBD_CLEAR_QUE);


@dillaman NBD_CLEAR_QUE looks like deprecated and nop.

Yeah, I just put these in here to match qemu-nbd's implementation.

trociny · 2016-09-07T13:00:50Z

@dillaman Still observing hangs running this on teuthology:

http://pulpito.ceph.com/trociny-2016-09-07_07:30:28-rbd-wip-mgolub-testing---basic-mira/
http://pulpito.ceph.com/trociny-2016-09-07_07:29:56-rbd-wip-mgolub-testing---basic-vps/

Also, there were failures:

2016-09-07T07:44:18.420 INFO:teuthology.orchestra.run.vpm021.stdout:rbd_resize(65375232) failed
2016-09-07T07:44:18.420 INFO:teuthology.orchestra.run.vpm021.stdout:do_clone: ops->resize: Device or resource busy

http://pulpito.ceph.com/trociny-2016-09-07_07:31:00-rbd-wip-mgolub-testing---basic-vps/
Though it might be unrelated.

dillaman · 2016-09-07T13:07:16Z

@trociny Definitely a different issue:

[ 7241.278129] block nbd0: Other side returned error (22)
[ 7241.283380] blk_update_request: I/O error, dev nbd0, sector 456576

This implies that rbd-nbd didn't receive an update notification:

$ rbd info pool_client.0/image_client.0-clone15
rbd image 'image_client.0-clone15':
size 176 MB in 45 objects

$ sudo blockdev --getsize64 /dev/nbd0
185068544

The last recorded fsx resize should have made the image 118,824,448 bytes:

2016-09-07T07:56:14.708 INFO:teuthology.orchestra.run.vpm107.stdout:1460 trunc from 0xd404c00 to 0x7151e00

The backtrace from the fsx process shows its in the middle of a write-induced resize, so there must be a race condition between resizing the image and a synchronous read/write operation noticing the change.

dillaman · 2016-09-07T13:52:00Z

The resize is hanging because rbd-nbd (the exclusive lock owner) isn't responding -- and it isn't responding most likely because its librbd thread is deadlocked. It looks like you have gdb attached to it so I cannot see what its doing.

trociny · 2016-09-07T14:05:59Z

Sorry, yes I tried attaching with gdb but it hanged and letft the terminal opened. I have kiled my session.

dillaman · 2016-09-07T14:33:46Z

@trociny Ok -- same thing is happening to me. I am thoroughly convinced that the nbd block driver has lots of edge conditions. Looking at the deadlock listed on mira023 (which is running a 4.8-rc5 kernel instead of stock), it is showing a deadlock in the nbd block driver shut down logic and looking at the code it is definitely a problem and I don't immediately see a way around it.

dillaman · 2016-09-09T01:35:51Z

@trociny It appears to be stable now -- I updated the tests to not colocate nbd with the OSDs and I fixed the resize failure.

ceph/ceph-qa-suite#1170

dillaman · 2016-09-09T01:40:43Z

http://pulpito.ceph.com/jdillaman-2016-09-08_20:44:48-rbd-wip-16921---basic-smithi/
http://pulpito.ceph.com/jdillaman-2016-09-08_20:44:36-rbd-wip-16921---basic-vps/
http://pulpito.ceph.com/jdillaman-2016-09-08_15:43:38-rbd-wip-16921---basic-vps/

dillaman · 2016-09-09T01:48:59Z

Kicked off rbd suite run: http://pulpito.ceph.com/jdillaman-2016-09-08_21:48:20-rbd-wip-16921---basic-smithi/

rbd-nbd: fix kernel deadlock during teuthology testing #10985

trociny · 2016-09-09T06:13:04Z

src/librbd/operation/ResizeRequest.h

@@ -121,6 +121,9 @@ class ResizeRequest : public Request<ImageCtxT> {
  Context *send_append_op_event();
  Context *handle_append_op_event(int *result);

+  void send_flush_cache();
+  Context *handle_flush_cache(int *result);


@dillaman Update the diagram?

Fixes: http://tracker.ceph.com/issues/16921 Signed-off-by: Jason Dillaman <dillaman@redhat.com>

Signed-off-by: Jason Dillaman <dillaman@redhat.com>

Any potential writeback outside the extents of a shrunk image would result in orphaned objects. Signed-off-by: Jason Dillaman <dillaman@redhat.com>

Signed-off-by: Jason Dillaman <dillaman@redhat.com>

dillaman · 2016-09-09T12:35:03Z

@trociny Pushed updates -- the unrelated blacklisting test failure is fixed by PR #11034 and the dynamic_features failure is covered by open ticket http://tracker.ceph.com/issues/17227

trociny · 2016-09-09T12:46:36Z

lgtm

huww98 · 2021-04-15T13:05:23Z

src/tools/rbd_nbd/rbd-nbd.cc

@@ -727,6 +729,7 @@ static int rbd_nbd(int argc, const char *argv[])
  env_to_vec(args);
  global_init(NULL, args, CEPH_ENTITY_TYPE_CLIENT, CODE_ENVIRONMENT_DAEMON,
              CINIT_FLAG_UNPRIVILEGED_DAEMON_DEFAULTS);
+  g_ceph_context->_conf->set_val_or_die("pid_file", "");


Hi Jason (and all reading this), I'm working on starting rbd-nbd with systemd. And it is best to let rbd-nbd write a pid file so that systemd can reliably determine the main process. But this config is disabled here. See this comment for more. Do you have any suggestions? thanks.

Why not go directly and add notify/watchdog support? That would make more sense I think.

I don't have a strong opinion here. I remember I was not very happy when it was added and we had some discussion. On the other hand for usual cases when the rbd-nbd processes are controlled by rbd map/unmap, which does not need the pid files, it is good to have the pid file disabled by default, to avoid pid file collisions.

So I think a solution could be to set the pidfile only if it is explicitly specified via command line arguments. Though I don't think it is a good place to discuss this issue. I suggest to open a tracker ticket, describing the issue, and the provide a PR with the proposed solution, and we may discuss it there.

dillaman added bug-fix rbd labels Sep 5, 2016

trociny added the wip-mgolub-testing label Sep 6, 2016

trociny self-assigned this Sep 6, 2016

trociny pushed a commit that referenced this pull request Sep 6, 2016

Merge branch 'wip-16921' into wip-mgolub-testing

b981fe1

rbd-nbd: fix kernel deadlock during teuthology testing #10985

trociny reviewed Sep 7, 2016
View reviewed changes

dillaman force-pushed the wip-16921 branch from 8f2c159 to 1d53346 Compare September 8, 2016 15:52

trociny pushed a commit that referenced this pull request Sep 9, 2016

Merge branch 'wip-16921' into wip-mgolub-testing

3128b9a

rbd-nbd: fix kernel deadlock during teuthology testing #10985

trociny reviewed Sep 9, 2016
View reviewed changes

Jason Dillaman added 4 commits September 9, 2016 08:21

rbd-nbd: fix kernel deadlock during teuthology testing

ce7c152

Fixes: http://tracker.ceph.com/issues/16921 Signed-off-by: Jason Dillaman <dillaman@redhat.com>

rbd-nbd: mask out-of-bounds IO errors caused by image shrink

c6cfb61

Signed-off-by: Jason Dillaman <dillaman@redhat.com>

librbd: invalidate cache before trimming image

3f93a19

Any potential writeback outside the extents of a shrunk image would result in orphaned objects. Signed-off-by: Jason Dillaman <dillaman@redhat.com>

librbd: ignore cache busy errors when shrinking an image

4ce6638

Signed-off-by: Jason Dillaman <dillaman@redhat.com>

dillaman force-pushed the wip-16921 branch from 1d53346 to 4ce6638 Compare September 9, 2016 12:23

trociny merged commit 253cdda into ceph:master Sep 9, 2016

dillaman deleted the wip-16921 branch September 9, 2016 13:34

huww98 reviewed Apr 15, 2021

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rbd-nbd: fix kernel deadlock during teuthology testing #10985

rbd-nbd: fix kernel deadlock during teuthology testing #10985

dillaman commented Sep 5, 2016

dillaman commented Sep 5, 2016

trociny Sep 7, 2016

dillaman Sep 7, 2016

trociny commented Sep 7, 2016

dillaman commented Sep 7, 2016 •

edited

Loading

dillaman commented Sep 7, 2016

trociny commented Sep 7, 2016

dillaman commented Sep 7, 2016

dillaman commented Sep 9, 2016

dillaman commented Sep 9, 2016

dillaman commented Sep 9, 2016

trociny Sep 9, 2016

dillaman commented Sep 9, 2016

trociny commented Sep 9, 2016

huww98 Apr 15, 2021

isodude Jan 20, 2023

trociny Jan 20, 2023

rbd-nbd: fix kernel deadlock during teuthology testing #10985

rbd-nbd: fix kernel deadlock during teuthology testing #10985

Conversation

dillaman commented Sep 5, 2016

dillaman commented Sep 5, 2016

trociny Sep 7, 2016

Choose a reason for hiding this comment

dillaman Sep 7, 2016

Choose a reason for hiding this comment

trociny commented Sep 7, 2016

dillaman commented Sep 7, 2016 • edited Loading

dillaman commented Sep 7, 2016

trociny commented Sep 7, 2016

dillaman commented Sep 7, 2016

dillaman commented Sep 9, 2016

dillaman commented Sep 9, 2016

dillaman commented Sep 9, 2016

trociny Sep 9, 2016

Choose a reason for hiding this comment

dillaman commented Sep 9, 2016

trociny commented Sep 9, 2016

huww98 Apr 15, 2021

Choose a reason for hiding this comment

isodude Jan 20, 2023

Choose a reason for hiding this comment

trociny Jan 20, 2023

Choose a reason for hiding this comment

dillaman commented Sep 7, 2016 •

edited

Loading