
hammer:librbd: The qemu VMs hang occasionally after a snapshot is created. #17045

Closed
wants to merge 4 commits into from

Conversation

chenyupeng360
Contributor

@chenyupeng360 chenyupeng360 commented Aug 16, 2017

Hi guys, please check whether my fix is appropriate. Thanks.

Fixes: http://tracker.ceph.com/issues/21009
Signed-off-by: yupeng chen <chenyupeng-it@360.cn>

…o_flush's

callback to eliminate potential deadlock for newly created snapshot.

Fixes: http://tracker.ceph.com/issues/21009
Signed-off-by: yupeng chen <chenyupeng-it@360.cn>
@chenyupeng360 chenyupeng360 changed the title rbdUse ImageCtx's writeback_handler rather than ThreadPool to process ai… hammer:librbd: The qemu VMs hang occasionally after a snapshot is created. Aug 16, 2017
@smithfarm smithfarm added this to the hammer milestone Aug 16, 2017
@smithfarm
Contributor

@chenyupeng360 hammer is very close to End-of-Life, and it's getting less and less probable that there will be another hammer point release. Can you test whether the bug is present in luminous or jewel?

@chenyupeng360
Contributor Author

@smithfarm OK, I will try it in jewel. Since several of our large clusters have been running hammer for quite a long time, online upgrading may not be an option. We might choose to deploy jewel on new clusters in the future.

@@ -114,7 +114,7 @@ struct C_AsyncCallback : public Context {
     : image_ctx(image_ctx), on_finish(on_finish) {
   }
   virtual void finish(int r) {
-    image_ctx->op_work_queue->queue(on_finish, r);
+    image_ctx->writeback_handler->queue(on_finish, r);

What if the in-memory cache is disabled? Then writeback_handler will be null.

Contributor Author

@dillaman You're right. Then is it necessary to fork a dedicated thread for callbacks, or can we just borrow another thread for that purpose?
Alternatively, since C_AsyncCallback is currently used solely for the asynchronous flush callback, we could handle it conditionally here in C_AsyncCallback::finish():
if (image_ctx->object_cacher != nullptr) {
  image_ctx->writeback_handler->queue(on_finish, r);
} else {
  on_finish->complete(r);
}

What do you think?

@chenyupeng360 The flush_async_operations() and flush_async_operations(Context*) methods could be modified to be like [1], where a helper Context carries a reentrant_safe boolean flag. The synchronous (reentrant-safe) version can then just complete directly.

[1] https://github.com/ceph/ceph/blob/master/src/librbd/ImageCtx.cc#L123
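
The pattern being suggested could be sketched roughly as below. Context, WorkQueue, and C_SafeFlush are simplified, illustrative stand-ins for the librbd types being discussed, not the actual hammer-branch API: the helper completes inline when marked reentrant-safe and defers to a work queue otherwise.

```cpp
#include <queue>
#include <utility>

// Minimal stand-in for librbd's Context: finish() runs the callback,
// complete() runs it and frees the object.
struct Context {
  virtual ~Context() {}
  virtual void finish(int r) = 0;
  void complete(int r) { finish(r); delete this; }
};

// Minimal stand-in for an op work queue: callbacks are queued and later
// drained by a worker.
struct WorkQueue {
  std::queue<std::pair<Context*, int>> q;
  void queue(Context *c, int r) { q.push({c, r}); }
  void drain() {
    while (!q.empty()) {
      auto p = q.front();
      q.pop();
      p.first->complete(p.second);
    }
  }
};

// The suggested helper: complete inline when the caller is reentrant-safe
// (the synchronous flush path), otherwise defer to the work queue.
struct C_SafeFlush : public Context {
  WorkQueue *work_queue;
  Context *on_finish;
  bool reentrant_safe;
  C_SafeFlush(WorkQueue *wq, Context *fin, bool safe)
    : work_queue(wq), on_finish(fin), reentrant_safe(safe) {}
  void finish(int r) override {
    if (reentrant_safe) {
      on_finish->complete(r);           // synchronous path: complete inline
    } else {
      work_queue->queue(on_finish, r);  // async path: defer to a worker
    }
  }
};

// Tiny probe callback used to observe when completion happens.
int completed = 0;
struct C_Flag : public Context {
  void finish(int) override { ++completed; }
};
```

The point of the flag is that the synchronous caller already holds no locks that the callback needs, so it can complete in place, while the asynchronous caller must bounce the completion to another thread to avoid the deadlock.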

Contributor Author

@dillaman OK, I will modify the code as suggested.

…dedicated thread.

If a snapshot is created out-of-band, the image refresh triggered by the snapshot
may occasionally block an aio_flush() issued before the snapshot was created.

Fixes: http://tracker.ceph.com/issues/21009
Signed-off-by: yupeng chen <chenyupeng-it@360.cn>
@chenyupeng360
Contributor Author

@smithfarm This issue cannot be reproduced in jewel.

@dillaman

@chenyupeng360 Upon closer inspection of the hammer branch, my suggestion won't work. I think the only true fix would be to add another thread (or just upgrade your clients to jewel, since this is already fixed there and hammer will most likely be EoL next week).

@chenyupeng360
Contributor Author

@dillaman As mentioned in the previous post, instead of upgrading existing clusters from hammer to jewel, we would rather deploy the jewel release from scratch on newly added clusters. So for the existing clusters, we may still have to fix the bug in the hammer release ourselves, even after it reaches EoL.

As to the second fix, I think it's OK to use the writeback handler for calling flush_cache() whenever possible (i.e. when the rbd cache is enabled). Could you please tell me what the drawback is, or what else I missed? Thanks.

If a snapshot is created out-of-band, the image refresh triggered by the snapshot
may occasionally block an aio_flush() issued before the snapshot was created.

Fixes: http://tracker.ceph.com/issues/21009
Signed-off-by: yupeng chen <chenyupeng-it@360.cn>
@chenyupeng360
Contributor Author

@dillaman I pushed my modification; please have a look at it. We will use the latest code to fix the issue in our release.

@@ -860,7 +864,7 @@ void _flush_async_operations(ImageCtx *ictx, Context *on_finish) {

 void ImageCtx::flush_async_operations() {
   C_SaferCond ctx;
-  _flush_async_operations(this, &ctx);
+  _flush_async_operations(this, new C_AsyncCallback(this, &ctx));

Is this change necessary?

Contributor Author

No, that is not necessary. I had thought all flush-related callbacks would be processed by the newly added Finisher, with one potential drawback: synchronous flush callbacks might be delayed, since all flush callbacks are pushed onto the Finisher queue and processed sequentially by the Finisher thread. So we can leave this unchanged here.

If a snapshot is created out-of-band, the image refresh triggered by the snapshot
may occasionally block an aio_flush() issued before the snapshot was created.

Leave synchronous flush callbacks processed in the ThreadPool's Context.

Fixes: http://tracker.ceph.com/issues/21009
Signed-off-by: yupeng chen <chenyupeng-it@360.cn>
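
The dedicated-thread approach described in this commit can be sketched as follows. AsyncFinisher is a hypothetical, simplified stand-in for Ceph's Finisher (std::function callbacks instead of Context*): async flush completions are handed to a separate thread, so they never run on, or wait for, the thread that is blocked refreshing the image.

```cpp
#include <condition_variable>
#include <deque>
#include <functional>
#include <mutex>
#include <thread>

// Sketch of a dedicated finisher thread for async flush callbacks.
// Queued callbacks are drained by a single worker thread; the destructor
// drains any remaining callbacks before joining.
class AsyncFinisher {
  std::mutex m;
  std::condition_variable cv;
  std::deque<std::function<void()>> q;
  bool stopping = false;
  std::thread worker;

  void run() {
    std::unique_lock<std::mutex> l(m);
    while (true) {
      cv.wait(l, [this] { return stopping || !q.empty(); });
      if (q.empty() && stopping) {
        return;  // drained and asked to stop
      }
      auto cb = std::move(q.front());
      q.pop_front();
      l.unlock();
      cb();      // run the callback outside the lock
      l.lock();
    }
  }

public:
  AsyncFinisher() : worker([this] { run(); }) {}

  ~AsyncFinisher() {
    {
      std::lock_guard<std::mutex> l(m);
      stopping = true;
    }
    cv.notify_one();
    worker.join();
  }

  // Called from the aio_flush completion path: defer the callback to the
  // finisher thread instead of running it on the refreshing thread.
  void queue(std::function<void()> cb) {
    {
      std::lock_guard<std::mutex> l(m);
      q.push_back(std::move(cb));
    }
    cv.notify_one();
  }
};
```

Because completions now run on their own thread, an out-of-band snapshot refresh can no longer deadlock against a pending aio_flush() callback; the trade-off, as noted above, is that callbacks are serialized through a single queue.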
@smithfarm
Contributor

Hammer is EOL.

@smithfarm smithfarm closed this Sep 3, 2017