librbd: possible deadlock if cluster connection closed after image #11037

Merged
merged 1 commit into from Sep 12, 2016

Conversation


@dillaman dillaman commented Sep 9, 2016

Fixes: http://tracker.ceph.com/issues/17254
Signed-off-by: Jason Dillaman <dillaman@redhat.com>

@xiaoxichen
Contributor

Looks good, hoping this makes it into 10.2.3.

@xiaoxichen
Contributor

retest this please

@dillaman
Author

dillaman commented Sep 9, 2016

@xiaoxichen If I created 10.2.2 dev packages with this fix, would you be able to verify that it solves the problem?

@xiaoxichen
Contributor

@dillaman of course, YES!

@dillaman
Author

dillaman commented Sep 9, 2016

@xiaoxichen Trusty v10.2.2 + this patch is available under the wip-17254-jewel branch. CentOS 7 builder failed due to environment issues.

@xiaoxichen
Contributor

@dillaman, thanks, we are all on Ubuntu Trusty :)

@xiaoxichen
Contributor

@dillaman, deployed the patched library to the cluster. Let's let it bake and see.

@trociny trociny self-assigned this Sep 10, 2016
Context *ctx = on_finish;
delete this;

ctx->complete(r);
Contributor


@dillaman Shouldn't it complete with ret_val?

@xiaoxichen
Contributor

@dillaman, it has been running well in our environment for more than 12 hours. Looking good, except that it should complete with ret_val.

trociny pushed a commit that referenced this pull request Sep 12, 2016
librbd: possible deadlock if cluster connection closed after image #11037
@trociny trociny merged commit d141b13 into ceph:master Sep 12, 2016
@dillaman dillaman deleted the wip-17254 branch September 12, 2016 11:58
Context *ctx = on_finish;
delete this;

ctx->complete(ret_val);
Contributor


@dillaman Ah, missed this: we can't use ret_val here. Need to copy it before delete.
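
For readers following the review thread, here is a minimal sketch of the "copy before delete" point. It is not the librbd code: `Context`, `Request`, and `run_request` below are hypothetical stand-ins, and the self-deleting `complete()` is an assumption about how the completion behaves. It only shows that both `on_finish` and `ret_val` are members, so each has to be copied into a local before `delete this`.

```cpp
#include <cassert>
#include <functional>
#include <utility>

// Hypothetical stand-in for a completion callback; not the real Ceph Context.
struct Context {
  std::function<void(int)> fn;
  explicit Context(std::function<void(int)> f) : fn(std::move(f)) {}

  void complete(int r) {
    fn(r);
    delete this;  // assumption: the completion frees itself once fired
  }
};

// Hypothetical request object that frees itself when it finishes.
struct Request {
  Context *on_finish;
  int ret_val;

  Request(Context *ctx, int r) : on_finish(ctx), ret_val(r) {}

  void finish() {
    // Copy every member that is still needed into locals first.
    Context *ctx = on_finish;
    int r = ret_val;
    delete this;       // no member may be read after this line
    ctx->complete(r);  // uses only the local copies
  }
};

// Hypothetical helper: runs one request and returns the value the callback saw.
int run_request(int value) {
  int seen = 0;
  auto *ctx = new Context([&seen](int r) { seen = r; });
  auto *req = new Request(ctx, value);
  req->finish();
  return seen;
}
```

In this sketch, `run_request(-5)` returns -5. Reading `ret_val` after `delete this` would instead be a use-after-free, even if it happens to appear to work in testing.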
