mimic: osd crashes in on_local_recover due to stray clone #22671

Closed

@pdvian pdvian commented Jun 22, 2018

liewegas and others added 2 commits June 21, 2018 22:12
If there is a stray clone (one that does not appear in the SnapSet) and
we do any sort of recovery on it, the OSD will crash.  Log an error
instead and continue.

This addresses a problem where a cluster has an unexpected clone that is
also not present on all replicas.  Running repair on that PG does not fix
the unexpected clone, and it causes the remaining OSDs to crash while
trying to recover it.

Include a test.

Fixes: https://tracker.ceph.com/issues/24396
Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 154330f)
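
The behavior change described in the commit message above can be sketched roughly as follows. This is a minimal illustration, not the Ceph code: `SnapSetSketch`, `RecoverOutcome`, and `recover_clone_sketch` are hypothetical names invented here, and the SnapSet is reduced to a plain list of clone ids. It only shows the pattern of reporting an error for a stray clone and continuing, rather than aborting.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical, simplified stand-in for a SnapSet: only the clone ids it lists.
struct SnapSetSketch {
  std::vector<std::uint64_t> clones;
};

// Outcome of one recovery step in this sketch (not a Ceph type).
struct RecoverOutcome {
  bool stray;         // true if the clone was missing from the SnapSet
  std::string error;  // error text to log; empty when nothing went wrong
};

// Hypothetical recovery hook: tolerate a stray clone instead of aborting.
RecoverOutcome recover_clone_sketch(const SnapSetSketch& ss,
                                    std::uint64_t clone_id) {
  auto it = std::find(ss.clones.begin(), ss.clones.end(), clone_id);
  if (it == ss.clones.end()) {
    // Stray clone: build an error message and carry on rather than crashing.
    std::ostringstream msg;
    msg << "clone " << clone_id
        << " not found in snapset, skipping clone bookkeeping";
    return {true, msg.str()};
  }
  // Past this point the clone is known to be listed, so clone-specific
  // bookkeeping could safely run here.
  assert(*it == clone_id);
  return {false, ""};
}
```

With a SnapSet listing clones 1 and 4, recovering clone 4 takes the normal path, while recovering clone 7 returns an outcome flagged as stray with an error string for the caller to log.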
Also, create temporary files in test-specific dir and remove

Caused by: 154330f

Signed-off-by: David Zafman <dzafman@redhat.com>
(cherry picked from commit c1e96ae)
@smithfarm smithfarm added this to the mimic milestone Jun 22, 2018
@smithfarm smithfarm added the core label Jun 22, 2018
@smithfarm
Contributor

#22671 (this PR) was integrated into #22826 to avoid merge conflicts

@smithfarm smithfarm closed this Jul 3, 2018