osd/scrub: verify SnapMapper consistency #47388
Force-pushed from 3b11e17 to 6a38c0c.
Looking good! Left a few comments.
It might be worth adding a simple test.
Force-pushed from 6652b79 to 7b7ece5.
Seems to be heading in the right direction.
We already have tests that inject errors and validate that the scrub behaves correctly. These scenarios need to be added to those tests. In particular, we want a test scenario that does a good job of mimicking the SNA_ removal upgrade bug.
Force-pushed from 7b7ece5 to aafb8e6.
Force-pushed from 3269d12 to 3d50361.
Force-pushed from 3d50361 to 38b0098.
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
Whenever the scrubber accesses the SnapMapper for the snaps of a specific clone, the mapper will now verify that the snaps have the required mapping DB entries (the 'SNA_' keys). Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
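To illustrate the check described in this commit message, here is a minimal sketch in Python. It is not the Ceph implementation: the in-memory key set, the simplified key layout, and the names `mapping_key` and `missing_snaps` are all assumptions made for illustration. Only the 'SNA_' prefix comes from the PR text.

```python
from typing import Set

# Prefix named in the PR; everything after it in the key is a simplified,
# hypothetical layout, not the real SnapMapper key format.
MAPPING_PREFIX = "SNA_"


def mapping_key(snap_id: int, obj: str) -> str:
    """Build a hypothetical mapping key for a (snap, object) pair."""
    return f"{MAPPING_PREFIX}{snap_id:016X}_{obj}"


def missing_snaps(db_keys: Set[str], obj: str, snaps: Set[int]) -> Set[int]:
    """Return the snaps of `obj` that have no mapping entry in `db_keys`.

    An empty result means the object's snap set and the mapping entries
    agree; a non-empty result is the inconsistency a scrub would report.
    """
    return {s for s in snaps if mapping_key(s, obj) not in db_keys}
```

With a key set holding entries for snaps 1 and 2 of a clone, asking about snaps 1, 2 and 3 would flag only snap 3 as lacking its mapping entry.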
The test (in the standalone/scrub suite) verifies that the scrubber detects (and issues a cluster-log error) whenever a mapping entry ("SNA_") is missing in the SnapMapper DB. Specifically, here the entry is corrupted - shortened as per https://tracker.ceph.com/issues/56147. Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
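The test scenario above can be modelled roughly as follows. This is a hedged sketch, not the standalone test itself: the key layout, the truncation length, the log message text, and the names `shorten_entry` and `scrub_clone` are hypothetical, and a plain Python list stands in for the cluster log.

```python
from typing import List, Set


def mapping_key(snap_id: int, obj: str) -> str:
    """Hypothetical, simplified mapping key for a (snap, object) pair."""
    return f"SNA_{snap_id:016X}_{obj}"


def shorten_entry(db_keys: Set[str], snap_id: int, obj: str, drop: int = 4) -> None:
    """Corrupt one entry by replacing its key with a truncated copy."""
    key = mapping_key(snap_id, obj)
    db_keys.remove(key)
    db_keys.add(key[:-drop])  # the shortened key no longer matches lookups


def scrub_clone(db_keys: Set[str], obj: str, snaps: Set[int],
                cluster_log: List[str]) -> bool:
    """Check every snap of `obj`; log one error if any entry is missing."""
    missing = sorted(s for s in snaps if mapping_key(s, obj) not in db_keys)
    if missing:
        cluster_log.append(
            f"ERR: {obj}: missing SnapMapper entries for snaps {missing}")
    return not missing
```

The flow mirrors the description: scrub a healthy clone and expect no log entry, shorten one key, scrub again, and expect exactly one error naming the affected snap.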
... systematically, over ceph::bufferlist. Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
Force-pushed from 944796e to a85ef8e.
jenkins retest this please
Merging based on Teuthology results (see above)
It looks like this was never backported to any release. Am I correct, and if so, shouldn't it be?
A manual backport of the relevant changes introduced in PR ceph#47388 (osd/scrub: verify SnapMapper consistency) Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
encountered when manually backporting PR ceph#47388. Mostly fmtlib-related. Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
updating the standalone tests to match the exact format of some log lines. A manual backport of PR ceph#47388. Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
The test (in the standalone/scrub suite) verifies that the scrubber detects (and issues a cluster-log error) whenever a mapping entry ("SNA_") is missing in the SnapMapper DB. Specifically, here the entry is corrupted - shortened as per https://tracker.ceph.com/issues/56147. A manual backport of PR ceph#47388. Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
encountered when manually backporting PR ceph#47388. Mostly fmtlib-related. Probably related to installed FMT-lib version. All changes copied from 'main'. Signed-off-by: Ronen Friedman <rfriedma@redhat.com>