Erasure code recovery should send additional reads if necessary #17920
Conversation
retest this please
Testing results: 2 unrelated failures out of 255 rados suite jobs.
retest this please
Sorry, I missed this PR. Will review it early tomorrow.
@@ -281,10 +289,10 @@ function TEST_rados_get_subread_eio_shard_0() {

 function TEST_rados_get_subread_eio_shard_1() {
     local dir=$1
-    setup_osds || return 1
+    setup_osds 4 || return 1
3 OSDs would suffice for an erasure pool of "2+1".
I'm taking 1 OSD down/out, which lets recovery begin onto the 4th OSD. Not sure this is critical to this test, but I'd rather use a more realistic scenario.
oh, i see.
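For context, a "2+1" pool means an erasure-code profile with k=2 data chunks and m=1 coding chunk, so each object spans 3 OSDs and the 4th OSD gives recovery somewhere to go. A rough sketch of such a setup (profile and pool names are made up, not from this PR):

```
# Hypothetical names; k=2 data chunks, m=1 coding chunk per object.
ceph osd erasure-code-profile set ec21profile k=2 m=1
ceph osd pool create ecpool 8 8 erasure ec21profile
```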
    local -a initial_osds=($(get_osds $poolname $objname))
    local last=$((${#initial_osds[@]} - 1))
    # Kill OSD
    kill_daemons $dir TERM osd.${initial_osds[$last]} >&2 < /dev/null || return 1
nit: if we want to get the last element of a bash array,
    local last_osd=${initial_osds[-1]}
would suffice. I checked bash 4.3.11, shipped with Ubuntu Trusty, and it works there too.
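The two forms can be compared in a quick sketch (the array contents here are hypothetical, not from the PR); the explicit-index form works in any bash with arrays, while the negative subscript needs bash 4.3 or newer:

```shell
#!/usr/bin/env bash
# Hypothetical stand-in for the test's initial_osds array.
initial_osds=(3 7 1 5)

# Form used in the PR: compute the last index explicitly.
last=$((${#initial_osds[@]} - 1))
echo "${initial_osds[$last]}"   # prints 5

# Shorthand suggested in the review, available since bash 4.3.
echo "${initial_osds[-1]}"      # prints 5
```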
    ceph osd out ${initial_osds[$last]} || return 1

    # Cluster should recovery this object
nit: remove the empty line, and s/recovery/recover/.
src/osd/ECBackend.cc
Outdated
@@ -1204,6 +1203,7 @@ void ECBackend::handle_sub_read_reply(
     set<int> want_to_read, dummy_minimum;
     get_want_to_read_shards(&want_to_read);
     int err;
+    // TODO: Should we include non-acting nodes here when for_recovery is set?
yeah, i think we should.
src/osd/ECBackend.cc
Outdated
@@ -1485,19 +1484,12 @@ void ECBackend::call_write_ordered(std::function<void(void)> &&cb) {
   }
 }

-int ECBackend::get_min_avail_to_read_shards(
+void ECBackend::get_shards_and_have(
might want to rename it to get_all_avail_shards()?
lgtm, once the fixup commits are folded.
Signed-off-by: David Zafman <dzafman@redhat.com>
Signed-off-by: David Zafman <dzafman@redhat.com>
For now it doesn't include non-acting OSDs. Added a test for this case.
Signed-off-by: David Zafman <dzafman@redhat.com>
Signed-off-by: David Zafman <dzafman@redhat.com>
Fixes: http://tracker.ceph.com/issues/21382