HDDS-8535. ReplicationManager: Unhealthy containers could block EC recovery in small clusters #4756

siddhantsangwan · 2023-05-22T11:50:24Z

What changes were proposed in this pull request?

With EC containers, if there is a small cluster of say 6 nodes with EC-3-2, a container will require 5 nodes. If 2 containers become unhealthy, reconstruction will be required to recover the 2 containers, but there is only 1 spare node.
This means one will get recovered, and we will have 4 "good" containers and 2 UNHEALTHY and the container will remain stuck like this because UNHEALTHY containers are only removed once the container has no over or under replication.
A similar problem was resolved previously where an EC container with both over and under replication can meet the same problem, where under replication cannot proceed due to insufficient spare nodes. In that case, the solution was to check for this case, and call the over-replication handler to clear up the excess replicas.

This PR is still in draft state for some early reviews while I write tests and think about edge cases. Here, we try to delete an UNHEALTHY replica in the same handler to free up a DN. Then we throw the exception so that this container gets queued again in the under replication queue. Perhaps it's better to throw first if over replication handling is invoked, so we don't delete multiple replicas in one go.

What is the link to the Apache JIRA

https://issues.apache.org/jira/browse/HDDS-8535

How was this patch tested?

Wrote one UT.

…covery in small clusters

...rc/main/java/org/apache/hadoop/hdds/scm/container/replication/ECUnderReplicationHandler.java

sodonnel · 2023-05-23T11:53:12Z

...rc/main/java/org/apache/hadoop/hdds/scm/container/replication/ECUnderReplicationHandler.java

+
+    // remove replicas that aren't on IN_SERVICE and HEALTHY DNs
+    // the leftover replicas will be eligible for deletion
+    Iterator<ContainerReplica> iterator = replicaCount.getReplicas().iterator();


Its not clear to me why we are removing the replicas from the iterator - this is the list which is internal to replicaCount, so we are modifying it, as it does not return a copy or an unModifiable list.

We don't see to use the replicaCount.getReplicas() again in the rest of the method - we just form the closed list and use it. Do we the iterator.remove() calls?

You're right, this area has incorrect logic. My goal is to consider only those replicas that are on in service nodes. Will correct this this.

siddhantsangwan · 2023-05-25T13:47:40Z

@sodonnel Thanks for the review. I've addressed the buggy code so that there's no removal using the iterator now. I just add the needed replicas to the sets. Also added another unit test in the latest commit.

sodonnel

LGTM

HDDS-8535. ReplicationManager: Unhealthy containers could block EC re…

52247bc

…covery in small clusters

siddhantsangwan requested review from sodonnel and adoroszlai May 22, 2023 11:50

sodonnel reviewed May 23, 2023

View reviewed changes

...rc/main/java/org/apache/hadoop/hdds/scm/container/replication/ECUnderReplicationHandler.java Show resolved Hide resolved

sodonnel reviewed May 23, 2023

View reviewed changes

address comments, add a test

ca21fa5

siddhantsangwan marked this pull request as ready for review May 25, 2023 13:47

sodonnel approved these changes May 25, 2023

View reviewed changes

sodonnel merged commit 6f17f98 into apache:master May 25, 2023
36 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HDDS-8535. ReplicationManager: Unhealthy containers could block EC recovery in small clusters #4756

HDDS-8535. ReplicationManager: Unhealthy containers could block EC recovery in small clusters #4756

siddhantsangwan commented May 22, 2023

sodonnel May 23, 2023

siddhantsangwan May 25, 2023

siddhantsangwan commented May 25, 2023

sodonnel left a comment

HDDS-8535. ReplicationManager: Unhealthy containers could block EC recovery in small clusters #4756

HDDS-8535. ReplicationManager: Unhealthy containers could block EC recovery in small clusters #4756

Conversation

siddhantsangwan commented May 22, 2023

What changes were proposed in this pull request?

What is the link to the Apache JIRA

How was this patch tested?

sodonnel May 23, 2023

Choose a reason for hiding this comment

siddhantsangwan May 25, 2023

Choose a reason for hiding this comment

siddhantsangwan commented May 25, 2023

sodonnel left a comment

Choose a reason for hiding this comment