-
Notifications
You must be signed in to change notification settings - Fork 9.2k
HDFS-16514. Reduce the failover sleep time if multiple namenode are c… #4088
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: trunk
Are you sure you want to change the base?
Conversation
|
💔 -1 overall
This message was automatically generated. |
|
💔 -1 overall
This message was automatically generated. |
|
💔 -1 overall
This message was automatically generated. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The comments here looks need to be updated too.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@cndaimin I updated the code. Please take a look.
|
@tasanuma Please take a look at this. Thanks a lot. |
|
💔 -1 overall
This message was automatically generated. |
|
💔 -1 overall
This message was automatically generated. |
|
💔 -1 overall
This message was automatically generated. |
JIRA: HDFS-16514
Recently, we used the [Standby Read] feature in our test cluster, and deployed 4 namenode as follow:
node1 -> active nn
node2 -> standby nn
node3 -> observer nn
node3 -> observer nn
If we set ’dfs.client.failover.random.order=true‘, the client may failover twice and wait a long time to send msync to active namenode.
I think we can reduce the sleep time of the first several failover based on the number of namenode.
For example, if 4 namenode are configured, the sleep time of first three failover operations is set to zero.