New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Start Master|Node fault detection pinging immediately during discovery #6706

Closed

bleskes wants to merge 1 commit into elastic:master from bleskes:immediately_ping

Contributor

bleskes commented Jul 3, 2014

After a node joins the clusters, it starts pinging the master to verify it's health. Before, the cluster join request was processed async and we had to give some time to complete. With #6480 we changed this to wait for the join process to complete on the master. We can therefore start pinging immediately for fast detection of failures. Similar change can be made to the Node fault detection from the master side.


          [Discovery] immediately start Master|Node fault detection pinging

321277e

After a node joins the clusters, it starts pinging the master to verify it's health. Before, the cluster join request was processed async and we had to give some time to complete. With  elastic#6480 we changed this to wait for the join process to complete on the master. We can therefore start pinging immediately for fast detection of failures. Similar change can be made to the Node fault detection from the master side.

bleskes added v1.3.0 labels

Member

martijnvg commented Jul 3, 2014

LGTM

bleskes closed this in

ae16956

bleskes removed the review label

bleskes deleted the immediately_ping branch

July 3, 2014 12:53

bleskes added a commit that referenced this pull request


          [Discovery] immediately start Master|Node fault detection pinging

9757a6e

After a node joins the clusters, it starts pinging the master to verify it's health. Before, the cluster join request was processed async and we had to give some time to complete. With  #6480 we changed this to wait for the join process to complete on the master. We can therefore start pinging immediately for fast detection of failures. Similar change can be made to the Node fault detection from the master side.

Closes #6706

bleskes added a commit that referenced this pull request


          Revert "[Discovery] immediately start Master|Node fault detection pin…

caf11ff

…ging"

In #6706 we change the master validation to start pining immediately after a new master as ellected or a node joined. The idea is to have a quicker response to failures. This does however create a problem if the new master has yet fully processed it's ellection and responds to the ping with a NoLongerMasterException. This causes the source node to remove the current master and ellect another, only to find out it's not a master either and so forth. We are moving this change to the feature/improve_zen branch, where the improvements we made will cause the situation to be handled properly.

This reverts commit ae16956.

bleskes added a commit that referenced this pull request


          Revert "[Discovery] immediately start Master|Node fault detection pin…

20cd74d

…ging"

In #6706 we change the master validation to start pining immediately after a new master as ellected or a node joined. The idea is to have a quicker response to failures. This does however create a problem if the new master has yet fully processed it's ellection and responds to the ping with a NoLongerMasterException. This causes the source node to remove the current master and ellect another, only to find out it's not a master either and so forth. We are moving this change to the feature/improve_zen branch, where the improvements we made will cause the situation to be handled properly.

This reverts commit ae16956.

bleskes added a commit that referenced this pull request


          [Discovery] immediately start Master|Node fault detection pinging

4ed028b

After a node joins the clusters, it starts pinging the master to verify it's health. Before, the cluster join request was processed async and we had to give some time to complete. With  #6480 we changed this to wait for the join process to complete on the master. We can therefore start pinging immediately for fast detection of failures. Similar change can be made to the Node fault detection from the master side.

Closes #6706

bleskes added a commit that referenced this pull request


          [Discovery] Start master fault detection after pingInterval

92cfa8a

This is to allow the master election to complete on the chosen master.

 Relates to #6706

bleskes added a commit that referenced this pull request


          [Discovery] immediately start Master|Node fault detection pinging

a54a88b

After a node joins the clusters, it starts pinging the master to verify it's health. Before, the cluster join request was processed async and we had to give some time to complete. With  #6480 we changed this to wait for the join process to complete on the master. We can therefore start pinging immediately for fast detection of failures. Similar change can be made to the Node fault detection from the master side.

Closes #6706

bleskes added a commit that referenced this pull request


          [Discovery] Start master fault detection after pingInterval

7e85769

This is to allow the master election to complete on the chosen master.

 Relates to #6706

bleskes added a commit that referenced this pull request


          [Discovery] immediately start Master|Node fault detection pinging

f243aaf

After a node joins the clusters, it starts pinging the master to verify it's health. Before, the cluster join request was processed async and we had to give some time to complete. With  #6480 we changed this to wait for the join process to complete on the master. We can therefore start pinging immediately for fast detection of failures. Similar change can be made to the Node fault detection from the master side.

Closes #6706

bleskes added a commit that referenced this pull request


          [Discovery] Start master fault detection after pingInterval

3b8fedf

This is to allow the master election to complete on the chosen master.

 Relates to #6706

bleskes added a commit that referenced this pull request


          [Discovery] immediately start Master|Node fault detection pinging

432042f

After a node joins the clusters, it starts pinging the master to verify it's health. Before, the cluster join request was processed async and we had to give some time to complete. With  #6480 we changed this to wait for the join process to complete on the master. We can therefore start pinging immediately for fast detection of failures. Similar change can be made to the Node fault detection from the master side.

Closes #6706

bleskes added a commit that referenced this pull request


          [Discovery] Start master fault detection after pingInterval

1433b82

This is to allow the master election to complete on the chosen master.

 Relates to #6706

bleskes added the resiliency label

bleskes added a commit to bleskes/elasticsearch that referenced this pull request


          [Discovery] immediately start Master|Node fault detection pinging

bd69e0e

After a node joins the clusters, it starts pinging the master to verify it's health. Before, the cluster join request was processed async and we had to give some time to complete. With  elastic#6480 we changed this to wait for the join process to complete on the master. We can therefore start pinging immediately for fast detection of failures. Similar change can be made to the Node fault detection from the master side.

Closes elastic#6706

bleskes added a commit to bleskes/elasticsearch that referenced this pull request


          [Discovery] Start master fault detection after pingInterval

3bde08b

This is to allow the master election to complete on the chosen master.

 Relates to elastic#6706

bleskes added a commit that referenced this pull request


          [Discovery] immediately start Master|Node fault detection pinging

3e08188

After a node joins the clusters, it starts pinging the master to verify it's health. Before, the cluster join request was processed async and we had to give some time to complete. With  #6480 we changed this to wait for the join process to complete on the master. We can therefore start pinging immediately for fast detection of failures. Similar change can be made to the Node fault detection from the master side.

Closes #6706

bleskes added a commit that referenced this pull request


          [Discovery] Start master fault detection after pingInterval

975b689

This is to allow the master election to complete on the chosen master.

 Relates to #6706

clintongormley changed the title ~~[Discovery] immediately start Master|Node fault detection pinging~~ Resiliency: Start Master|Node fault detection pinging immediately during discovery

bleskes added a commit that referenced this pull request


          [Discovery] immediately start Master|Node fault detection pinging

0e9ca5e

After a node joins the clusters, it starts pinging the master to verify it's health. Before, the cluster join request was processed async and we had to give some time to complete. With  #6480 we changed this to wait for the join process to complete on the master. We can therefore start pinging immediately for fast detection of failures. Similar change can be made to the Node fault detection from the master side.

Closes #6706

bleskes added a commit that referenced this pull request


          [Discovery] Start master fault detection after pingInterval

c0cd013

This is to allow the master election to complete on the chosen master.

 Relates to #6706

bleskes added a commit that referenced this pull request


          [Discovery] immediately start Master|Node fault detection pinging

79c87ce

After a node joins the clusters, it starts pinging the master to verify it's health. Before, the cluster join request was processed async and we had to give some time to complete. With  #6480 we changed this to wait for the join process to complete on the master. We can therefore start pinging immediately for fast detection of failures. Similar change can be made to the Node fault detection from the master side.

Closes #6706

bleskes added a commit that referenced this pull request


          [Discovery] Start master fault detection after pingInterval

87003d7

This is to allow the master election to complete on the chosen master.

 Relates to #6706

bleskes added a commit that referenced this pull request


          [Discovery] immediately start Master|Node fault detection pinging

66b3931

After a node joins the clusters, it starts pinging the master to verify it's health. Before, the cluster join request was processed async and we had to give some time to complete. With  #6480 we changed this to wait for the join process to complete on the master. We can therefore start pinging immediately for fast detection of failures. Similar change can be made to the Node fault detection from the master side.

Closes #6706

bleskes added a commit that referenced this pull request


          [Discovery] Start master fault detection after pingInterval

231d031

This is to allow the master election to complete on the chosen master.

 Relates to #6706

bleskes added a commit that referenced this pull request


          [Discovery] immediately start Master|Node fault detection pinging

1706ef2

After a node joins the clusters, it starts pinging the master to verify it's health. Before, the cluster join request was processed async and we had to give some time to complete. With  #6480 we changed this to wait for the join process to complete on the master. We can therefore start pinging immediately for fast detection of failures. Similar change can be made to the Node fault detection from the master side.

Closes #6706

bleskes added a commit that referenced this pull request


          [Discovery] Start master fault detection after pingInterval

4319bdb

This is to allow the master election to complete on the chosen master.

 Relates to #6706

bleskes added a commit that referenced this pull request


          [Discovery] immediately start Master|Node fault detection pinging

dda5fb0

After a node joins the clusters, it starts pinging the master to verify it's health. Before, the cluster join request was processed async and we had to give some time to complete. With  #6480 we changed this to wait for the join process to complete on the master. We can therefore start pinging immediately for fast detection of failures. Similar change can be made to the Node fault detection from the master side.

Closes #6706

bleskes added a commit that referenced this pull request


          [Discovery] Start master fault detection after pingInterval

ff9dcd0

This is to allow the master election to complete on the chosen master.

 Relates to #6706

bleskes added a commit that referenced this pull request


          [Discovery] immediately start Master|Node fault detection pinging

a268fdd

After a node joins the clusters, it starts pinging the master to verify it's health. Before, the cluster join request was processed async and we had to give some time to complete. With  #6480 we changed this to wait for the join process to complete on the master. We can therefore start pinging immediately for fast detection of failures. Similar change can be made to the Node fault detection from the master side.

Closes #6706

bleskes added a commit that referenced this pull request


          [Discovery] Start master fault detection after pingInterval

cc50708

This is to allow the master election to complete on the chosen master.

 Relates to #6706

bleskes added a commit that referenced this pull request


          [Discovery] immediately start Master|Node fault detection pinging

95e7268

After a node joins the clusters, it starts pinging the master to verify it's health. Before, the cluster join request was processed async and we had to give some time to complete. With  #6480 we changed this to wait for the join process to complete on the master. We can therefore start pinging immediately for fast detection of failures. Similar change can be made to the Node fault detection from the master side.

Closes #6706

bleskes added a commit that referenced this pull request


          [Discovery] Start master fault detection after pingInterval

5bba569

This is to allow the master election to complete on the chosen master.

 Relates to #6706

bleskes added a commit that referenced this pull request


          [Discovery] immediately start Master|Node fault detection pinging

5302a53

After a node joins the clusters, it starts pinging the master to verify it's health. Before, the cluster join request was processed async and we had to give some time to complete. With  #6480 we changed this to wait for the join process to complete on the master. We can therefore start pinging immediately for fast detection of failures. Similar change can be made to the Node fault detection from the master side.

Closes #6706

bleskes added a commit that referenced this pull request


          [Discovery] Start master fault detection after pingInterval

3586e38

This is to allow the master election to complete on the chosen master.

 Relates to #6706

bleskes added a commit to bleskes/elasticsearch that referenced this pull request


          [Discovery] immediately start Master|Node fault detection pinging

17874a4

After a node joins the clusters, it starts pinging the master to verify it's health. Before, the cluster join request was processed async and we had to give some time to complete. With  elastic#6480 we changed this to wait for the join process to complete on the master. We can therefore start pinging immediately for fast detection of failures. Similar change can be made to the Node fault detection from the master side.

Closes elastic#6706

bleskes added a commit to bleskes/elasticsearch that referenced this pull request


          [Discovery] Start master fault detection after pingInterval

95bc858

This is to allow the master election to complete on the chosen master.

 Relates to elastic#6706

bleskes added v1.4.0 and removed v1.3.0 labels

bleskes added a commit that referenced this pull request


          [Discovery] immediately start Master|Node fault detection pinging

6b07234

After a node joins the clusters, it starts pinging the master to verify it's health. Before, the cluster join request was processed async and we had to give some time to complete. With  #6480 we changed this to wait for the join process to complete on the master. We can therefore start pinging immediately for fast detection of failures. Similar change can be made to the Node fault detection from the master side.

Closes #6706

bleskes added a commit that referenced this pull request


          [Discovery] Start master fault detection after pingInterval

58861c5

This is to allow the master election to complete on the chosen master.

 Relates to #6706

clintongormley added the :Cluster label

clintongormley changed the title ~~Resiliency: Start Master|Node fault detection pinging immediately during discovery~~ Start Master|Node fault detection pinging immediately during discovery

clintongormley added :Distributed/Distributed and removed :Cluster labels

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment