New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ClusterInfoService should wipe local cache upon unknown exceptions #9449
Conversation
…eptions The InternalClusterInfoService reaches out to the nodes to get information about their disk usage and shard store size. Upon a node level error we currently remove the node info from the local cache. We should also clear the cache when we run into an error on the action level (excluding any info from all nodes). This also adds settings for the timeout used when waiting for nodes.
@@ -140,6 +151,11 @@ public void onMaster() { | |||
} | |||
} | |||
|
|||
// called from tests as well | |||
void updateOnce() { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This needs actual javadocs, just looking at it is doesn't tell that it executes the job with rescheduling false (you have to read the constructor argument for ClusterInfoUpdateJob
to see that
for (DiscoveryNode node : internalTestCluster.clusterService().state().getNodes()) { | ||
mockTransportService.addDelegate(node, new MockTransportService.DelegateTransport(mockTransportService.original()) { | ||
@Override | ||
public void sendRequest(DiscoveryNode node, long requestId, String action, TransportRequest request, TransportRequestOptions options) throws IOException, TransportException { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Dude! Linebreaks! Strive for 80 columns! (or at least 100 or 120) :D
@dakrone pushed another commit. thx |
LGTM |
…eptions The InternalClusterInfoService reaches out to the nodes to get information about their disk usage and shard store size. Upon a node level error we currently remove the node info from the local cache. We should also clear the cache when we run into an error on the action level (excluding any info from all nodes). This also adds settings for the timeout used when waiting for nodes. Closes #9449
…eptions The InternalClusterInfoService reaches out to the nodes to get information about their disk usage and shard store size. Upon a node level error we currently remove the node info from the local cache. We should also clear the cache when we run into an error on the action level (excluding any info from all nodes). This also adds settings for the timeout used when waiting for nodes. Closes #9449
…eptions The InternalClusterInfoService reaches out to the nodes to get information about their disk usage and shard store size. Upon a node level error we currently remove the node info from the local cache. We should also clear the cache when we run into an error on the action level (excluding any info from all nodes). This also adds settings for the timeout used when waiting for nodes. Closes elastic#9449
The InternalClusterInfoService reaches out to the nodes to get information about their disk usage and shard store size. Upon a node level error we currently remove the node info from the local cache. We should also clear the cache when we run into an error on the action level (excluding any info from all nodes).
This also adds settings for the timeout used when waiting for nodes.