Warn on slow metadata persistence #47130

DaveCTurner · 2019-09-25T17:06:55Z

Today if metadata persistence is excessively slow on a master-ineligible node
then the ClusterApplierService emits a warning indicating that the
GatewayMetaState applier was slow, but gives no further details. If it is
excessively slow on a master-eligible node then we do not see any warning at
all, although we might see other consequences such as a lagging node or a
master failure.

With this commit we emit a warning if metadata persistence takes longer than a
configurable threshold, which defaults to 10s. We also emit statistics that
record how much index metadata was persisted and how much was skipped since
this can help distinguish cases where IO was slow from cases where there are
simply too many indices involved.

Backport of #47005 to 7.x.

Today if metadata persistence is excessively slow on a master-ineligible node then the `ClusterApplierService` emits a warning indicating that the `GatewayMetaState` applier was slow, but gives no further details. If it is excessively slow on a master-eligible node then we do not see any warning at all, although we might see other consequences such as a lagging node or a master failure. With this commit we emit a warning if metadata persistence takes longer than a configurable threshold, which defaults to `10s`. We also emit statistics that record how much index metadata was persisted and how much was skipped since this can help distinguish cases where IO was slow from cases where there are simply too many indices involved.

elasticmachine · 2019-09-25T17:06:58Z

Pinging @elastic/es-distributed

DaveCTurner · 2019-09-25T17:08:02Z

Just™ a backport, no need for a review.

DaveCTurner added >enhancement :Distributed/Cluster Coordination Cluster formation and cluster state publication, including cluster membership and fault detection. backport v7.5.0 labels Sep 25, 2019

DaveCTurner merged commit 45c7783 into elastic:7.x Sep 26, 2019

DaveCTurner deleted the 2019-09-25-assert-no-exceptions-in-application-7.x branch September 26, 2019 06:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Warn on slow metadata persistence #47130

Warn on slow metadata persistence #47130

DaveCTurner commented Sep 25, 2019

elasticmachine commented Sep 25, 2019

DaveCTurner commented Sep 25, 2019

Warn on slow metadata persistence #47130

Warn on slow metadata persistence #47130

Conversation

DaveCTurner commented Sep 25, 2019

elasticmachine commented Sep 25, 2019

DaveCTurner commented Sep 25, 2019