A failed after write refresh can prevent advancing the local checkpoint even when the operations were made durable by the translog #108190
Labels
:Distributed/CRUD
A catch all label for issues around indexing, updating and getting a doc by id. Not search.
>enhancement
Team:Distributed
Meta label for distributed team
Today, when an after write refresh fails, we consider the
AsyncAfterWriteAction
as failed:elasticsearch/server/src/main/java/org/elasticsearch/action/support/replication/TransportWriteAction.java
Lines 466 to 471 in 7f83189
This prevents from advancing the local checkpoints with the safely persisted seq nos:
elasticsearch/server/src/main/java/org/elasticsearch/action/support/replication/ReplicationOperation.java
Lines 174 to 195 in 4076aa7
We should reconsider this behaviour and maybe advance the local checkpoints when the refresh failed for an unpromotable shard.
The text was updated successfully, but these errors were encountered: