etcd serves stale read requests before log is caught up during recovery/startup #3879

spacejam · 2015-11-17T05:27:39Z

Steps to reproduce:

write many monotonically increasing values to a key
restart server while trying to read the key in a loop
observe several values returned as the log catches up

It should wait until the log has caught up before serving reads.

xiang90 · 2015-11-17T05:58:08Z

@spacejam If you want to prevent stale read, you need to specify quorum=true.

philips · 2015-11-17T06:09:13Z

I think people are looking for an SLA for how "fresh" the results are. For example if we can give someone a GET request that only succeeds if the last successful heartbeat RPC was X seconds ago.

spacejam · 2015-11-17T08:07:34Z

Is there a case where it is desirable to serve read requests before the local state is fully recovered? This is likely to involve serving state that is far older than typical follower lag.

xiang90 · 2015-11-17T15:54:39Z

@spacejam I am curious about the use case. If user really care about the "freshness" more than linearizability, it might make sense to add a feature to block on the client request until:

fully applied existing entires
or fully caught up the current leader
or caught up with current leader at least N second ago

Or user can simply specify quorum=true.

But I am not convinced that user needs this fine grained control.

spacejam · 2015-11-19T17:39:37Z

I think people are generally fine with some slave lag, and would prefer stale data to no data in the presence of partitions, but I don't think many people are willing to trade extremely stale reads for a few moments of acausal availability in the rare case of server start-up. This is why many databases will not serve requests until they have fully recovered their local state upon initialization.

xiang90 · 2015-11-19T17:45:34Z

@spacejam Oh. Yea. For startup initialization, it is an easy fix and should be fixed.

It should wait until the log has caught up before serving reads.

Your original issue is border than initialization if I understand it correctly.

A server can recover from snapshot at runtime too. Shall we wait until it catches up with the leader?

A server might be partitioned and is far behind the leader. Shall we wait again?

I have some rough answers in my mind. But I do not have to plan to address them immediately.

xiang90 · 2016-03-29T20:04:08Z

From #3300:

etcd serves stale read requests before log is caught up during startup.

xiang90 · 2016-04-27T23:06:07Z

Closed by #5196

spacejam mentioned this issue Nov 17, 2015

automatic re-seeding sounds dangerous mesosphere-backup/etcd-mesos#85

Closed

xiang90 added the area/performance label Nov 20, 2015

xiang90 added this to the unplanned milestone Nov 20, 2015

xiang90 self-assigned this Nov 20, 2015

xiang90 mentioned this issue Dec 30, 2015

server endpoint to know sync state #4099

Closed

xiang90 mentioned this issue Mar 29, 2016

ETCD listens on the client port before the client uri is properly advertised, causing unexpected errors #3330

Closed

xiang90 changed the title ~~etcd serves stale read requests before log is caught up during recovery~~ etcd serves stale read requests before log is caught up during recovery/startup Mar 29, 2016

xiang90 mentioned this issue Apr 6, 2016

etcd returns "Key not found" error when issuing GET request soon after the etcd has started #4978

Closed

mitake mentioned this issue Apr 23, 2016

etcdserver: do not serve requests before finish the first internal proposal #5169

Merged

xiang90 modified the milestones: v3.0.0, unplanned Apr 27, 2016

xiang90 closed this as completed Apr 27, 2016

mfojtik mentioned this issue Jun 8, 2017

Use quorum read for origin api resources openshift/origin#14520

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

etcd serves stale read requests before log is caught up during recovery/startup #3879

etcd serves stale read requests before log is caught up during recovery/startup #3879

spacejam commented Nov 17, 2015

xiang90 commented Nov 17, 2015

philips commented Nov 17, 2015

spacejam commented Nov 17, 2015

xiang90 commented Nov 17, 2015

spacejam commented Nov 19, 2015

xiang90 commented Nov 19, 2015

xiang90 commented Mar 29, 2016

xiang90 commented Apr 27, 2016

etcd serves stale read requests before log is caught up during recovery/startup #3879

etcd serves stale read requests before log is caught up during recovery/startup #3879

Comments

spacejam commented Nov 17, 2015

xiang90 commented Nov 17, 2015

philips commented Nov 17, 2015

spacejam commented Nov 17, 2015

xiang90 commented Nov 17, 2015

spacejam commented Nov 19, 2015

xiang90 commented Nov 19, 2015

xiang90 commented Mar 29, 2016

xiang90 commented Apr 27, 2016