
client is unaware of unhealthy server when getting response #3265

Closed

yichengq opened this issue Aug 11, 2015 · 10 comments

@yichengq
Contributor

Because the client pins to one endpoint when possible, it always makes requests to the same server. If that server is isolated from the etcd cluster and stays unhealthy for a long time, the client will be just as stale for that long. We don't provide a way to handle or warn about this today.

@xiang90 xiang90 added this to the v3.0.0-maybe milestone Aug 12, 2015
@xiang90
Contributor

xiang90 commented Aug 25, 2015

/cc @mx2323 You might want to follow this.

@gyuho
Contributor

gyuho commented Oct 12, 2015

Is this still an issue?

Raft says:

The leader handles all client requests (if a client contacts a follower, the follower redirects it to the leader).

Can we use the same mechanism to at least notify the client that the target server is not reachable? When using a proxy, I see the error message proxy endpoint is not available when the proxy is misconfigured. Does etcd handle this case differently?

Thanks,

@yichengq
Contributor Author

@gyuho For watch and read, the server returns data from its local store. It is allowed/normal for clients to read from the local store. We just don't provide a way to handle or warn about an unhealthy server today.

@gyuho
Contributor

gyuho commented Oct 14, 2015

I see. Thanks,

@mitake
Contributor

mitake commented Dec 8, 2015

@yichengq @gyuho if a client turns on client.GetOptions.Quorum, the read is forced to go through the leader's store, so the client can become aware of an unhealthy server.

For example:

  1. create a cluster with 3 nodes
  2. create a key
  3. kill 2 nodes
  4. get the key with etcdctl get and the --quorum option (which turns on client.GetOptions.Quorum)

In such a sequence, the get request in step 4 fails because quorum is lost. So I think current etcd already has a solution for this issue (see the sketch below).
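A minimal sketch of such a quorum read with the v2 Go client (the endpoint and key below are placeholders, not from this thread):

```go
package main

import (
	"context"
	"fmt"
	"log"
	"time"

	"github.com/coreos/etcd/client"
)

func main() {
	c, err := client.New(client.Config{
		Endpoints:               []string{"http://127.0.0.1:2379"}, // placeholder endpoint
		HeaderTimeoutPerRequest: 5 * time.Second,
	})
	if err != nil {
		log.Fatal(err)
	}
	kapi := client.NewKeysAPI(c)

	ctx, cancel := context.WithTimeout(context.Background(), 5*time.Second)
	defer cancel()

	// Quorum: true forces the read through the leader, so the request
	// fails when the contacted member has lost quorum (step 4 above).
	resp, err := kapi.Get(ctx, "/foo", &client.GetOptions{Quorum: true})
	if err != nil {
		log.Fatalf("quorum read failed: %v", err)
	}
	fmt.Println(resp.Node.Value)
}
```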

@mx2323

mx2323 commented Dec 8, 2015

The issue isn't on reads; it's when a watch is registered on an etcd server, but the server the watch is sent to is partitioned from the rest of the etcd cluster. In these situations, the client will sit with an untriggered watch when it could have gone to another server and gotten the new update.
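Roughly, a v2 client watch looks like the sketch below (assuming the client/kapi setup from the quorum-read sketch above; the key is again a placeholder). If the contacted member is partitioned, Next can block indefinitely without returning an error:

```go
// Minimal sketch: watchKey blocks on changes to key using the KeysAPI
// from the quorum-read sketch above. If the member we are connected to
// is partitioned from the cluster, Next never delivers new events and
// never errors, so the caller sits on a stale watch.
func watchKey(kapi client.KeysAPI, key string) {
	w := kapi.Watcher(key, &client.WatcherOptions{AfterIndex: 0})
	for {
		resp, err := w.Next(context.Background())
		if err != nil {
			log.Printf("watch error: %v", err)
			return
		}
		log.Printf("%s %s=%s", resp.Action, resp.Node.Key, resp.Node.Value)
	}
}
```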

@mitake
Contributor

mitake commented Dec 14, 2015

@mx2323 thanks for pointing this out; I didn't consider the case of watch.

@xiang90 xiang90 self-assigned this May 10, 2016
@xiang90 xiang90 modified the milestones: v3.0.0, v3.0.0-maybe May 10, 2016
@xiang90
Contributor

xiang90 commented May 13, 2016

closed by #5332

@xiang90 xiang90 closed this as completed May 13, 2016
@immesys

immesys commented Jul 26, 2016

The readme still says reads can be satisfied by an unhealthy member, and references this issue. Is that still the case? If not, perhaps amend the readme?

@bweston92

Readme still seems to be out of date.
