
Distributed update API #8369

Closed
clintongormley opened this issue Nov 6, 2014 · 9 comments
Labels
:Distributed/CRUD A catch all label for issues around indexing, updating and getting a doc by id. Not search. >enhancement help wanted adoptme

Comments

@clintongormley

The difference between a primary and a replica shard should be just a flag which indicates the current role that a shard has. The workload should be the same for all shards in a group, meaning that it shouldn't matter how many primary shards there are on any one node.

The only API which doesn't follow this principle is the update API, which can run a potentially heavy script on the primary shard. This can result in hotspots in the cluster, where one node happens to host many primary shards (see #8149).

@bleskes suggested a way to fix this: by changing the update API to perform the GET and script phases on any primary or replica in a shard group. This would change the characteristics of update to be more like a normal distributed get-and-reindex.

This increases the window for conflicting changes, so it is probably worth changing retry_on_conflict to default to 1, instead of 0.

If the user is sure that their updates are light and won't cause hotspots on the primary, then they can opt in to primary-only updates with ?preference=_primary

/cc @s1monw
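
The proposed flow — perform the GET and script phases on any shard copy, then push the result through the normal indexing path, retrying on version conflict — can be sketched in plain Python against an in-memory versioned store. All names here are illustrative; this is a sketch of the semantics, not the Elasticsearch client API:

```python
# In-memory sketch of a get-and-reindex update with retry_on_conflict
# semantics. The Store class stands in for a shard group; it is not
# a real Elasticsearch API.

class VersionConflict(Exception):
    pass

class Store:
    """A versioned document store: id -> (version, source)."""
    def __init__(self):
        self.docs = {}

    def get(self, doc_id):
        return self.docs.get(doc_id, (0, {}))

    def index(self, doc_id, source, expected_version):
        # Optimistic concurrency: fail if the doc changed since the GET.
        current_version, _ = self.docs.get(doc_id, (0, {}))
        if current_version != expected_version:
            raise VersionConflict(doc_id)
        self.docs[doc_id] = (current_version + 1, source)

def update(store, doc_id, script, retry_on_conflict=1):
    """GET the doc, apply the script, reindex; retry on version conflict."""
    for attempt in range(retry_on_conflict + 1):
        version, source = store.get(doc_id)
        new_source = script(dict(source))  # "script" runs on any shard copy
        try:
            store.index(doc_id, new_source, expected_version=version)
            return
        except VersionConflict:
            if attempt == retry_on_conflict:
                raise

store = Store()
update(store, "1", lambda d: {**d, "counter": d.get("counter", 0) + 1})
update(store, "1", lambda d: {**d, "counter": d.get("counter", 0) + 1})
print(store.get("1"))  # (2, {'counter': 2})
```

Because the GET can now race with other writers, the version check on reindex is what detects a conflicting change, which is why bumping the default `retry_on_conflict` to 1 is suggested above.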

@s1monw
Contributor

s1monw commented Nov 6, 2014

huge +1

@clintongormley clintongormley changed the title Distributed upgrade API Distributed update API Nov 6, 2014
@martijnvg
Member

+1

@dnhatn
Member

dnhatn commented Mar 16, 2018

This issue can be avoided if read-modify-index is executed on the client side. Closing.

@dnhatn dnhatn closed this as completed Mar 16, 2018
@bleskes
Contributor

bleskes commented Mar 16, 2018

To be clear, for future readers: the issue this plan tries to sidestep still exists - updates put a disproportionate load on the primary. People who encounter it can work around it by doing updates from the client, as Nhat indicated. We're currently re-evaluating the plan described above, which is why the issue is closed.

@tdoman

tdoman commented Apr 6, 2018

@bleskes Thanks for the note for future readers, as I've noticed this in our ES 6.1.1 cluster (on Windows VMs). We have 3 nodes and 2 replicas of every index, and yet, for some reason, all the primary shards for every single index are on one node. The VMs periodically need updates and restarts on a staggered schedule, and we'll see the primaries shift, but they always end up all on one node. I could understand the primaries for a single index not being split, but I can't understand why every single index has each of its primaries on the same node. Perhaps there's a way to control this? At any rate, this node gets very hot when we're doing updates, especially during the nightly background jobs we run. If at least some of the index primaries were on other nodes, we could share the disproportionate load among all 3 nodes, since the updates span a variety of indexes.

If we take the client-side approach that @dnhatn suggests, how do we guarantee read/write consistency? In other words, in our scenario it's easily possible, though not common, to have multiple threads updating the same document at the same time. Would this be detected via the client libraries (we use NEST 6.0.1) as a version conflict? We do have code that handles version conflicts with a brief backoff and retry during our update jobs.

@bleskes
Contributor

bleskes commented Apr 9, 2018

@tdoman https://www.elastic.co/blog/elasticsearch-versioning-support explains how to map what the update API does to equivalent get-then-index patterns. If you have any questions about how to do this with the .NET client, I suggest asking on the discuss forums.
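
For future readers: the client-side pattern the linked post describes - get a document with its version, modify it, index it back conditioned on that version, and back off and retry on conflict - can be sketched with a self-contained, in-memory stand-in for the store. Every name and the backoff values here are illustrative, not any real client library; concurrent writers are simulated with threads:

```python
# Self-contained sketch of client-side get-then-index with optimistic
# concurrency control and backoff-and-retry on version conflict.
# The store and helper names are illustrative, not a real ES client.
import random
import threading
import time

class Conflict(Exception):
    pass

lock = threading.Lock()
docs = {"1": (1, {"counter": 0})}  # id -> (version, source)

def get(doc_id):
    with lock:
        return docs[doc_id]

def index(doc_id, source, if_version):
    with lock:
        version, _ = docs[doc_id]
        if version != if_version:
            raise Conflict()  # someone updated the doc since our GET
        docs[doc_id] = (version + 1, source)

def update_with_retry(doc_id, modify, retries=10):
    for _ in range(retries):
        version, source = get(doc_id)
        try:
            index(doc_id, modify(dict(source)), if_version=version)
            return
        except Conflict:
            time.sleep(random.uniform(0.001, 0.01))  # brief backoff
    raise RuntimeError("too many conflicts")

# Five concurrent writers incrementing the same counter: every conflict
# is detected by the version check, and the retry loop makes all five
# increments land.
threads = [
    threading.Thread(
        target=update_with_retry,
        args=("1", lambda d: {**d, "counter": d["counter"] + 1}),
    )
    for _ in range(5)
]
for t in threads:
    t.start()
for t in threads:
    t.join()
print(get("1"))  # (6, {'counter': 5})
```

This is the same conflict a real client surfaces as a version-conflict error on a conditional index request, so a backoff-and-retry loop like the one above answers the concurrent-writers concern raised earlier in the thread.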

@tdoman

tdoman commented Apr 9, 2018

@bleskes thanks, I will review that. Can you tell me where to go to find out why all the primary shards for every index are located on the same node? I have another cluster running the same version of ES where I have only one large index and there I see the primary shards for that index split between two of the three nodes. By the same token, is there a way to ask ES to distribute the primary shards among the nodes in the cluster? I'd assume it'd do that by default but in my case, it's not.

@tdoman

tdoman commented Apr 9, 2018

@bleskes I entered issue 29437 for this question as it seems something is wrong in ES 6.x that would cause every primary shard for every index to end up on the same node.

@bleskes
Contributor

bleskes commented Apr 10, 2018

@tdoman as Jason said in the other ticket - please open a topic on the forum, where we'd be happy to help. I think there are some basic misconceptions about the role of the primary. We keep GitHub for concrete issues and feature requests.
