Coordinator documentation is stale #7201

leventov · 2019-03-06T21:14:04Z

Coordinator documentation has the following paragraph:

Before any unassigned segments are serviced by historical nodes, the available historical nodes for each tier are first sorted in terms of capacity, with least capacity servers having the highest priority. Unassigned segments are always assigned to the nodes with least capacity to maintain a level of balance between nodes. The coordinator does not directly communicate with a historical node when assigning it a new segment; instead the coordinator creates some temporary information about the new segment under load queue path of the historical node. Once this request is seen, the historical node will load the segment and begin servicing it.

As far as I can tell, both key pieces of information that are communicated in this paragraph are wrong:

"Unassigned segments are always assigned to the nodes with least capacity" - no, actually regular balancing rules are in play during loading. (see DruidCoordinatorRuleRunner).
"The coordinator does not directly communicate with a historical node when assigning it a new segment" - actually it does, if HTTP announcing (should we better call it "HTTP segment loading info communication"?) is used.

Could somebody please verify my conclusions?

Suggestions about how this paragraph should be rephrased are also welcome (or you can go ahead with a PR yourself).

@clintropolis @egor-ryashin @gianm

The text was updated successfully, but these errors were encountered:

clintropolis · 2019-03-06T21:38:21Z

Yes, you are right, loading and dropping decisions are done with the BalancerStrategy so that the load rules and the balancer do not make contradictory decisions. I believe DiskNormalizedCostBalancerStrategy would achieve what the current document is describing, but I think CostBalancerStrategy or CachingCostBalancerStrategy don't care so much about disk i think and just prefer to spread out the timeline for maximum fanout.

You are also correct about http segment loading, where the coordinator maintains a list of segments to load and drop (HttpLoadQueuePeon), and talks to the historical directly over http (SegmentListerResource on historical side) to give it a batch of load or drop operations to do.

clintropolis · 2019-03-06T21:47:54Z

... actually it does, if HTTP announcing (should we better call it "HTTP segment loading info communication"?) is used.

Additionally, I suppose there are actually 2 ways it communicates directly, since http segment loading (druid.coordinator.loadqueuepeon.type of 'http', what I was describing in previous comment) and http announcing (druid.announcer.type) are sort of independent, since you can do things like use zk based announcements with http loading.

leventov · 2020-01-22T10:52:59Z

This comment should also be addressed: #7306 (comment)

leventov added Area - Documentation Area - Segment Balancing/Coordination labels Mar 6, 2019

leventov mentioned this issue Jan 22, 2020

Reconcile terminology and method naming to 'used/unused segments'; Rename MetadataSegmentManager to MetadataSegmentsManager #7306

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Coordinator documentation is stale #7201

Coordinator documentation is stale #7201

leventov commented Mar 6, 2019 •

edited

clintropolis commented Mar 6, 2019

clintropolis commented Mar 6, 2019

leventov commented Jan 22, 2020

Coordinator documentation is stale #7201

Coordinator documentation is stale #7201

Comments

leventov commented Mar 6, 2019 • edited

clintropolis commented Mar 6, 2019

clintropolis commented Mar 6, 2019

leventov commented Jan 22, 2020

leventov commented Mar 6, 2019 •

edited