-
Notifications
You must be signed in to change notification settings - Fork 3.8k
Description
Coordinator documentation has the following paragraph:
Before any unassigned segments are serviced by historical nodes, the available historical nodes for each tier are first sorted in terms of capacity, with least capacity servers having the highest priority. Unassigned segments are always assigned to the nodes with least capacity to maintain a level of balance between nodes. The coordinator does not directly communicate with a historical node when assigning it a new segment; instead the coordinator creates some temporary information about the new segment under load queue path of the historical node. Once this request is seen, the historical node will load the segment and begin servicing it.
As far as I can tell, both key pieces of information that are communicated in this paragraph are wrong:
- "Unassigned segments are always assigned to the nodes with least capacity" - no, actually regular balancing rules are in play during loading. (see
DruidCoordinatorRuleRunner). - "The coordinator does not directly communicate with a historical node when assigning it a new segment" - actually it does, if HTTP announcing (should we better call it "HTTP segment loading info communication"?) is used.
Could somebody please verify my conclusions?
Suggestions about how this paragraph should be rephrased are also welcome (or you can go ahead with a PR yourself).