
Remove indicesLifecycle.Listener from IndexingMemoryController #6892

Closed

Conversation

@bleskes (Contributor) commented Jul 16, 2014

The IndexingMemoryController determines the indexing buffer size and translog buffer size each shard should have. It takes memory from shards that are inactive (indexing-wise) and assigns it to other shards. To do so it needs to know about the addition and closing of shards. The current implementation hooks into the indicesService.indicesLifecycle() mechanism to receive callbacks, such as a shard entering the POST_RECOVERY state. Those callbacks typically run on the thread that actually made the change. A mutex was used to synchronize those callbacks with IndexingMemoryController's background thread, which updates the memory usage of the internal engines at a regular interval. This introduced a dependency between those threads and the locks of the internal engines hosted on the node. In a very rare situation (seen in two local test runs) this can cause recovery timeouts when two nodes are recovering replicas from each other.

This commit introduces a lock-free approach that updates the internal data structures during the background thread's iterations.
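To illustrate the shape of the change, here is a minimal Java sketch of the lock-free pattern described above. It is not the actual Elasticsearch implementation: the String shard identifiers, the `currentShards()` accessor, and the bookkeeping map are simplified stand-ins. The key idea is that the per-shard state is owned exclusively by the single background thread, which reconciles it against the live shard set on every iteration instead of reacting to lifecycle callbacks under a mutex.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Set;

// Minimal sketch of the lock-free reconciliation pattern; not the actual
// Elasticsearch code. All per-shard state is confined to the single
// background thread, so no mutex is needed.
public class IndexingMemorySketch {

    // Per-shard bookkeeping touched only by the background thread.
    private final Map<String, Long> lastActiveTime = new HashMap<>();

    // Hypothetical stand-in for asking IndicesService which shards the node hosts.
    private Set<String> currentShards() {
        return Set.of("index1[0]", "index1[1]");
    }

    // Invoked on a fixed schedule (e.g. via a ScheduledExecutorService).
    void runIteration() {
        Set<String> live = currentShards();

        // Shards that appeared since the last iteration: this replaces the
        // POST_RECOVERY lifecycle callback.
        for (String shard : live) {
            lastActiveTime.putIfAbsent(shard, System.currentTimeMillis());
        }

        // Shards that went away since the last iteration: this replaces the
        // shard-closed lifecycle callback. retainAll on the key-set view
        // removes the corresponding map entries.
        lastActiveTime.keySet().retainAll(live);

        // ... recompute indexing and translog buffer budgets over
        // lastActiveTime.keySet() ...
    }

    public static void main(String[] args) {
        new IndexingMemorySketch().runIteration();
    }
}
```

Because the bookkeeping is confined to one thread, shard additions and removals can never contend with the engines' internal locks, which removes the cross-thread dependency blamed for the recovery timeouts above.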

@bleskes added the v2.0.0 label and removed the v1.4.0 label Jul 16, 2014
@bleskes (Contributor, Author) commented Jul 17, 2014

Tagged as 1.3

@s1monw (Contributor) commented Jul 17, 2014

@bleskes I think this looks good. I left some comments, though on the commit rather than the PR :/ I like the removal of the listeners and the sync stuff.

@s1monw removed the review label Jul 17, 2014
@bleskes (Contributor, Author) commented Jul 17, 2014

@s1monw I pushed another update

@bleskes added the review label Jul 17, 2014
@s1monw (Contributor) commented Jul 17, 2014

NICE LGTM

@s1monw removed the review label Jul 17, 2014
@bleskes closed this in 38d8e3c Jul 17, 2014
bleskes added a commit that referenced this pull request Jul 17, 2014 (Closes #6892)

bleskes added a commit that referenced this pull request Jul 17, 2014 (Closes #6892)
@bleskes deleted the indexing_memory_controller_decoupling branch Jul 17, 2014 12:45
@clintongormley changed the title from "[Infra] remove indicesLifecycle.Listener from IndexingMemoryController" to "Remove indicesLifecycle.Listener from IndexingMemoryController" Jun 7, 2015
mute pushed a commit to mute/elasticsearch that referenced this pull request Jul 29, 2015 (Closes elastic#6892)