Core: only one optimize operation should run at once #9638

mikemccand · 2015-02-10T20:44:59Z

Even though we have ThreadPool.OPTIMIZE pool with size=1, if the incoming optimize request does not wait_for_completion, then Lucene's IndexWriter runs the optimize in the background and the request returns immediately, freeing the thread pool to run another optimize.

So if the application submits 10 optimize requests (without wait_for_completion), all 10 will run concurrently, which is bad.

If the optimize is for upgrade, or flush is requested, InternalEngine.optimize does submit a waitForMerges call back to the OPTIMIZE pool, but that's at the end, and so all 10 incoming requests will still run at once I think?

rjernst · 2015-02-10T20:58:51Z

If the optimize is for upgrade, or flush is requested, InternalEngine.optimize does submit a waitForMerges call back to the OPTIMIZE pool, but that's at the end, and so all 10 incoming requests will still run at once I think?

I think that is correct, or rather whatever other optimize requests have already been queued will be run before the blocking wait for merges for the first request that ran. So I think we need to somehow push the waiting thread to the front of the queue, and always have that regardless of the optimize settings (except for wait_for_completion, which of course just runs in the foreground and holds the single optimize thread).

rjernst · 2015-02-10T21:39:43Z

Ok, here is my proposal after speaking with Shay:

For 1.4.3: Change the default for wait_for_completion to true
For 1.5.0: Remove wait_for_completion (and wait_for_merges in the optimize api)
For 2.0: Once we have the task api, try to add back some of this async functionality as a long running task that can be managed.

mikemccand · 2015-02-10T21:46:04Z

+1 for this plan.

Separately, it would be nice if we could simply call IW.forceMerge(), which waits itself. This would fix #8923 ... must we really hold the readLock when calling forceMerge? Anyway, that can be done separately...

This has ended up being very trappy. Most people don't realize the parameter is there, and using a wildcard on index names for upgrade will end up essentially bypassing the optimize concurrency controls through its threadpool. See elastic#9638

This has been very trappy. Rather than continue to allow buggy behavior of having upgrade/optimize requests sidestep the single shard per node limits optimize is supposed to be subject to, this removes the ability to run the upgrade/optimize async. closes elastic#9638

This has been very trappy. Rather than continue to allow buggy behavior of having upgrade/optimize requests sidestep the single shard per node limits optimize is supposed to be subject to, this removes the ability to run the upgrade/optimize async. closes #9638

This has ended up being very trappy. Most people don't realize the parameter is there, and using a wildcard on index names for upgrade will end up essentially bypassing the optimize concurrency controls through its threadpool. See elastic#9638

mikemccand added v1.4.4 v1.5.0 v2.0.0-beta1 >bug labels Feb 10, 2015

rjernst added v1.4.3 and removed v1.4.4 labels Feb 10, 2015

rjernst mentioned this issue Feb 10, 2015

Upgrade: Change wait_for_completion to default to true #9639

Merged

rjernst mentioned this issue Feb 10, 2015

Remove ability to run optimize and upgrade async #9640

Merged

rjernst removed v1.5.0 v2.0.0-beta1 v1.4.3 labels Feb 10, 2015

rjernst closed this as completed in #9640 Feb 11, 2015

rjernst added a commit that referenced this issue Feb 12, 2015

Fix backcompat issue for #9638 (caused by bad cherry pick).

5fed9aa

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Core: only one optimize operation should run at once #9638

Core: only one optimize operation should run at once #9638

mikemccand commented Feb 10, 2015

rjernst commented Feb 10, 2015

rjernst commented Feb 10, 2015

mikemccand commented Feb 10, 2015

Core: only one optimize operation should run at once #9638

Core: only one optimize operation should run at once #9638

Comments

mikemccand commented Feb 10, 2015

rjernst commented Feb 10, 2015

rjernst commented Feb 10, 2015

mikemccand commented Feb 10, 2015