Java API: BulkProcessor does not call afterBulk when bulk throws eg NoNodeAvailableException #5038

Closed
anrask opened this issue Feb 6, 2014 · 8 comments


anrask commented Feb 6, 2014

When using a BulkProcessor configured with concurrentRequests > 0, if client.bulk(...) (line 283) throws an exception other than InterruptedException, the exception is not caught and the afterBulk callback is never invoked.

It is reproducible by running:

  • a transport client without an elasticsearch to connect to
  • with the BulkProcessor configured with concurrentRequests > 0
  • sending in enough documents so that the bulk is sent

As a result you only get an exception for one of the documents in the bulk, and there is no way to know that the other documents also failed.
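
Something along these lines reproduces it (a minimal sketch, not my exact code; the transport address, index name, and flush threshold are just placeholders):

```java
import org.elasticsearch.action.bulk.BulkProcessor;
import org.elasticsearch.action.bulk.BulkRequest;
import org.elasticsearch.action.bulk.BulkResponse;
import org.elasticsearch.action.index.IndexRequest;
import org.elasticsearch.client.Client;
import org.elasticsearch.client.transport.TransportClient;
import org.elasticsearch.common.transport.InetSocketTransportAddress;

public class BulkProcessorRepro {
    public static void main(String[] args) {
        // transport client pointing at an address where no elasticsearch node is listening
        Client client = new TransportClient()
                .addTransportAddress(new InetSocketTransportAddress("localhost", 9300));

        BulkProcessor processor = BulkProcessor.builder(client, new BulkProcessor.Listener() {
            @Override
            public void beforeBulk(long executionId, BulkRequest request) {
                System.out.println("beforeBulk: " + request.numberOfActions() + " actions");
            }

            @Override
            public void afterBulk(long executionId, BulkRequest request, BulkResponse response) {
                System.out.println("afterBulk (success)");
            }

            @Override
            public void afterBulk(long executionId, BulkRequest request, Throwable failure) {
                // expected to receive the NoNodeAvailableException, but is never called
                System.out.println("afterBulk (failure): " + failure);
            }
        })
                .setConcurrentRequests(1) // > 0 triggers the bug
                .setBulkActions(10)       // flush after 10 documents so the bulk is actually sent
                .build();

        // add enough documents for the bulk to be flushed and sent
        for (int i = 0; i < 10; i++) {
            processor.add(new IndexRequest("test", "doc", String.valueOf(i))
                    .source("{\"field\":" + i + "}"));
        }
    }
}
```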


karlney commented Feb 6, 2014

FYI this bug is related to #4153, #4155 and #4158

javanna self-assigned this Feb 7, 2014

vorce commented Mar 31, 2014

Any progress on this yet?

@btiernay

I'm trying to implement a workaround for #6314, and this fix is required for it. Is there any update on a fix here?

@telax1985

Hello,

I'm seeing an issue with the bulk request API where I am sending many bulk requests simultaneously.

When my index reaches a certain size and the amount of memory assigned to each shard is increased (e.g. from 56MB to 128MB), the pending bulk requests never get a response (there is no call to 'onResponse' or 'onFailure' on the BulkRequest ActionListener), which effectively blocks my application from sending any additional requests, because the semaphore permits acquired before calling 'execute' are never released.
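
The throttling pattern is roughly this (a simplified sketch, not my exact code; the permit count of 10 is arbitrary):

```java
import java.util.concurrent.Semaphore;

import org.elasticsearch.action.ActionListener;
import org.elasticsearch.action.bulk.BulkRequestBuilder;
import org.elasticsearch.action.bulk.BulkResponse;
import org.elasticsearch.client.Client;

public class ThrottledBulkSender {
    private final Client client;
    // limit the number of bulk requests that may be in flight at once
    private final Semaphore inFlight = new Semaphore(10);

    public ThrottledBulkSender(Client client) {
        this.client = client;
    }

    public void send(BulkRequestBuilder bulk) throws InterruptedException {
        inFlight.acquire(); // blocks once too many requests are pending
        bulk.execute(new ActionListener<BulkResponse>() {
            @Override
            public void onResponse(BulkResponse response) {
                inFlight.release();
            }

            @Override
            public void onFailure(Throwable e) {
                inFlight.release();
            }
        });
    }
}
```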

Could this issue relate to a problem that I am seeing with the bulk request API?

Thanks,
Andrew


javanna commented Jun 13, 2014

Hi @awnixon ,
if you mean the semaphore within BulkProcessor, that's now released in a finally block. Which version of elasticsearch are you using, though? This reminds me of #4153, which might be your problem if you're using an old version of elasticsearch.
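
Roughly along these lines (a simplified, self-contained sketch of the concurrent execute path, not the literal BulkProcessor source):

```java
import java.util.concurrent.Semaphore;

import org.elasticsearch.action.ActionListener;
import org.elasticsearch.action.bulk.BulkProcessor;
import org.elasticsearch.action.bulk.BulkRequest;
import org.elasticsearch.action.bulk.BulkResponse;
import org.elasticsearch.client.Client;

// sketch only: illustrates how the semaphore and afterBulk are handled
class ConcurrentBulkHandlerSketch {
    private final Client client;
    private final BulkProcessor.Listener listener;
    private final Semaphore semaphore;

    ConcurrentBulkHandlerSketch(Client client, BulkProcessor.Listener listener, int concurrentRequests) {
        this.client = client;
        this.listener = listener;
        this.semaphore = new Semaphore(concurrentRequests);
    }

    void execute(final BulkRequest bulkRequest, final long executionId) {
        boolean bulkRequestSetupSuccessful = false;
        boolean acquired = false;
        try {
            listener.beforeBulk(executionId, bulkRequest);
            semaphore.acquire();
            acquired = true;
            client.bulk(bulkRequest, new ActionListener<BulkResponse>() {
                @Override
                public void onResponse(BulkResponse response) {
                    try {
                        listener.afterBulk(executionId, bulkRequest, response);
                    } finally {
                        semaphore.release();
                    }
                }

                @Override
                public void onFailure(Throwable e) {
                    try {
                        listener.afterBulk(executionId, bulkRequest, e);
                    } finally {
                        semaphore.release();
                    }
                }
            });
            bulkRequestSetupSuccessful = true;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            listener.afterBulk(executionId, bulkRequest, e);
        } catch (Throwable t) {
            // previously only InterruptedException was caught here, so e.g. a
            // NoNodeAvailableException thrown by client.bulk() escaped and
            // afterBulk was never invoked
            listener.afterBulk(executionId, bulkRequest, t);
        } finally {
            if (!bulkRequestSetupSuccessful && acquired) {
                // make sure the permit is not leaked when client.bulk() throws synchronously
                semaphore.release();
            }
        }
    }
}
```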

@telax1985

Hi @javanna,

Within my own client application I manage the number of bulk requests that may be sent using a semaphore, which is acquired before calling execute and released once a response is received.
The issue you referenced does sound similar to my own; however, I am using the 2.0.0-SNAPSHOT version (as of the 9th of June).

Also, I do not see any exceptions coming from the elasticsearch node, only the log statement that the amount of memory allocated to each shard is being increased, at which point the number of pending requests in my client application steadily increases without any response from the server. I am still able to execute curl -XGET requests (i.e. the elasticsearch cluster state is reported as green).

I hope that helps.

All the best,
Andrew


javanna commented Jun 13, 2014

Then the problem can't be in the BulkProcessor, as you are not using it. I'd suggest switching to it, though. There might be an issue in your client code that handles the semaphore?
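
For reference, something like this lets the BulkProcessor do the throttling for you instead of a hand-rolled semaphore (a sketch; the thresholds are just examples):

```java
import org.elasticsearch.action.bulk.BulkProcessor;
import org.elasticsearch.client.Client;
import org.elasticsearch.common.unit.ByteSizeUnit;
import org.elasticsearch.common.unit.ByteSizeValue;

public class BulkProcessorSetup {
    // builds a BulkProcessor that throttles concurrency itself, replacing a hand-rolled semaphore
    public static BulkProcessor build(Client client, BulkProcessor.Listener listener) {
        return BulkProcessor.builder(client, listener)
                .setConcurrentRequests(4)                           // at most 4 bulks in flight at once
                .setBulkActions(1000)                               // flush every 1000 actions...
                .setBulkSize(new ByteSizeValue(5, ByteSizeUnit.MB)) // ...or every 5 MB of request data
                .build();
    }
}
```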

javanna added a commit to javanna/elasticsearch that referenced this issue Jun 13, 2014
Also strengthened BulkProcessorTests by adding randomizations to existing tests and new tests for concurrent requests and exceptions

Closes elastic#5038
javanna added a commit that referenced this issue Jun 13, 2014
… beforeBulk

Moved BulkProcessor tests from BulkTests to newly added BulkProcessorTests class.
Strengthened BulkProcessorTests by adding randomizations to existing tests and new tests for concurrent requests and exceptions.
Also made sure that afterBulk is called only once per request if concurrentRequests==0.

Closes #5038
@telax1985

Thanks for the suggestion. I've switched to the BulkProcessor and found the situation to be the same when I set the level of concurrency to > 1.
I believe that I have found the root cause of my issue, however. Whilst using concurrent bulk requests I'm reaching (or exceeding) the specified "indices.memory.max_index_buffer_size" (which I've set to 256MB during testing). When I increase the "indices.memory.max_index_buffer_size" value, my issue no longer occurs. Would you expect elasticsearch to handle this situation by rejecting some or all pending requests, or should it return to the current set of pending requests once the "index_buffer_size" for each shard has been updated?


spinscale changed the title BulkProcessor does not call afterBulk when sending bulk fails with for example NoNodeAvailableException Bulk API: BulkProcessor does not call afterBulk when sending bulk fails with for example NoNodeAvailableException Jun 18, 2014
clintongormley changed the title Bulk API: BulkProcessor does not call afterBulk when sending bulk fails with for example NoNodeAvailableException Bulk API: BulkProcessor does not call afterBulk when bulk throws eg NoNodeAvailableException Jul 16, 2014
clintongormley changed the title Bulk API: BulkProcessor does not call afterBulk when bulk throws eg NoNodeAvailableException Java API: BulkProcessor does not call afterBulk when bulk throws eg NoNodeAvailableException Sep 10, 2014