LogstashTcpSocketAppender may lose events in its onEvent() method #329

brenuart · 2019-04-10T17:43:23Z

It seems the LogstashTcpSocketAppender may lose an event in its onEvent() method if it fails to send it 5 times in a row. It may, for instance, fail to send the event because the connection has been dropped by the remote peer or because the encoder failed to encode it.

In the case of a poison event (the encoder failed to process it), it is indeed safer to discard the event and proceed with the next one. In this case there is no need to retry 5 times: the event can be discarded immediately. And as a comment in the code says, there is no need to reopen the socket as it won't help in this case...

But when the failure is caused by a broken connection, I think the appender should not drop the event but should rather keep trying to send it until it succeeds. Think about a scenario where the remote peer drops the connection immediately after it is established. In this case, the onEvent() method will quickly iterate over the 5 allowed attempts and drop the message!
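
To make the scenario concrete, here is a minimal sketch of the two retry policies. It is not the appender's actual code: `Sender`, `FlakyConnection` and `RetryLoops` are hypothetical names made up for this illustration, and the "connection" is simulated by a counter that fails a fixed number of times before succeeding.

```java
import java.io.IOException;

/** Hypothetical stand-in for "write one event to the socket". */
interface Sender {
    void send(String event) throws IOException;
}

/** Simulated connection that fails a fixed number of times before succeeding. */
class FlakyConnection implements Sender {
    private int failuresLeft;
    int attempts;

    FlakyConnection(int failures) {
        this.failuresLeft = failures;
    }

    @Override
    public void send(String event) throws IOException {
        attempts++;
        if (failuresLeft > 0) {
            failuresLeft--;
            throw new IOException("connection reset by peer");
        }
    }
}

class RetryLoops {
    /** Bounded policy: gives up (the event is lost) after maxAttempts failures. */
    static boolean sendBounded(Sender sender, String event, int maxAttempts) {
        for (int i = 0; i < maxAttempts; i++) {
            try {
                sender.send(event);
                return true;
            } catch (IOException e) {
                // a real appender would reopen the connection here before retrying
            }
        }
        return false;
    }

    /** Unbounded policy: keeps retrying the same event until it is delivered. */
    static void sendUntilDelivered(Sender sender, String event) {
        while (true) {
            try {
                sender.send(event);
                return;
            } catch (IOException e) {
                // a real appender would reopen the connection here before retrying
            }
        }
    }
}
```

With a connection that fails 7 times in a row, `sendBounded(..., 5)` returns `false` after 5 attempts, while `sendUntilDelivered` gets the event through on the 8th. A real loop would presumably also need a way to stop when the appender is shut down.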

What do you think?

philsttr · 2019-04-12T18:51:06Z

Ok, so, we would need a way to categorize exceptions into two types:
A. exceptions where we should reopen the connection and re-send event
B. exceptions where we should drop the event (probably should still reopen the connection for future events in case the event was partially-sent, and the stream corrupted)

How do you propose categorizing these two exception types?

Retrying indefinitely will help in a short-term blip. By "short term" I mean an amount of time that is short enough that the ring buffer does not fill completely with log events. If the connection problems last long enough for the ring buffer to fill up completely, then new log events will be dropped when the appender tries to insert them into the ring buffer.

So, for long-term connection problems, changing the behavior won't really help, since events will get dropped anyway. It's just a matter of which events get dropped.

However, for short-term connection problems, changing the behavior will be an improvement.
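
A small sketch of the "events get dropped anyway" point. It uses a plain `ArrayBlockingQueue` as a stand-in for the appender's ring buffer (an assumption made only to keep the example self-contained), and `BoundedBufferDemo` is a hypothetical name. The consumer is assumed to be stalled, for example because it is stuck retrying on a broken connection.

```java
import java.util.concurrent.ArrayBlockingQueue;

class BoundedBufferDemo {
    /**
     * Offers `produced` events to a buffer of the given capacity while nothing
     * is being consumed, and returns how many events were rejected (dropped).
     */
    static int droppedWhileStalled(int capacity, int produced) {
        ArrayBlockingQueue<String> buffer = new ArrayBlockingQueue<>(capacity);
        int dropped = 0;
        for (int i = 0; i < produced; i++) {
            // offer() does not block: it returns false when the buffer is full
            if (!buffer.offer("event-" + i)) {
                dropped++;
            }
        }
        return dropped;
    }
}
```

As long as the outage is short enough that fewer events arrive than the buffer can hold, nothing is dropped. Once the buffer is full, every new event is rejected on the producer side, whatever the retry policy on the sending side.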

brenuart · 2019-04-16T12:18:08Z

I’m off for a week. I’ll come back to you in a few days...

brenuart · 2019-06-25T10:41:47Z

I'm back, was a very long week... ;-)

Here is an idea:

1. Wrap the socket output stream with a wrapper whose purpose is to detect whether a write operation threw an exception.
2. In the catch section of TcpSendingEventHandler, check whether the wrapper detected an exception. If it did, we know the problem is in the "network" layer and we trigger a reconnection. If it did not, then the exception was thrown by the encoder and there is not much we can do about it besides dropping the "poison" event.

We can then safely remove the MAX_REPEAT_WRITE_ATTEMPTS from the loop and be sure that nothing but poison events will be dropped on "this side" of the buffer.
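
Roughly, the idea could look like the sketch below. This is only an illustration under assumptions, not the code in the library: `FailureTrackingOutputStream`, `EventEncoder`, `Outcome` and `SendAttempt` are hypothetical names, and the encoder is reduced to a single method that writes an event to a stream.

```java
import java.io.FilterOutputStream;
import java.io.IOException;
import java.io.OutputStream;

/** Hypothetical wrapper that remembers whether the underlying stream ever failed. */
class FailureTrackingOutputStream extends FilterOutputStream {
    private boolean writeFailed;

    FailureTrackingOutputStream(OutputStream delegate) {
        super(delegate);
    }

    @Override
    public void write(int b) throws IOException {
        try {
            out.write(b);
        } catch (IOException e) {
            writeFailed = true;
            throw e;
        }
    }

    @Override
    public void write(byte[] b, int off, int len) throws IOException {
        try {
            out.write(b, off, len);
        } catch (IOException e) {
            writeFailed = true;
            throw e;
        }
    }

    @Override
    public void flush() throws IOException {
        try {
            out.flush();
        } catch (IOException e) {
            writeFailed = true;
            throw e;
        }
    }

    boolean hasWriteFailed() {
        return writeFailed;
    }
}

/** Hypothetical, simplified encoder contract used only for this sketch. */
interface EventEncoder {
    void encode(String event, OutputStream out) throws IOException;
}

enum Outcome { SENT, DROP_EVENT, RECONNECT_AND_RETRY }

class SendAttempt {
    /** Classifies a failure by asking the wrapper whether the stream itself failed. */
    static Outcome trySend(String event, EventEncoder encoder, OutputStream socketOut) {
        FailureTrackingOutputStream tracked = new FailureTrackingOutputStream(socketOut);
        try {
            encoder.encode(event, tracked);
            tracked.flush();
            return Outcome.SENT;
        } catch (Exception e) {
            // stream failure: network problem, keep the event and reconnect;
            // otherwise the encoder itself failed: poison event, drop it
            return tracked.hasWriteFailed() ? Outcome.RECONNECT_AND_RETRY : Outcome.DROP_EVENT;
        }
    }
}
```

One case the sketch does not handle: an encoder that fails after part of the event has already been written. As noted earlier in the thread, the stream may then be corrupted, so reopening the connection before sending the next event is probably still a good idea.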

brenuart · 2019-07-19T09:45:08Z

Want me to submit a PR with this approach?

philsttr · 2019-07-19T16:29:18Z

The approach seems reasonable to me. And yes, PRs are always welcome. :)

philsttr added the type/enhancement label Apr 12, 2019

brenuart mentioned this issue Sep 15, 2021

Drop event when Encoder fails to encode it before it becomes a "poison" event #649

Merged

brenuart linked a pull request Sep 15, 2021 that will close this issue

Drop event when Encoder fails to encode it before it becomes a "poison" event #649

Merged

brenuart added this to the 7.0 milestone Sep 17, 2021

brenuart closed this as completed in #649 Sep 30, 2021
