Logging: Log back pressure from Logstash #5205
We recently discussed this within the team and definitely want to make some improvements here. I suggest we also use this issue to brainstorm additional log entries that could be useful.
@elastic/beats Can you please add your own thoughts here?
I think having a histogram of the (max?) age of items in the publisher queue, over a few different time windows such as 1m, 5m, and 15m, would be helpful.
@andrewkroh So you would compare the current timestamp at send time to the timestamp in the event itself and take the max value? I like that :-)
Trying to diagnose a Beats 5.1.1 -> Logstash 5.6.8 issue, and having some sort of back-pressure log message would be useful here...
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Pinging @elastic/elastic-agent-data-plane (Team:Elastic-Agent-Data-Plane)
Backlog grooming: closing for now until the need pops up again.
We have seen situations where Logstash is not able to keep up downstream (e.g. the 5.4-5.5 performance regression bug in Logstash). In these situations, Filebeat can leave file handles open (unless close_timeout is used). For troubleshooting, it would be nice if the Beats log file gave some indication that Logstash is not keeping up, so users can quickly focus on the downstream components. @ph noted that Logstash sends keep-alive info to Beats, so we could potentially leverage that.