Logging: Log back pressure from Logstash #5205

Closed
ppf2 opened this issue Sep 19, 2017 · 7 comments

@ppf2
Member

ppf2 commented Sep 19, 2017

We have seen situations where Logstash, downstream of Beats, is not able to keep up (e.g. the 5.4-5.5 performance regression bug in Logstash). In these situations, Filebeat can leave file handles open (unless close_timeout is used). For troubleshooting, it would be nice to have some indication in the Beats log file that Logstash is not able to keep up, so users can quickly focus on downstream components. @ph noted that Logstash sends keep-alive info to Beats, so we can potentially leverage that.

@ruflin
Member

ruflin commented Oct 10, 2017

We recently had a discussion about this in the team and definitely want to make some improvements here. I suggest we also use this issue to brainstorm on what we could do to improve the logging.

Some additional log entries which could be useful:

  • Data on how many bytes Filebeat is behind on reading, per file and in total. Note: tracking how far behind it is in lines is trickier, as Filebeat doesn't know how many lines a file has before reading all of them
  • How many events in the queue / pipeline
  • Current window size used
  • Change in window size to LS compared to 30s ago

@elastic/beats Can you please add your own thoughts here?
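
To make the first bullet concrete, here is a minimal sketch of a "bytes behind" calculation. It is not Filebeat code: the function names and the `offsets` mapping (path to last read offset) are hypothetical, and it assumes the backlog of a file is simply its current size minus the offset already read.

```python
import os


def bytes_behind(size, read_offset):
    """Return how many bytes are still unread, given file size and read offset."""
    # A size below the offset suggests the file was truncated or rotated;
    # in this sketch the whole file then counts as unread.
    if size < read_offset:
        return size
    return size - read_offset


def backlog_report(offsets):
    """Return (per_file, total) backlog in bytes.

    `offsets` is a hypothetical mapping of file path -> last read offset.
    """
    per_file = {
        path: bytes_behind(os.stat(path).st_size, offset)
        for path, offset in offsets.items()
    }
    return per_file, sum(per_file.values())
```

Logging the total every 30s next to the per-file values would show whether the backlog is growing, which is the signal that something downstream is not keeping up.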

@andrewkroh
Member

I think having a histogram of the (max?) age of items in the publisher queue with some different time windows like 1m, 5m, 15m would be helpful.

@ruflin
Member

ruflin commented Oct 12, 2017

@andrewkroh So you would compare the current timestamp on sending to the timestamp in the event itself and get the max value? I like that :-)
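
A rough sketch of that idea, under some assumptions: it tracks only the maximum age per window instead of a full histogram, the class and method names are hypothetical and not part of Beats, timestamps are plain seconds, and `record` is called with non-decreasing send times.

```python
from collections import deque

# Time windows from the suggestion above, in seconds.
WINDOWS = {"1m": 60, "5m": 300, "15m": 900}


class QueueAgeTracker:
    """Tracks the max age of published events over several time windows."""

    def __init__(self):
        self._samples = deque()  # (sent_at, age) pairs, oldest first

    def record(self, event_ts, sent_at):
        """Record one event: age = send time minus the event's own timestamp."""
        self._samples.append((sent_at, max(0.0, sent_at - event_ts)))
        # Drop samples that are older than the largest window.
        horizon = sent_at - max(WINDOWS.values())
        while self._samples and self._samples[0][0] < horizon:
            self._samples.popleft()

    def max_age(self, now):
        """Return the max event age per window, 0.0 if the window is empty."""
        result = {}
        for name, span in WINDOWS.items():
            ages = [age for sent_at, age in self._samples if now - sent_at <= span]
            result[name] = max(ages, default=0.0)
        return result
```

A 1m value that keeps climbing toward the 15m value would suggest events are waiting longer and longer in the queue, i.e. back pressure from the output.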

@cdenneen

Trying to diagnose a beat 5.1.1 -> logstash 5.6.8 issue and having some sort of back pressure log message would be useful here...

@botelastic

botelastic bot commented Jul 9, 2020

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@botelastic botelastic bot added the Stalled and needs_team labels Jul 9, 2020
@botelastic botelastic bot closed this as completed Aug 8, 2020
@henrikno henrikno reopened this Mar 10, 2022
@botelastic botelastic bot removed the Stalled label Mar 10, 2022
@jsoriano jsoriano added the Team:Elastic-Agent-Data-Plane label Mar 15, 2022
@elasticmachine
Collaborator

Pinging @elastic/elastic-agent-data-plane (Team:Elastic-Agent-Data-Plane)

@botelastic botelastic bot removed the needs_team label Mar 15, 2022
@jlind23
Collaborator

jlind23 commented Mar 21, 2022

Backlog grooming: closing for now until the need pops up again.

@jlind23 jlind23 closed this as completed Mar 21, 2022