Provide APIs to monitor pipeline #2611

suyograo · 2015-02-13T23:45:36Z

Today, most Logstash monitoring functions are accomplished by tailing logs or outputting debug messages. Users typically send specially tagged tracer events to check the health of the system. These special events are also used to measure the latency of the pipeline. This is definitely not straightforward and it becomes hard to administer a large-scale Logstash cluster.

We plan to introduce a Logstash monitoring API endpoint, which will provide visibility into the pipeline. Some important metrics are:

health
number of events processed
latency metrics (average, percentile, etc)
size of the persistent queues (Provide option to have variable size internal queues which are persisted #2606)
number of errors/success

Medium term, we should provide plugin level granularity. For example, it would be great to know how long (on average) an event spends on grok filters, geo ip filters etc. This would help users drill in to the expensive parts of the pipeline.

Care should be taken to make sure metrics collection do not add additional stress on the pipeline and affect the latency and throughput of the events.

ph · 2015-02-27T20:41:56Z

This is certainly something I would like to improve, I was bite to debug an issue on a cluster and the tools aren't that great. I am pretty sure @abonuccelli-es would give us some awesome input :)

nellicus · 2015-04-07T17:50:29Z

@suyograo @ph sorry a bit late here
not sure if these were mentioned somewhere else however, couple of ideas:

have logstash to produce internal events providing metrics (e.g. regularly print every T seconds even if no actual events coming in SLC events sent/received, SLC events filtered,queue sizes etc. also some one off ones e.g Instance started/stopped, queue full, destination down).
having an array where each logstash instance appends its own ID (if anything like will exist?). imagine one event going through multi-logstash layer, this would help understanding through which logstash instances the event has gone through
for events coming via tcp/udp add a @source field where we stamp the sourceIP of the sender. a bit like 'path' when we read files

any other info at runtime via API of course great to have

KlavsKlavsen · 2015-04-30T13:37:29Z

I would definetely prefer to have logstash have the same kind of API that mysql, varnish and many more have.. where you connect to a management port - and get numbers out.
It's pretty cheap for logstash to just keep a memory segment for performance stats and update that, and then i can poll the counters at the interval I want to (every minute f.ex.) and input into my favourite monitoring stack (I use graphite) - to get graphs and to be able to do alerting based on a time-perspective.

An ability to have logstash simply just send performance counters to api's such as graphite etc. - would also be super cool.

simmel · 2015-05-21T09:26:02Z

To expose those numbers via JMX would be perfect!
Then we could just use jmxtrans-agent to output it to a console, file, graphite or statsd.

purbon · 2015-05-21T09:28:44Z

+1 on improving this capabilities of logstash. Monitoring it's things to be improved a lot nowadays. Using jmx might be a nice option that will enable other java components get data out of LS naturally.

m1k3ga · 2015-05-21T13:19:06Z

+1 ;)

ph · 2015-05-22T14:22:36Z

We need better introspection into what a filter worker is actually doing, we should be able to output which plugin and which configuration is actually running. see #3294 for a usecase that metrics and stats should help to solve.

ph · 2015-05-27T13:34:11Z

Another cool feature would be able to turn on a flag and be able to know which event is currently in transit in a specific plugin, this will help people to debug problems with blocking plugins.
A real world example is when a regex in a grok filter is blocking a thread with high cpu usage. See #3302

svenmueller · 2015-05-27T19:53:20Z

JVM metrics would be nice

suyograo · 2016-01-12T17:16:45Z

Implementation details are in #3908

jakauppila · 2016-03-15T19:02:09Z

+1

suyograo · 2016-05-09T20:21:17Z

Fixed in 5.0

suyograo added feature v2.0.0 roadmap labels Feb 13, 2015

suyograo mentioned this issue Feb 17, 2015

Add support for clustering Logstash instances #2632

Open

JPvRiel mentioned this issue Feb 27, 2015

Feature Request: Mechanism for monitoring LSF state via scout, nagios, etc elastic/logstash-forwarder#245

Closed

ph added this to the v2.0.0 milestone Feb 27, 2015

ph self-assigned this Feb 27, 2015

suyograo added enhancement and removed feature labels Apr 14, 2015

ph removed their assignment Apr 30, 2015

ph mentioned this issue May 22, 2015

How to collect JVM metrics of running logstash instance? #3294

Closed

suyograo added v2.0.0 and removed v2.0.0 labels Jun 18, 2015

suyograo removed this from the v2.0.0 milestone Jun 18, 2015

suyograo mentioned this issue Aug 7, 2015

Logstash monitoring tools #2462

Closed

ph self-assigned this Aug 25, 2015

This was referenced Aug 26, 2015

The pipeline should return the current active plugins #3799

Closed

Evaluate framework or routing strategy for the API endpoints #3802

Closed

Evaluate how we are exposing the api endpoints #3801

Closed

Remove Watchdog in the code base #3828

Closed

suyograo added v2.1.0 and removed v2.0.0 labels Sep 1, 2015

ph mentioned this issue Sep 9, 2015

high level metrics need #3889

Closed

suyograo added v5.0.0 and removed v2.1.0 labels Oct 20, 2015

suyograo added monitoring and removed manageability labels Jan 12, 2016

ph mentioned this issue Jan 13, 2016

Expose metrics in Logstash pipeline #3908

Closed

24 tasks

suyograo closed this as completed May 9, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Provide APIs to monitor pipeline #2611

Provide APIs to monitor pipeline #2611

suyograo commented Feb 13, 2015

ph commented Feb 27, 2015

nellicus commented Apr 7, 2015

KlavsKlavsen commented Apr 30, 2015

simmel commented May 21, 2015

purbon commented May 21, 2015

m1k3ga commented May 21, 2015

ph commented May 22, 2015

ph commented May 27, 2015

svenmueller commented May 27, 2015

suyograo commented Jan 12, 2016

jakauppila commented Mar 15, 2016

suyograo commented May 9, 2016

Provide APIs to monitor pipeline #2611

Provide APIs to monitor pipeline #2611

Comments

suyograo commented Feb 13, 2015

ph commented Feb 27, 2015

nellicus commented Apr 7, 2015

KlavsKlavsen commented Apr 30, 2015

simmel commented May 21, 2015

purbon commented May 21, 2015

m1k3ga commented May 21, 2015

ph commented May 22, 2015

ph commented May 27, 2015

svenmueller commented May 27, 2015

suyograo commented Jan 12, 2016

jakauppila commented Mar 15, 2016

suyograo commented May 9, 2016