wip: expose consumer group lag as prometheus metrics #149

JamieAP · 2016-11-01T15:10:05Z

No description provided.

baakind · 2017-12-07T06:38:54Z

Is there a reason why this PR died? We are currently using Burrow to monitor lag and partition-status, and we do want to monitor this using Prometheus. I can see that it is a bit outdated, but I'm curious if there is a reason why it just died. @toddpalino @JamieAP

toddpalino · 2017-12-12T20:16:03Z

Not sure why this dropped out, but it probably needs a do-over for 1.0 anyways. The one caveat I would say on that is that if the write to prometheus is straight HTTP, it should probably be implemented using the HTTP notifier and an example template and config provided (which can be added to the documentation).

We do something like this internally with metrics, but I haven't published a sample template yet.

varun06 · 2017-12-12T23:44:15Z

Yeah, I am also looking for prom metrics in burrow.

daodennis-zz · 2017-12-13T00:22:18Z

There is some existing work too that maybe we can borrow for Burrow in the burrow_exporter

Other projects like Kubernetes, etcd, and Docker utilize Prometheus instrumentation. Also, organizations like Cloudflare who use both Kafka and Prometheus like ours are not an isolated intersection. There is a formidable and active operator community behind Prometheus as well.
Anyway...

jirwin's exporter has these metrics:

KafkaConsumerPartitionCurrentOffset
KafkaConsumerPartitionMaxOffset
KafkaConsumerTotalLag

Also, it would be nice to expose operational metrics, most places where there's a logging statement could use a metric. @JamieAP are you planning on shoring this PR up for 1.0 by chance?

varun06 · 2017-12-27T16:20:34Z

Don't want to ruin holiday week, but if any help needed for this PR, I can find some time. Will really appreciate if this get merged sooner than later.

JamieAP · 2018-01-02T12:30:57Z

Sorry for the delay. I abandoned this (and forgot to tidy the PR) in favour of building something slightly more specific to my use case. I'm unlikely to have any time to work on this PR in the next couple of weeks so please feel free to pick it up.

@toddpalino Prometheus uses a pull model. The prometheus backend makes a request to an endpoint to fetch metrics rather than a service making a request to a prometheus endpoint to write them. The HTTP notifier would be no help here.

JamieAP added 11 commits November 1, 2016 15:08

wip: expose consumer group lag as prometheus metrics

83d72d8

add prometheus dependency

792ad11

add created gauges to metrics map

1f5c625

update metrics endpoint

34764a5

update deps

2bd398e

update Dockerfile

5f312ad

update config

e9ce82f

go fmt

ca4b7f7

add circle.yml manifest

8f9f545

use dev env in repo config

fd2b8b1

fix log config path

39b6d4e

JamieAP closed this Jan 2, 2018

varun06 mentioned this pull request Jan 2, 2018

Feature request - Prometheus metrics support #318

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

wip: expose consumer group lag as prometheus metrics #149

wip: expose consumer group lag as prometheus metrics #149

JamieAP commented Nov 1, 2016

baakind commented Dec 7, 2017 •

edited

toddpalino commented Dec 12, 2017

varun06 commented Dec 12, 2017

daodennis-zz commented Dec 13, 2017

varun06 commented Dec 27, 2017

JamieAP commented Jan 2, 2018

wip: expose consumer group lag as prometheus metrics #149

wip: expose consumer group lag as prometheus metrics #149

Conversation

JamieAP commented Nov 1, 2016

baakind commented Dec 7, 2017 • edited

toddpalino commented Dec 12, 2017

varun06 commented Dec 12, 2017

daodennis-zz commented Dec 13, 2017

varun06 commented Dec 27, 2017

JamieAP commented Jan 2, 2018

baakind commented Dec 7, 2017 •

edited