
Expose http endpoint details #63

Closed
hynek opened this issue Oct 22, 2015 · 10 comments

@hynek
Contributor

hynek commented Oct 22, 2015

I’m building an infrastructure that uses service discovery to find metrics endpoints. Now, I find it very tedious to set ports by hand, so I prefer to listen on port 0 and then use service discovery to tell Prometheus about it:

>>> from BaseHTTPServer import HTTPServer  # Python 2; http.server in Python 3
>>> h = HTTPServer(("", 0), None)  # port 0 lets the OS assign a free port
>>> h.server_port
57782
>>> h.server_name
'alpha-2.local'

Currently there’s no way to achieve that using client_python. It would be super helpful if you’d expose httpd from https://github.com/prometheus/client_python/blob/master/prometheus_client/exposition.py#L64.

As a bonus, this also solves the multiple-processes problem from #30 in, I find, a more Prometheus-like way: just expose multiple metrics endpoints and let Prometheus figure it out.

@brian-brazil
Contributor

I find it very tedious to set ports by hand so I prefer to listen on port 0 and then use service discovery

From a manageability standpoint, I'd advise against this. You want your cluster manager to be assigning ports so that it can do healthchecking, graceful shutdown, and, longer term, network QoS based on port numbers.

It would be super helpful if you’d expose httpd

I'm unsure whether I should support this; you can always implement your own variant of start_http_server. These are meant as starting points for the most common use cases.
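Such a variant can be rolled by hand with only the standard library. The sketch below is illustrative, not client_python's API: `start_metrics_server` and `_MetricsHandler` are hypothetical names, and the handler serves a placeholder body where a real variant would render the registry (e.g. via prometheus_client's exposition helpers). The point is that returning the HTTPServer lets the caller read the OS-assigned port after binding to port 0:

```python
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer

class _MetricsHandler(BaseHTTPRequestHandler):
    """Placeholder handler; a real variant would render the metrics
    registry here instead of a canned body."""

    def do_GET(self):
        body = b"# placeholder metrics payload\n"
        self.send_response(200)
        self.send_header("Content-Type", "text/plain; version=0.0.4")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, fmt, *args):
        pass  # keep the demo quiet

def start_metrics_server(port=0, addr="127.0.0.1"):
    """Like start_http_server, but returns the HTTPServer so the caller
    can read the OS-assigned port when binding to port 0."""
    httpd = HTTPServer((addr, port), _MetricsHandler)
    threading.Thread(target=httpd.serve_forever, daemon=True).start()
    return httpd

server = start_metrics_server()
print(server.server_port)  # OS-assigned, e.g. 57782; hand this to service discovery
server.shutdown()
```

The returned port number is what would then be registered with a service-discovery system so Prometheus can find the endpoint.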

As a bonus point, this also solves the multiple processes problem from #30 in a more prometheus-like way I find: just expose multiple metrics and let prometheus figure it out.

This isn't sufficient on its own to solve that. Consider a gauge whose role is "last time X happened": when a process dies, that value could go backwards.
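That failure mode can be simulated with plain Python (worker names and timestamps are made up for illustration):

```python
# Each worker keeps "last time X happened" in process memory, as a
# per-process gauge would.
last_event = {"worker_1": 0.0, "worker_2": 0.0}
last_event["worker_1"] = 100.0  # worker 1 sees an event at t=100
last_event["worker_2"] = 250.0  # worker 2 sees a later event at t=250
assert max(last_event.values()) == 250.0  # the aggregate looks correct

# Worker 2 restarts and its in-memory state is lost:
last_event["worker_2"] = 0.0
assert max(last_event.values()) == 100.0  # "last event" went backwards
```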

@hynek
Contributor Author

hynek commented Oct 22, 2015

From a manageability standpoint, I'd advise against this. You want your cluster manager to be assigning ports so that it can do healthchecking, graceful shutdown, and, longer term, network QoS based on port numbers.

Agreed; I’m gonna claim that most of your users don’t have a cluster manager. :) Also, maybe I misunderstand, but I’m solely talking about the metrics endpoint, not the application’s service port.

I'm unsure whether I should support this; you can always implement your own variant of start_http_server. These are meant as starting points for the most common use cases.

Fair enough. I’m closing then.

This isn't sufficient on its own to solve that. Consider a gauge whose role is "last time X happened": when a process dies, that value could go backwards.

True, but wouldn’t that be something to solve using max() or something similar? Sorry, I’m still at my Prometheus beginnings. :)

@hynek hynek closed this as completed Oct 22, 2015
@brian-brazil
Contributor

True, but wouldn’t that be something to solve using max() or something similar? Sorry, I’m still at my Prometheus beginnings. :)

The problem is that once the process isn't there anymore, Prometheus will stop scraping it and the value it had will go stale, so the max won't see it.

@hynek
Contributor Author

hynek commented Oct 22, 2015

That depends on the approach, though, right?

Because the way I do it is to use Consul and register the workers’ metrics endpoints with a service_id like metrics.dyndns.1, with 1 being uwsgi.get_worker_id(). Then I use relabeling to get job_id=dyndns, worker_id=1.

Therefore, unless I scale down, there are always metrics with those worker ids.
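That registration step can be sketched against Consul's HTTP API. The helper names, agent address, and port below are illustrative assumptions; the payload shape follows Consul's /v1/agent/service/register endpoint:

```python
import json
import urllib.request

def consul_registration(service_name, worker_id, port):
    """Build the JSON body for Consul's agent service-register endpoint,
    using the naming scheme described above (service_name.worker_id)."""
    return {
        "Name": service_name,
        "ID": "%s.%s" % (service_name, worker_id),
        "Port": port,
    }

def register_with_consul(payload, agent="http://127.0.0.1:8500"):
    """PUT the payload to a local Consul agent (not executed here)."""
    req = urllib.request.Request(
        agent + "/v1/agent/service/register",
        data=json.dumps(payload).encode(),
        method="PUT",
    )
    urllib.request.urlopen(req)

payload = consul_registration("metrics.dyndns", 1, 57782)
# Prometheus relabeling can then turn the service ID "metrics.dyndns.1"
# into job_id=dyndns, worker_id=1 labels.
```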

Am I missing something?

@brian-brazil
Contributor

The problem is the scaling down, or if any worker gets restarted.

@hynek
Contributor Author

hynek commented Oct 22, 2015

Hm, how does restarting affect anything, assuming they re-register on start?

@brian-brazil
Contributor

They no longer have the latest time whatever event happened, as that's all stored in memory.

The challenge is to make a unicorned app have the same semantics, metric-wise, as a threaded one.

@hynek
Contributor Author

hynek commented Oct 22, 2015

Ah, I get it now. For them, the last moment it happened is “never” after start. Still feels like something that could be figured out within Prometheus (dropping 0 or something?).

Anyhow, thanks for your time!

@brian-brazil
Contributor

Still feels like something that could be figured out within Prometheus (dropping 0 or something?).

It's possible, but it's heading very much into advanced topics that something simple like this shouldn't require.

@hynek
Contributor Author

hynek commented Oct 22, 2015

Yeah totally. I’m just in love with Prometheus’ philosophy of keeping the client side as simple as possible so I’m exploring my options.
