Lack of process_* metrics #117

hynek · 2016-11-12T07:29:15Z

Recently I’ve run into the problem that my processes don't have any process_ metrics although the platform (Ubuntu Trusty) hasn’t changed.

I’m not sure how to debug the problem. Do you have any idea what could lead to it? The processes are not using the new multiprocessing feature.

brian-brazil · 2016-11-12T09:27:38Z

Are you doing anything that'd make /proc inaccessible?

hynek · 2016-11-12T09:37:20Z

Not that I’d know of.

They’re run using runit and inside envconsul. The latter is the only recent change I can think of. Other than that, it’s regular processes in an LXC container with a distinct user.

Are you saying that /proc problems are the only possible reason why those metrics might be missing?

brian-brazil · 2016-11-12T09:42:05Z

With the current implementation, yes. We read from /proc/self/stat.
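For anyone debugging this, here is a minimal sketch of pulling the process-related fields out of `/proc/self/stat`. It is not the library's actual code. The field positions follow the proc(5) man page, and the function names and dictionary keys are made up for illustration.

```python
def parse_proc_stat(raw: bytes) -> dict:
    """Extract a few fields from the contents of /proc/<pid>/stat.

    Field 2 (comm) is wrapped in parentheses and may contain spaces,
    so split after the last ')' and index the remaining fields.
    The remainder starts at field 3, so field n sits at index n - 3.
    """
    rest = raw.rpartition(b")")[2].split()
    return {
        "utime_ticks": int(rest[11]),      # field 14: user-mode CPU time, clock ticks
        "stime_ticks": int(rest[12]),      # field 15: kernel-mode CPU time, clock ticks
        "starttime_ticks": int(rest[19]),  # field 22: start time after boot, clock ticks
        "vsize_bytes": int(rest[20]),      # field 23: virtual memory size, bytes
        "rss_pages": int(rest[21]),        # field 24: resident set size, pages
    }


def read_self_stat(path="/proc/self/stat"):
    """Return the parsed fields, or None if the file is missing or malformed."""
    try:
        with open(path, "rb") as f:
            return parse_proc_stat(f.read())
    except (OSError, IndexError, ValueError):
        return None
```

If `read_self_stat()` returns `None` inside the affected process, `/proc/self/stat` itself is the problem. If it returns sensible values, the fault is more likely elsewhere under `/proc`.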

hynek · 2016-11-12T09:58:57Z

I've added a

with open("/proc/self/stat", "rb") as f:
    print(f.read())

block to the app's startup and before calling generate_latest(core.REGISTRY) and in both cases I get a legit-looking output back.

I suppose it's too late at that point anyway, because something goes wrong while registering metrics?

hynek · 2016-11-12T10:04:12Z

OK it was /proc but not the self thing (free -m stopped working too). Seems like procfs crashed and I needed to reboot the container.

It may be nice if the docs mentioned this error case? Or a log message a la “could not access XYZ, process metrics won't be collected”?
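In the meantime, an application can run a check like this itself at startup. This is a rough sketch under assumptions: the default list of paths is my guess at what is worth probing (`free` reads `/proc/meminfo`), and the function names and logger name are hypothetical, not part of the client library.

```python
import logging

log = logging.getLogger("procfs_check")  # hypothetical logger name


def unreadable_procfs_paths(paths=("/proc/self/stat", "/proc/stat", "/proc/meminfo")):
    """Return the subset of paths that cannot be opened or that read back empty."""
    bad = []
    for path in paths:
        try:
            with open(path, "rb") as f:
                if not f.read(1):
                    bad.append(path)
        except OSError:
            bad.append(path)
    return bad


def warn_if_procfs_broken(paths=("/proc/self/stat", "/proc/stat", "/proc/meminfo")):
    """Log a single warning listing the unreadable paths; return True if all are fine."""
    bad = unreadable_procfs_paths(paths)
    if bad:
        log.warning(
            "could not access %s, process metrics may not be collected",
            ", ".join(bad),
        )
    return not bad
```

Calling `warn_if_procfs_broken()` once at startup logs at most one line, so it avoids the per-scrape log spam a check inside the collector could cause.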

brian-brazil · 2016-11-13T12:03:10Z

I don't think we need to document that if the kernel is hosed, things will break. We don't log as that could get spammy in other circumstances.

hynek · 2016-11-13T12:16:11Z

OK. Maybe if someone runs into it, they find this ticket. :)

hynek closed this as completed Nov 13, 2016
