Join GitHub today
GitHub is home to over 31 million developers working together to host and review code, manage projects, and build software together.
Sign upPrometheus hangs and generates too much ps processes #4938
Comments
Hacky-DH
changed the title
Prometheus hangs and generates many ps processes
Prometheus hangs and generates too much ps processes
Dec 1, 2018
This comment has been minimized.
This comment has been minimized.
hoffie
commented
Dec 1, 2018
•
|
Is this system still hanging? Can you please provide some more context like distribution, kernel version, Prometheus version, Regarding the generation of ps processes: I don't think prometheus ever starts such processes, so would rather suspect a human or some other process to execute them. I recently had a similar case where all tools which accessed a specific pid directory in /proc got into a hung state (including process_exporter, ps, top, cat /proc/SOMEPID/cmdline). In my experience, the high load in such cases is caused by the kernel task queue which will only grow due to the blocking. I would also expect to see hung task messages in Don't know what the maintainers think, but I would assume that this is no issue which can be caused by userspace (i.e. Prometheus) alone and you may have to get some help from your OS vendor instead. |
This comment has been minimized.
This comment has been minimized.
|
Thanks for your reply.
The system halt, and not boot so far. dmesg has't any information from Prometheus. And there is no crash messages. we suspect that Prometheus generates many ps processes because all the ps processes open a file fd=6 The other information is that Prometheus use a storage volume in k8s. Prometheus probably stunk when volume is unavailable. do you have any ideas? |
This comment has been minimized.
This comment has been minimized.
|
Huh, I would wonder if kubernetes could be spawning those. |
simonpasquier
added
the
kind/more-info-needed
label
Dec 3, 2018
This comment has been minimized.
This comment has been minimized.
|
The reason is that storage volume in k8s is unavailable, all ops access volume will hang. |
Hacky-DH commentedDec 1, 2018
•
edited
The Prometheus main process hangs
kill -9 not work!!!
and it generates many ps processes, on my host has 5912 ps processes, the process stat is D, and also can't kill by kill -9
this cause high load
how to fix this? and is there a bug in Prometheus ?