Debugging cause of high CPU usage queries #4923
Comments
jaredeis commented Dec 12, 2018
I'm new to supporting Prometheus and this is one of my concerns. There's a fine line between scaling and correct usage, and if there was a way to determine what's hammering the CPU it would help. We have quite a few people on-boarding into our EKS architecture, which includes istio, and we have a ton of metrics now. It would be great to know if some badly formed query on a dashboard somewhere is causing the hurt.
fanhaozzu commented Mar 1, 2019
Same concerns and problem
fanhaozzu commented Mar 1, 2019
Same concerns and problems
We mitigated the issue by setting a very strict limit for max samples. Some dashboards broke because of the limit, but they were probably also causing the high CPU usage, as we no longer have these issues. Still, I think there should be a better way to profile CPU usage.
I'm closing it for now. If you have further questions, please use our user mailing list, which you can also search.
simonpasquier closed this Mar 4, 2019
@simonpasquier I believe this issue should not be closed because the issue itself exists. I just found a workaround without knowing if this was actually the cause. Others may still have issues with other root causes.
#1315 already exists for a similar reason.
weeco commented Nov 27, 2018
I am reposting my question about debugging high CPU usage queries here, since I haven't received any responses or ideas on Stack Overflow, nor from colleagues who also run Prometheus clusters. I want to figure out how I could log Prometheus queries which cause high CPU usage.
The use case and more details are given in this Stack Overflow post: https://stackoverflow.com/questions/53432660/figuring-out-high-cpu-usage-queries-in-prometheus
I hope someone can help with some ideas. I saw an issue about debugging slow queries, but I am afraid it is not going to help me in the near future because it has been open for nearly 3 years now: #1315.
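
To make the request concrete, here is a minimal sketch of the kind of report that would help, assuming some per-query log exists as JSON lines with the query text and its evaluation time. The record layout (`params.query`, `stats.timings.evalTotalTime`), the function name `rank_queries`, and the sample records are all assumptions for illustration, not something this thread confirms is available in the Prometheus versions discussed here.

```python
import json
from collections import defaultdict


def rank_queries(log_lines, top_n=5):
    """Sum evaluation time per query string and return the costliest ones."""
    totals = defaultdict(float)
    for line in log_lines:
        line = line.strip()
        if not line:
            continue
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip lines that are not valid JSON
        if not isinstance(record, dict):
            continue
        # Assumed record layout; adjust the keys to the log you actually have.
        query = record.get("params", {}).get("query")
        seconds = record.get("stats", {}).get("timings", {}).get("evalTotalTime")
        if query is None or seconds is None:
            continue
        totals[query] += float(seconds)
    # Highest accumulated evaluation time first.
    ranked = sorted(totals.items(), key=lambda item: item[1], reverse=True)
    return ranked[:top_n]


def make_record(query, seconds):
    """Build one made-up log line in the assumed layout."""
    return json.dumps(
        {"params": {"query": query}, "stats": {"timings": {"evalTotalTime": seconds}}}
    )


# Made-up sample data, not taken from a real server.
SAMPLE_LOG = [
    make_record("sum(rate(http_requests_total[5m]))", 1.5),
    make_record("up == 0", 0.25),
    make_record("sum(rate(http_requests_total[5m]))", 2.0),
]

if __name__ == "__main__":
    for query, seconds in rank_queries(SAMPLE_LOG):
        print(f"{seconds:8.3f}s  {query}")
```

Evaluation time is only a proxy for CPU usage, so a ranking like this would narrow down suspect dashboards rather than prove which query is responsible.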