Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Document memory planning guidelines #5469

Open
itaysk opened this Issue Apr 16, 2019 · 0 comments

Comments

Projects
None yet
1 participant
@itaysk
Copy link

itaysk commented Apr 16, 2019

Proposal

I would like to ask the prometheus dev community to consider adding some documentation around estimating the memory requirements of prometheus.
The fact that prometheus is mostly being run in memory capped containers and is OOMkilled when reaching the mem limit is surprising many users who are accustomed to robust software that degrades instead of dying.

I've been running prometheus in production for a while which is occasionally OOMKilled. I have expanded the memory limits more than once to arbitrary limits but I would prefer to make an educated estimation for a reasonable limit given my workload.
I have read the TSDB design paper which was super interesting but I don't feel that it gave me a practical hint to my problem, and even if I did, I think this is something the average user is far away from doing, and I'd consider this a development task rather then a usage question.
Since being in this state I have also learned that many others are in the same situation and are just "living with it", which is unfortunate.

Can we please discuss this topic, and possibly document our conclusions in the readme?
(I understand that planning for queries is harder, so we can stick to planning for metrics ingestion only which is supposed to be predictable)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
You can’t perform that action at this time.