Join GitHub today
GitHub is home to over 31 million developers working together to host and review code, manage projects, and build software together.
Sign upOut of memory: Kill process 24355 (prometheus) score 945 or sacrifice child #2525
Comments
This comment has been minimized.
This comment has been minimized.
|
Your server has a lot of metrics. The logs are truncated, so I cannot see how many series you have currently active in memory, but 12630000 archived metrics is a lot. They all have to be indexed, and you OOM half way through that. That's mostly a LevelDB problem, in my experience. If you have to index a lot in a short amount of time, it takes a ginormous amount of RAM. You can try to GC more aggressively by setting the
But even if you make it through crash recovery, your server might need way more RAM to cope with your number of time series. (Earlier in the logs, you can see how many in-memory time series you have. With 32GiB of RAM, I would not go beyond 2–3 million time series in memory. And even then, you need to tweak your flags. With your current flags, you should not have more than 1M time series for smooth operation.) |
This comment has been minimized.
This comment has been minimized.
|
I'm closing this as it doesn't appear to be a bug but the expected behavior. Should you need more support, please ask on the prometheus-users mailinglist, where more people are available to help and other users can benefit from the answers. |
beorn7
closed this
Mar 26, 2017
This comment has been minimized.
This comment has been minimized.
|
Thanks @beorn7. I'll study a better way to reduce the number of time series. |
This comment has been minimized.
This comment has been minimized.
lock
bot
commented
Mar 23, 2019
|
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs. |
robsonpeixoto commentedMar 25, 2017
I have a single Prometheus server that scrape 366 targets. It be in OOM and never recover from
crashrecovery.I tried to get run the
pproftool but the process die before open the port9090.Environment
Dmesg
Command
System information:
More detail
Prometheus version:
Using the version 1.5.2 + patch cc3e859
built with the command:
Prometheus configuration file:
Logs