Prometheus OOM crash upon restart #3559
Comments
If you crashed initially due to an OOM, it's not surprising that the same issue recurs when you restart. I'd suggest checking what queries are running against Prometheus; one of them might be using a lot of RAM.
brian-brazil added the kind/more-info-needed label Dec 8, 2017
@brian-brazil There were no queries running on Prometheus at the time.
It seems to get to the scraping phase, so you're probably pulling in too much data. Try a machine with more RAM.
We've done that, thanks.
christinehylin closed this Dec 8, 2017
@brian-brazil
christinehylin reopened this Dec 14, 2017
That, and there's scraping goroutines. Do you have additional information that indicates this is a bug?
No, the behavior seems to be in line with what you suggest. However, is it sometimes expected that Prometheus runs out of memory on restart even when it didn't crash beforehand? I was restarting Prometheus to reload configs; it was fine prior to the restart, but then it refused to start up.
That can happen depending on query load. You don't need to restart Prometheus for a new config; you can send it a SIGHUP instead.
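As a rough illustration of the SIGHUP reload mentioned above, here is a minimal Python sketch. It assumes a POSIX system and that you already know the PID of the running Prometheus process; the helper name `reload_config` is hypothetical and not part of any Prometheus tooling.

```python
import os
import signal


def reload_config(pid: int) -> None:
    """Send SIGHUP to the process with the given PID.

    Hypothetical helper for illustration only. Prometheus re-reads its
    configuration file when it receives SIGHUP, so no restart is needed.
    Raises ProcessLookupError if no process with that PID exists.
    """
    os.kill(pid, signal.SIGHUP)
```

Calling `reload_config(pid)` with the actual Prometheus PID (however you look it up on your system) would request a config reload while the process keeps running.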
I learned that after the first time I tried restarting. We ended up reducing scrape targets to get Prometheus back up; there were no queries at the time, though.
@christinehylin are you still getting this issue? If yes, have you tried upgrading Prometheus to the latest stable version (v2.3.2)?
@simonpasquier I think we're good now.
simonpasquier closed this Jul 25, 2018
lock bot commented Mar 22, 2019
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.
christinehylin commented Dec 7, 2017
What did you do?
Started prometheus service after service stopped due to OOM.
What did you expect to see?
Prometheus starting normally.
What did you see instead? Under which circumstances?
OOM crash due to old memory issue before crash.
Environment
Linux
System information:
Linux 4.4.0-101-generic x86_64
Prometheus version:
prometheus, version 2.0.0 (branch: HEAD, revision: 0a74f98)
build user: root@615b82cb36b6
build date: 20171108-07:11:59
go version: go1.9.2
Prometheus is running though
Alertmanager version:
insert output of alertmanager --version here (if relevant to the issue)
Prometheus configuration file:
prometheus_crash.txt