Join GitHub today
GitHub is home to over 31 million developers working together to host and review code, manage projects, and build software together.
Sign upPrometheus v2.2.0 deadlocked loading targets page #3940
Comments
This comment has been minimized.
This comment has been minimized.
|
Looks like its blocked here: https://github.com/prometheus/prometheus/blob/master/scrape/manager.go#L120 |
This comment has been minimized.
This comment has been minimized.
|
@krasi-georgiev that has good odds of being a regression from the scrape refactoring. Over the years we had more deadlocks caused by loading target page than I can count – and they all came somewhere from there :) |
This comment has been minimized.
This comment has been minimized.
|
Sorry I didn't add more background:
|
This comment has been minimized.
This comment has been minimized.
|
I will have a look when I get bored poking with tsdb |
This comment has been minimized.
This comment has been minimized.
|
There are a bunch of goroutines all blocked at roughly the same point - goroutine 1114, 1293, 1118. Goroutine 1118 is holding m.mtx write lock, and blocked on the getting the scrape pool read lock. The scrape pool write lock is being held by scrapePool.Sync (goroutine 183) which is waiting for scrapers to stop. The scrapers in turn are blocked on remote write, which is blocked on resharding, which is blocked on the remote write bug I was trying to fix (#3809). |
This comment has been minimized.
This comment has been minimized.
|
As 3809 is fixed, looks like nothing to fix here. |
tomwilkie
closed this
Mar 13, 2018
This comment has been minimized.
This comment has been minimized.
lock
bot
commented
Mar 22, 2019
|
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs. |
tomwilkie commentedMar 9, 2018
Heres the stack trace: https://gist.github.com/tomwilkie/af4ffa4c8f50f723b55f8d4fd82f7c9c
I'm diagnosing...