Reproducible machine lockup with BTRFS backend on OOM kill #15654
|
Hi! Please read this important information about creating issues.
If you are reporting a new issue, make sure that we do not have any duplicates already open. You can ensure this by searching the issue list for this repository. If there is a duplicate, please close your issue and add a comment to the existing issue instead.
If you suspect your issue is a bug, please edit your issue description to include the BUG REPORT INFORMATION shown below. If you fail to provide this information within 7 days, we cannot debug your issue and will close it. We will, however, reopen it if you later provide the information.
This is an automated, informational response. Thank you.
For more information about reporting issues, see https://github.com/docker/docker/blob/master/CONTRIBUTING.md#reporting-other-issues
BUG REPORT INFORMATION
Use the commands below to provide key information from your environment:
Provide additional environment details (AWS, VirtualBox, physical, etc.):
List the steps to reproduce the issue:
Describe the results you received:
Describe the results you expected:
Provide additional info you think is important:
----------END REPORT ---------
#ENEEDMOREINFO |
|
So was the machine completely dead, or did it recover? |
|
It did not recover at all; I had to hard reboot the server. Docker was completely unresponsive, and some of the containers running on it were completely frozen/IO blocked. |
|
I have some news about that: I think I've identified how such a lock happened in my use case.
|
|
OK, I eventually managed to reproduce exactly the same situation. The image I was using was
OOM logs: http://pastebin.com/Lb4Gybay
Here are all the logs, including a final Sysrq-w at the end. From this point on, Docker was not working anymore; the BTRFS driver was "broken".
Syslog + Sysrq-w logs: http://pastebin.com/qd1yB8u2
I'll post on the BTRFS mailing list. If you have any ideas, I'll take them! |
|
We just ran into this in a rather harsh way, with almost the exact same scenario. Any movement? |
|
We were experiencing the same issue (high io, btrfs volume unresponsive, server completely frozen) and we finally switched to devicemapper (direct-lvm) to address this problem. |
|
This is old and has not happened for a long time. I suppose a Linux update/Docker update did the trick. |
It seems related to btrfs, according to the stacks I can find in /var/log/syslog.
The whole machine was impacted; I'm looking for hints about how to avoid this. Thanks
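For anyone trying to find those stacks in the log, here is a minimal sketch that lists the kernel's hung-task warnings ("INFO: task ... blocked for more than N seconds"), which normally sit right above each blocked-task stack trace. The function name `find_hung_tasks` is made up for illustration, and the sketch assumes the standard hung-task message format; it does not parse the stack traces themselves.

```python
import re

# Kernel hung-task warning, e.g.
# "INFO: task btrfs-transacti:1234 blocked for more than 120 seconds."
# The task name may itself contain ':' (e.g. "kworker/u16:2"), so the
# greedy ".+" backtracks to the last ":<pid>" before " blocked".
HUNG_TASK = re.compile(
    r"INFO: task (?P<comm>.+):(?P<pid>\d+) "
    r"blocked for more than (?P<secs>\d+) seconds"
)


def find_hung_tasks(lines):
    """Return (task_name, pid, seconds) for each hung-task warning in lines.

    `lines` is any iterable of log lines (a list or an open file).
    Hypothetical helper, not part of Docker or btrfs tooling.
    """
    hits = []
    for line in lines:
        m = HUNG_TASK.search(line)
        if m:
            hits.append((m.group("comm"), int(m.group("pid")), int(m.group("secs"))))
    return hits
```

To run it against the log from this report, something like `find_hung_tasks(open("/var/log/syslog", errors="replace"))` should work, possibly as root depending on the log's permissions.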