
OOM kill on pip install #1022

Closed
ezyang opened this issue Mar 17, 2017 · 12 comments

Comments

@ezyang
Contributor

ezyang commented Mar 17, 2017

When I attempt to install Torch with pip, the process gets OOM killed:

ezyang@sabre:~/Dev$ pip install https://download.pytorch.org/whl/cu75/torch-0.1.10.post2-cp27-none-linux_x86_64.whl 
Collecting torch==0.1.10.post2 from https://download.pytorch.org/whl/cu75/torch-0.1.10.post2-cp27-none-linux_x86_64.whl
  Downloading https://download.pytorch.org/whl/cu75/torch-0.1.10.post2-cp27-none-linux_x86_64.whl (360.3MB)
    99% |████████████████████████████████| 360.3MB 32.1MB/s eta 0:00:01Killed

dmesg:

[326093.958653] Out of memory: Kill process 14093 (pip) score 452 or sacrifice child
[326093.958669] Killed process 14093 (pip) total-vm:5029612kB, anon-rss:4217296kB, file-rss:4kB

Feel free to close this if it is by design that 4G is not enough memory to install Torch (yes, I don't have very much memory), but perhaps there is something here worth investigating?

@soumith
Member

soumith commented Mar 17, 2017

Thanks for reporting this.

We ship binaries, which means that all pip is doing is unzipping the package on your computer and moving the files into the right location.

With this context, there is very little I can do from the package side to improve this.

When I get time, I can try to simulate this in a limited-memory environment, find out the exact cause, and report it to the pip folks upstream. But since the task is not really actionable from my side, I'll close this issue.

@soumith soumith closed this as completed Mar 17, 2017
@diwu1989

diwu1989 commented Sep 7, 2017

You can wget the .whl package locally and then run pip install on it; that seems to use less memory.
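
For anyone copying that, a minimal sketch assuming the same CUDA 7.5 / Python 2.7 wheel from the original report (adjust the URL and filename for your own setup):

# download the wheel to disk first, then point pip at the local file
wget https://download.pytorch.org/whl/cu75/torch-0.1.10.post2-cp27-none-linux_x86_64.whl
pip install ./torch-0.1.10.post2-cp27-none-linux_x86_64.whl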

@rkingery

I'm still seeing this problem when trying to pip install torch inside the base Ubuntu Docker container. It gets to 99% installed and then gets killed. Other packages install fine.

@vikramriyer

I am facing the exact same issue as @rkingery.

@heiner

heiner commented Apr 8, 2019

Same issue here.

This can be fixed by increasing the Docker memory limit (e.g. https://stackoverflow.com/questions/44533319/how-to-assign-more-memory-to-docker-container).
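
For reference, a minimal sketch of one way to do that from the command line; the 4g limit and the ubuntu:18.04 image are arbitrary examples, and Docker Desktop users would instead raise the limit in the GUI as the linked answer describes:

# start the container with a larger memory limit so pip's unpack step is not OOM-killed
docker run -it --memory=4g ubuntu:18.04 bash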

@diwu1989's comment is interesting, though: is there a less demanding way of downloading and installing PyTorch than pip?

@heiner

heiner commented Apr 12, 2019

(Turns out a pip download torch followed by pip install torch*.whl does not go OOM for me.)
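
A minimal sketch of that two-step workaround (the exact wheel filename depends on the torch version and platform, hence the glob):

# download the wheel(s) into the current directory without installing
pip download torch
# then install from the local file(s)
pip install torch*.whl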

@wpietri

wpietri commented Aug 22, 2020

In case others need a workaround, pip download torch OOMed for me. Instead I had to:

This suggests to me that the problem here is a pip bug; it must be allocating a lot of memory when it apparently doesn't need to.

@perber

perber commented Oct 6, 2020

I got the same issue. Another way to install the package is to use the --no-cache-dir option.
It worked in our environment.
pip --no-cache-dir install torch
Hopefully this helps someone.

@Horsmann

Same problem; @perber's solution worked for me in my Docker container.

@espears1

espears1 commented Nov 5, 2020

I am experiencing this same problem, but --no-cache-dir does not solve it.

@bthiban

bthiban commented May 3, 2021

Use RUN pip install -r requirements.txt --no-cache-dir when torch is listed in your requirements.txt.
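
A minimal Dockerfile sketch along those lines (the python:3.8-slim base image and the contents of requirements.txt are assumptions for illustration):

FROM python:3.8-slim
COPY requirements.txt .
# --no-cache-dir disables pip's on-disk cache, which earlier comments report avoids the OOM kill
RUN pip install -r requirements.txt --no-cache-dir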

@espears1

espears1 commented May 3, 2021

@bthiban I mentioned that in #1022 (comment), but unfortunately it did not solve the problem.
