RuntimeError: CUDA error: out of memory #3

peng666 · 2021-05-14T11:13:05Z

I train the network through the following command：
python train.py --gpu 0,,1 --workdir log --model msn

When I execute the command, the program stops as follows：

THCudaCheck FAIL file=/opt/conda/conda-bld/pytorch_1616554800319/work/aten/src/THC/THCCachingHostAllocator.cpp line=278 error=2 : out of memory
Traceback (most recent call last):
File "train.py", line 83, in
main()
File "train.py", line 79, in main
model.runner()
File "/home/pjw/projects5/SpareNet/runners/base_runner.py", line 338, in runner
self.val()
File "/home/pjw/projects5/SpareNet/runners/base_runner.py", line 209, in val
self.val_step(items)
File "/home/pjw/projects5/SpareNet/runners/msn_runner.py", line 54, in val_step
, (, _, _, data) = items
File "/home/pjw/projects5/SpareNet/utils/misc.py", line 18, in var_or_cuda
x = x.cuda(non_blocking=True)
RuntimeError: CUDA error: out of memory

The text was updated successfully, but these errors were encountered:

peng666 · 2021-05-14T13:07:33Z

The problem has been solved, thank you

HackHarry · 2021-05-26T09:21:19Z

The problem has been solved, thank you

hello, I have this problem too. How do you solve this?

peng666 closed this as completed May 14, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RuntimeError: CUDA error: out of memory #3

RuntimeError: CUDA error: out of memory #3

peng666 commented May 14, 2021 •

edited

peng666 commented May 14, 2021

HackHarry commented May 26, 2021

RuntimeError: CUDA error: out of memory #3

RuntimeError: CUDA error: out of memory #3

Comments

peng666 commented May 14, 2021 • edited

peng666 commented May 14, 2021

HackHarry commented May 26, 2021

peng666 commented May 14, 2021 •

edited