[Bug] MaxPool3d causes GPU memory leaking #6222

xichangzun · 2018-04-03T07:12:10Z

When I try to train my model, which contains MaxPool3d, it always ends up with an 'out of memory' error.
My environment info is here:

16.04
PyTorch version: 0.4.0a0+da6c3c9
installed PyTorch from source
python version is 3.6.2
CUDA/cuDNN version: 9.1/7.1.2
GCC version (if compiling from source): GCC 5.4
Build command you used (if compiling from source): as default

I can reproduce this bug by the following script:

import torch
import torch.nn as nn
from torch import optim
from torch.nn.functional import smooth_l1_loss

# Minimal model: a single Conv3d -> ReLU -> MaxPool3d stack is enough to reproduce.
model = nn.Sequential(
    nn.Conv3d(1, 1, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.MaxPool3d(kernel_size=3, stride=2, padding=1),
)
# The optimizer is not needed to trigger the error, so it stays commented out.
#optimizer = optim.SGD((x for x in model.parameters() if x.requires_grad is True), lr=1e-3, momentum=0.9,nesterov=True)
crit = smooth_l1_loss
model.cuda()
count = 0
while True:
#    optimizer.zero_grad()
    input = torch.rand(30, 1, 200, 200, 200).cuda()
    loc_output = model(input)
    if isinstance(loc_output, tuple):
        loc_output = loc_output[0]
    targets = torch.rand(loc_output.size()).cuda()
    loss = crit(loc_output, targets)
    loss.backward()
#    optimizer.step()
    # Drop every reference and empty the cache; the run still ends in 'out of memory'.
    del loss, loc_output, targets, input
    torch.cuda.empty_cache()
    count += 1
    print(count)
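
One way to confirm that memory grows on every iteration, rather than waiting for the out-of-memory error, is to record a memory reading after each step. The sketch below is not part of the original report: `memory_trace` and `grows_every_step` are hypothetical helper names, and the assumption is that `torch.cuda.memory_allocated` is available in the PyTorch build being tested.

```python
from typing import Callable, List


def memory_trace(step: Callable[[], None],
                 measure: Callable[[], int],
                 iterations: int = 5) -> List[int]:
    """Run `step` repeatedly and record `measure()` after each iteration."""
    trace = []
    for _ in range(iterations):
        step()
        trace.append(measure())
    return trace


def grows_every_step(trace: List[int]) -> bool:
    """Return True when each reading is strictly larger than the previous one."""
    return len(trace) > 1 and all(b > a for a, b in zip(trace, trace[1:]))


# Hypothetical usage with the script above (needs a CUDA build of PyTorch),
# where `one_training_step` would wrap the body of the `while True:` loop:
#
#   trace = memory_trace(one_training_step, torch.cuda.memory_allocated)
#   print(trace, grows_every_step(trace))
```

A strictly increasing trace would suggest a leak; readings that stay flat after the first iteration would suggest that tensors are being released as expected.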

zou3519 · 2018-04-03T15:07:26Z

Thanks for the report, @xichangzun. I can reproduce this and am looking into it.

Fixes #6222

We don't need to make sure gradInput is contiguous because it's always passed in as an empty tensor (see CUDAFloatType.cpp after it gets codegen-ed). This was increasing the reference count on gradInput and leaking it.

I'm not sure if there's a good way to test this. I put together a script that
1) prints out when a tensor is allocated and deallocated, and
2) checks allocations against deallocations after running a Python script,
and verified that each allocation matches a deallocation.
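
The allocation-versus-deallocation check described above could look roughly like the sketch below. The actual script is not shown in this thread, so everything here is an assumption: the log format (`ALLOC <address>` and `FREE <address>`, one event per line) and the function name `unmatched_allocations` are hypothetical.

```python
from typing import Iterable, Set


def unmatched_allocations(log_lines: Iterable[str]) -> Set[str]:
    """Return the addresses that were allocated but never freed.

    Assumes a hypothetical log format with one event per line:
    'ALLOC <address>' or 'FREE <address>'. Other lines are ignored.
    """
    live = set()
    for line in log_lines:
        parts = line.split()
        if len(parts) != 2:
            continue  # not an allocation event
        event, address = parts
        if event == "ALLOC":
            live.add(address)
        elif event == "FREE":
            live.discard(address)
    return live
```

An empty result means every allocation in the log was matched by a deallocation; any address left in the set points at a tensor that was never released.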

zou3519 mentioned this issue Apr 3, 2018

Fix memory leak in maxpool3d backwards #6230

Merged

mdraw mentioned this issue Apr 3, 2018

OOM when using the current PyTorch git master ELEKTRONN/elektronn3#13

Closed

soumith closed this as completed in #6230 Apr 3, 2018
