Is it possible to use smaller GPU for inference? #8

Open · AmitRozner opened this issue Mar 20, 2019 · 10 comments

@AmitRozner

I read that you trained with a batch of 8 images on a P40. Is it possible to use the code on a GTX 1080TI (12GB) with a smaller batch size?

@lijiannuist
Contributor

Hi @AmitRozner
Of course. Training with a smaller batch size may slightly reduce detection performance, though.

@cqlyiyeshu

My GPU is a V100 (16G). When I run
python demo.py --trained_model weights/WIDERFace_DSFD_RES152.pth --img_root data/worlds-largest-selfie.jpg
it fails with: RuntimeError: CUDA out of memory
Is it possible to use a smaller GPU?

@lijiannuist
Contributor

Hi @cqlyiyeshu
I think 16G should be enough.
You can try using fewer test scales in demo.py, especially dropping the 2x scale.
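For reference, a minimal sketch of what dropping the 2x pass might look like, assuming demo.py stacks per-scale detections with the infer()/np.row_stack pattern that appears in the traceback further down this thread (the multi_scale_test helper here is illustrative, not the repo's actual function):

```python
import numpy as np

def multi_scale_test(net, img, transform, thresh, cuda):
    # original-resolution pass
    det = infer(net, img, transform, thresh, cuda, 1.0)
    # a shrunken pass is cheap on memory and can stay
    det = np.row_stack((det, infer(net, img, transform, thresh, cuda, 0.5)))
    # the enlarged 2x pass dominates GPU memory; skip it on smaller cards
    # det = np.row_stack((det, infer(net, img, transform, thresh, cuda, 2.0)))
    return det
```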

@jiangziya

@lijiannuist My GPU has 8G. How can I solve the error "RuntimeError: CUDA out of memory"?
The error log is:
```
Traceback (most recent call last):
  File "demo.py", line 248, in <module>
    test_oneimage()
  File "demo.py", line 232, in test_oneimage
    det_b = np.row_stack((det_b, infer(net, img, transform, thresh, cuda, bt)))
  File "demo.py", line 98, in infer
    y = net(x)  # forward pass
  File "/usr/lib64/python2.7/site-packages/torch/nn/modules/module.py", line 477, in __call__
    result = self.forward(*input, **kwargs)
  File "/home/jy/FaceDetection-DSFD/face_ssd.py", line 235, in forward
    conv4_3_x = self.layer2(conv3_3_x)
  File "/usr/lib64/python2.7/site-packages/torch/nn/modules/module.py", line 477, in __call__
    result = self.forward(*input, **kwargs)
  File "/usr/lib64/python2.7/site-packages/torch/nn/modules/container.py", line 91, in forward
    input = module(input)
  File "/usr/lib64/python2.7/site-packages/torch/nn/modules/module.py", line 477, in __call__
    result = self.forward(*input, **kwargs)
  File "/usr/lib64/python2.7/site-packages/torch/nn/modules/container.py", line 91, in forward
    input = module(input)
  File "/usr/lib64/python2.7/site-packages/torch/nn/modules/module.py", line 477, in __call__
    result = self.forward(*input, **kwargs)
  File "/usr/lib/python2.7/site-packages/torchvision/models/resnet.py", line 85, in forward
    out = self.bn3(out)
  File "/usr/lib64/python2.7/site-packages/torch/nn/modules/module.py", line 477, in __call__
    result = self.forward(*input, **kwargs)
  File "/usr/lib64/python2.7/site-packages/torch/nn/modules/batchnorm.py", line 66, in forward
    exponential_average_factor, self.eps)
  File "/usr/lib64/python2.7/site-packages/torch/nn/functional.py", line 1254, in batch_norm
    training, momentum, eps, torch.backends.cudnn.enabled
RuntimeError: CUDA error: out of memory
```

@JaywongWang

JaywongWang commented Apr 11, 2019

> Hi @cqlyiyeshu
> I think 16G should be enough.
> You can try using fewer test scales in demo.py, especially dropping the 2x scale.

@lijiannuist Do you mean resizing the input image to a smaller size?

@jiangziya

> > Hi @cqlyiyeshu
> > I think 16G should be enough.
> > You can try using fewer test scales in demo.py, especially dropping the 2x scale.
>
> @lijiannuist Do you mean resizing the input image to a smaller size?

Yes, I resized the input to 100x100.
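For what it's worth, a minimal sketch of that resize step, assuming the image is loaded with OpenCV as in demo.py (100x100 is just the size that worked above; pick whatever fits your GPU):

```python
import cv2

img = cv2.imread('data/worlds-largest-selfie.jpg')
# downscale before inference; smaller input -> smaller feature maps -> less GPU memory
img = cv2.resize(img, (100, 100))
```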

@yihongXU

yihongXU commented Apr 24, 2019

Hi, try adding torch.set_grad_enabled(False) at the beginning of the test_oneimage() function if your torch version is >= 0.4. It works for me.
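A minimal sketch of where the call would go, assuming the test_oneimage() layout from the traceback above (only the first line is new; the comment stands in for the existing body):

```python
import torch

def test_oneimage():
    # PyTorch >= 0.4: turn off autograd for everything in this function so
    # intermediate activations are not kept around for a backward pass
    torch.set_grad_enabled(False)
    # ... original demo.py code: load image, build net, run multi-scale inference ...
```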

@vlad3996

You can use this fork https://github.com/vlad3996/FaceDetection-DSFD with the original author's checkpoint,
or at least wrap the inference in a with torch.no_grad(): block.
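A minimal sketch of the no_grad() variant, wrapping just the forward call (the y = net(x) line inside infer() in demo.py); the forward_no_grad helper is illustrative:

```python
import torch

def forward_no_grad(net, x):
    # no autograd graph is built inside this block, so activations are
    # freed as soon as the forward pass finishes
    with torch.no_grad():
        return net(x)
```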

@TekiLi

TekiLi commented May 14, 2019

> My GPU is a V100 (16G). When I run
> python demo.py --trained_model weights/WIDERFace_DSFD_RES152.pth --img_root data/worlds-largest-selfie.jpg
> it fails with: RuntimeError: CUDA out of memory
> Is it possible to use a smaller GPU?

You are probably using torch 0.4+, where the memory is not released; with torch 0.3 there is no problem.

@KarelZhang

> > My GPU is a V100 (16G). When I run
> > python demo.py --trained_model weights/WIDERFace_DSFD_RES152.pth --img_root data/worlds-largest-selfie.jpg
> > it fails with: RuntimeError: CUDA out of memory
> > Is it possible to use a smaller GPU?
>
> You are probably using torch 0.4+, where the memory is not released; with torch 0.3 there is no problem.

With torch 0.4+ it does run out of memory. Is there any way to release the memory? Right now I can only test one image at a time.
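In case it helps, a hypothetical per-image loop for torch 0.4+ that keeps memory from accumulating between images; detect_one stands in for whatever per-image routine you already use (e.g. test_oneimage from demo.py):

```python
import torch

def run_images(image_paths, detect_one):
    results = []
    with torch.no_grad():                   # never build autograd graphs
        for path in image_paths:
            results.append(detect_one(path))
            torch.cuda.empty_cache()        # hand cached blocks back to the driver
    return results
```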
