CUDA out of memory #28

Closed
ruodingt opened this issue Apr 16, 2020 · 3 comments

Comments

@ruodingt

Hi @youngwanLEE
I was trying centermask2 on a dataset other than COCO.
I'm using a single V100 GPU.

I set the batch size to 8 and left MIN_SIZE_TRAIN unchanged.
The config file I used is centermask_V_39_eSE_FPN_ms_3x.yaml.

Yet I still get a CUDA OOM error.

I can't see any other factors that could be leading to it.

Could you please give me some tips?

@youngwanLEE
Owner

@ruodingt

Why don't you lower the batch size to 4?
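
For reference, a minimal sketch of overriding the memory-relevant values with the standard detectron2 config API (assuming centermask2 uses the stock get_cfg and the usual configs/ layout; its extra keys may require the project's own config helper, and the path below is an assumption):

from detectron2.config import get_cfg

# Load the base config, then override the values that dominate GPU memory.
cfg = get_cfg()
cfg.merge_from_file("configs/centermask/centermask_V_39_eSE_FPN_ms_3x.yaml")  # path is an assumption

# Fewer images per batch (total across all GPUs) and a smaller training
# resolution both lower peak memory on a single V100.
cfg.SOLVER.IMS_PER_BATCH = 4
cfg.INPUT.MIN_SIZE_TRAIN = (640,)  # instead of the multi-scale schedule up to 800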

@ruodingt
Author

You are totally right.

In your experiment script and paper, I saw you use a batch size of 16.
May I ask whether that 16 is per GPU or not?

Thank you.

@jgsch

jgsch commented Apr 17, 2020

@ruodingt From the detectron2 documentation:

# Number of images per batch across all machines.
# If we have 16 GPUs and IMS_PER_BATCH = 32,
# each GPU will see 2 images per batch.
_C.SOLVER.IMS_PER_BATCH = 16
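
So 16 is the total across all GPUs, not per GPU; in the usual 8-GPU detectron2 setup that works out to 2 images per GPU. A rough sketch of the arithmetic, including the linear learning-rate scaling rule commonly applied when the total batch size is reduced (the base LR value below is illustrative, not taken from the config):

# Per-GPU batch size is the total IMS_PER_BATCH divided across GPUs.
ims_per_batch = 16          # paper setting: total across all machines
num_gpus = 8                # typical multi-GPU setup for that setting
print(ims_per_batch // num_gpus)   # -> 2 images per GPU

# If the total batch shrinks (e.g. 16 -> 4 on one V100), the linear scaling
# rule reduces the base learning rate by the same factor.
base_lr_at_16 = 0.01        # illustrative value; check SOLVER.BASE_LR in the config
print(base_lr_at_16 * 4 / 16)      # -> 0.0025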
