
CUDA out of memory #5

Open
Arman-IMRSV opened this issue Apr 9, 2021 · 6 comments

Comments

@Arman-IMRSV

Hello. I am trying to reproduce the paper results. I am currently running the code on 2 Tesla V100 GPUs, each with 16GB of memory, but I am still getting an out-of-memory error. I also tried decreasing MAX_TRANSCRIPT_WORD to 1000, but it did not help. Could you please let me know what hardware and GPUs are required to run it?

@Arman-IMRSV
Author

@xrc10

@ilyaivensky

Same story here: running with 4 Quadro RTX 6000 GPUs, each with 24GB of memory.

@xrc10
Contributor

xrc10 commented Jun 24, 2021

We used a V100 GPU with 32GB of memory. Unfortunately, I haven't tried it with other GPUs. Can you also try decreasing MAX_SENT_LEN and MAX_SENT_NUM to smaller values to see if the OOM error still occurs?
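
A minimal sketch of how one might check whether lowering those values actually shrinks the per-batch footprint, assuming the training loop is PyTorch-based; `model`, `train_loader`, `optimizer`, and `loss_fn` below are placeholders, not identifiers from this repo:

```python
import torch

def train_one_epoch(model, train_loader, optimizer, loss_fn, device="cuda"):
    """Log peak GPU memory per batch (placeholder loop, not the repo's own code)."""
    model.train()
    for step, (inputs, targets) in enumerate(train_loader):
        # Reset the peak-memory counter so each step is measured independently.
        torch.cuda.reset_peak_memory_stats(device)
        inputs, targets = inputs.to(device), targets.to(device)

        optimizer.zero_grad()
        loss = loss_fn(model(inputs), targets)
        loss.backward()
        optimizer.step()

        peak_gb = torch.cuda.max_memory_allocated(device) / 1024 ** 3
        print(f"step {step}: peak GPU memory {peak_gb:.2f} GB")
```

If the peak stays near 16GB even with small MAX_SENT_LEN / MAX_SENT_NUM, the model weights and optimizer states, rather than the batch itself, are likely what exceeds the card.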

@Arman-IMRSV
Author

Thanks @xrc10 for the response. I had already tried decreasing those parameters, but it didn't help.

@omelnikov

omelnikov commented Aug 14, 2021

Hi @Arman-IMRSV, good observations! Could you clarify which parameter values you tried, and what you decreased them from and to? Judging by the difference in GPU memory sizes, the new settings need to produce batches roughly half the byte size of those used by the authors. Note that the GPU reserves some memory for its own tasks, so not all 24GB is available for training batches.

Also, did you use the same training set? Sentence lengths vary across corpora, and the byte size of a batch can be estimated from its average character length. I'm also curious whether you investigated the batch that caused the memory crash: was it the first batch, and how large was it in bytes? You might also try a lower tensor precision than the one used in the paper.

Try exploring the memory-crashing batch in greater detail. I hope it works out, but do share what you discover; it helps others reproduce the results with fewer glitches on different hardware.
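
On the lower-precision suggestion, below is a minimal sketch of mixed-precision training with torch.cuda.amp that also logs the batch that triggers the OOM, assuming a standard PyTorch loop; `model`, `train_loader`, `optimizer`, and `loss_fn` are again placeholders, not identifiers from this repository:

```python
import torch
from torch.cuda.amp import autocast, GradScaler

def train_amp(model, train_loader, optimizer, loss_fn, device="cuda"):
    """Placeholder loop: fp16 autocast plus a report on the batch that runs out of memory."""
    scaler = GradScaler()
    model.train()
    for step, (inputs, targets) in enumerate(train_loader):
        try:
            inputs, targets = inputs.to(device), targets.to(device)
            optimizer.zero_grad()
            with autocast():  # run the forward pass in float16 where it is safe to do so
                loss = loss_fn(model(inputs), targets)
            scaler.scale(loss).backward()  # scale the loss to avoid fp16 gradient underflow
            scaler.step(optimizer)
            scaler.update()
        except RuntimeError as e:
            if "out of memory" not in str(e):
                raise
            # Report which batch crashed and roughly how many bytes of input it held.
            mb = inputs.element_size() * inputs.nelement() / 1024 ** 2
            print(f"OOM at step {step}: input shape {tuple(inputs.shape)}, ~{mb:.1f} MB of input tensors")
            raise
```

Whether autocast alone is enough depends on where the memory goes: it roughly halves activation memory, but optimizer states stay in fp32 unless the training code is changed further.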

@shonaviso

Hi @Arman-IMRSV,
I am facing the above issue while evaluating. Is it the same for you?
