
Out of memory in Colab (free) #15

Closed

gnmarten opened this issue Mar 17, 2023 · 5 comments

Comments

@gnmarten

Thanks for the Colab.

I ran into the dreaded out-of-memory error while processing the embedding files saved to nemo_outputs/speaker_outputs/embeddings.

Just trying to understand: my audio file is only 1 h 18 min long, but its embeddings amount to "Dataset loaded with 9209 items, total duration of 2.55 hours." Since segmentation and subsegmentation are done automatically, I wonder what the upper limit on file size would be for the script to work on Colab.

Full error:
RuntimeError: CUDA out of memory. Tried to allocate 7.61 GiB (GPU 0; 14.75 GiB total capacity; 7.81 GiB already allocated; 5.65 GiB free; 7.83 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

@KevinGeLe

KevinGeLe commented Mar 17, 2023

That's weird, I haven't had that problem on long audio files. Try a smaller Whisper model via "--whisper-model": the default is "medium.en", so try "small.en". It will reduce accuracy but should use roughly 3x less memory.

Edit: adding "torch.cuda.empty_cache()", will free up memory thats not in use.

@MahmoudAshraf97
Owner

Hi, this is strange, as NeMo automatically segments the file to avoid OOM errors. Setting the max split size might help, since there's plenty of memory available.

@rashi-budati

Hello, I would like your help figuring out where to set the max split size. In the create_config function, I could not find any such parameter.

@MahmoudAshraf97
Owner

@rashi-budati check max_split_size_mb here.
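
For anyone else landing here: max_split_size_mb is an option of PyTorch's CUDA caching allocator, not a parameter of create_config, so it is set through the PYTORCH_CUDA_ALLOC_CONF environment variable rather than in the script's config. A sketch (512 is just an illustrative value):

```python
import os

# Must be set before the process's first CUDA allocation.
os.environ["PYTORCH_CUDA_ALLOC_CONF"] = "max_split_size_mb:512"

import torch  # the allocator reads the variable on first CUDA use
```

Equivalently, export PYTORCH_CUDA_ALLOC_CONF=max_split_size_mb:512 in the shell, or use %env in a Colab cell, before launching the script.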

@kksgandhi

If you're having OOM issues and don't mind a performance hit, remember that you can run this with --device cpu; that way you aren't using your GPU at all.

I'm running whisper-diarization locally, though, so I'm unsure how this would interact with Google Colab.
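
If you are embedding the pipeline in your own Python code rather than using the CLI flag, the equivalent fallback is roughly this (a sketch, not the project's actual device handling):

```python
import torch

# Prefer the GPU when present; otherwise run everything on CPU.
# CPU is much slower, but it sidesteps CUDA OOM errors entirely.
device = "cuda" if torch.cuda.is_available() else "cpu"

# Hypothetical stand-in for a pipeline stage's model.
model = torch.nn.Linear(8, 8).to(device)
```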
