
Cannot finish tasks in Colab because runtime crashes due to low RAM. #57

Closed · ritog opened this issue Jun 22, 2023 · 4 comments · Fixed by #108

@ritog (Contributor) commented Jun 22, 2023

In the setup section of this course, it says:

Google Colab for hands-on exercises. The free version is enough.

But in the section where the feature extractor is applied to the music dataset, the Colab runtime crashes, reporting that it ran out of RAM.

[screenshot: Colab crash message reporting the session ended due to low RAM]

What could be a possible workaround?

@MKhalusova (Contributor) commented:

@sanchit-gandhi Can you please take a look?

@sanchit-gandhi (Collaborator) commented:

Thanks for flagging, @ritog! There are a few tricks we can employ to try and get this working with lower RAM (I'm fairly confident it's just a case of tweaking the `.map` hyper-parameters to get this to work on a free Google Colab).

Could you please try reducing two parameters?

  • `batch_size`: defaults to 1000; let's try setting this to 100, and if that doesn't work, reduce it by a factor of 2 again to 50
  • `writer_batch_size`: defaults to 1000; let's try setting this to 500, and if that doesn't work, reduce it by a factor of 2 to 250

Using a combination of the two should work best here, so I would try `batch_size=100, writer_batch_size=500`, and if that doesn't work, `batch_size=50, writer_batch_size=500`:

```python
gtzan_encoded = gtzan.map(
    preprocess_function,
    remove_columns=["audio", "file"],
    batched=True,
    num_proc=1,
    batch_size=100,
    writer_batch_size=500,
)
```
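For context (not from the original comment): `batch_size` controls how many examples are passed to `preprocess_function` per call, while `writer_batch_size` controls how many processed rows 🤗 Datasets buffers in memory before flushing them to the on-disk Arrow cache, so lowering either reduces peak RAM. Below is a minimal sketch for watching RAM headroom around the call, assuming `gtzan` and `preprocess_function` are defined as in the course unit (`psutil` ships preinstalled on Colab):

```python
import psutil

def free_ram_gib() -> float:
    """Currently available system RAM in GiB."""
    return psutil.virtual_memory().available / 1024**3

print(f"free RAM before map: {free_ram_gib():.2f} GiB")

# `gtzan` and `preprocess_function` are assumed from the course unit
gtzan_encoded = gtzan.map(
    preprocess_function,
    remove_columns=["audio", "file"],
    batched=True,
    num_proc=1,
    batch_size=100,        # examples handed to preprocess_function per call
    writer_batch_size=500, # rows buffered before flushing to the Arrow cache
)

print(f"free RAM after map: {free_ram_gib():.2f} GiB")
```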

@sanchit-gandhi (Collaborator) commented:

Hey @ritog - wondering if you had any luck here? I'd be interested in hearing whether you found a configuration that worked for the `.map` method. If so, I can update the Unit to use your configs. Otherwise, we'll have to find a different workaround!

@MHRDYN7 (Contributor) commented Jul 15, 2023

@sanchit-gandhi
It works fine with `batch_size=100`; no need to change `writer_batch_size`. You may also want to update the output pointed out in #95.
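For reference, a minimal sketch of the configuration reported to work here, with `writer_batch_size` left at its default of 1000 (again assuming `gtzan` and `preprocess_function` from the course unit):

```python
# Configuration reported to work on a free Colab instance in this thread:
# only batch_size is lowered; writer_batch_size stays at its default (1000).
gtzan_encoded = gtzan.map(
    preprocess_function,
    remove_columns=["audio", "file"],
    batched=True,
    num_proc=1,
    batch_size=100,
)
```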
