
Increased memory usage vs. torchvision equivalent #42

Closed
nlml opened this issue Jan 18, 2021 · 3 comments

Comments

@nlml

nlml commented Jan 18, 2021

Hello,

First of all: fantastic paper and contribution -- and the pypi package is the cherry on top :D

I decided to try switching one of my model trainings to use antialiased_cnns.resnet34 as a drop-in replacement for torchvision.models.resnet34. It seems, however, that the memory requirements are almost 1.5x higher with the anti-aliased CNN. With the torchvision version, my model trains with a batch size of 16 per GPU (it's a sequence model, so the number of images going through the CNN per batch is much higher than 16). With the anti-aliased CNN, I get CUDA out-of-memory errors for any batch size above 11.
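For context, the swap itself is just the following (a rough sketch, assuming the PyPI package's resnet34 constructor mirrors torchvision's; the rest of my training code is unchanged):

```python
import torchvision.models
import antialiased_cnns

# before: standard torchvision backbone
# model = torchvision.models.resnet34(pretrained=True)

# after: anti-aliased drop-in replacement
model = antialiased_cnns.resnet34(pretrained=True)
```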

Were you aware of this? I'm not necessarily expecting a fix; I'm just wondering whether the increase makes sense to you and whether it was already known.

Thanks again!

@richzhang
Contributor

Thanks for the message! Yes, it takes more memory. I implemented a gradient-accumulation flag, which approximates training with larger batch sizes (for example, accumulate the gradient over 2 batches of size N/2 and then update).
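The general gradient-accumulation pattern in PyTorch looks roughly like this (a minimal sketch of the idea, not the exact flag or implementation in this repo; `model`, `loader`, `criterion`, `optimizer`, and `accum_steps` are placeholders):

```python
accum_steps = 2                    # e.g. 2 batches of size N/2 approximate one batch of size N
optimizer.zero_grad()
for i, (x, y) in enumerate(loader):
    # scale the loss so the accumulated gradient matches a full-size batch
    loss = criterion(model(x), y) / accum_steps
    loss.backward()                # gradients accumulate in .grad across iterations
    if (i + 1) % accum_steps == 0:
        optimizer.step()           # update once the equivalent full batch has been seen
        optimizer.zero_grad()
```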

@nlml
Author

nlml commented Jan 20, 2021

Thanks for the response! Out of curiosity, did you think about whether the memory footprint could be reduced, perhaps by using some in-place operations, or is that not possible here?

@nlml closed this as completed Jan 20, 2021
@richzhang
Contributor

I think if it were forward-pass only, the operation could be made in-place.
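To illustrate the constraint (a toy sketch, not this repo's code): autograd saves tensors from the forward pass for use in backward, so modifying a saved tensor in place breaks gradient computation; with gradients disabled (forward only), the in-place version is fine.

```python
import torch

x = torch.randn(4, requires_grad=True)
y = torch.exp(x)        # autograd saves y, since d/dx exp(x) = exp(x) = y
y.add_(1)               # in-place edit of a tensor needed for backward
# y.sum().backward()    # -> RuntimeError: a variable needed for gradient
#                       #    computation has been modified by an inplace operation

with torch.no_grad():   # forward-pass only: nothing is saved, in-place is safe
    y2 = torch.exp(x)
    y2.add_(1)
```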
