
Pruning doesn't affect speed nor memory usage #36214

Closed
Cyril9227 opened this issue Apr 8, 2020 · 6 comments
Labels
module: pruning, triaged (this issue has been looked at by a team member, and triaged and prioritized into an appropriate module)

Comments

@Cyril9227

Hello,

I'm experimenting with the new pruning feature in PyTorch and going through the tutorial. My issue is that after pruning the weights of a ResNet, the inference time and the memory footprint are exactly the same.

How do I get a speed-up and a reduced memory footprint using pruning in PyTorch?

My code for pruning is pretty simple so far:

import torch
import torch.nn.utils.prune as prune

for name, module in model.named_modules():
    # prune 20% of connections in all 2D-conv layers
    if isinstance(module, torch.nn.Conv2d):
        prune.l1_unstructured(module, name='weight', amount=0.2)
        prune.remove(module, 'weight')  # make it permanent
    # prune 40% of connections in all linear layers
    elif isinstance(module, torch.nn.Linear):
        prune.l1_unstructured(module, name='weight', amount=0.4)
        prune.remove(module, 'weight')

For timing I use:

start = time.time()
# get predictions
output = model(inputs)
end = time.time()
print(end - start)
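
As a side note on measurement, a minimal sketch assuming the model sits on a CUDA device: GPU kernels launch asynchronously, so time.time() alone can end up measuring only the launch overhead; synchronizing before reading the clock gives a more reliable number.

import time
import torch

torch.cuda.synchronize()   # flush any pending GPU work first
start = time.time()
output = model(inputs)     # 'model' and 'inputs' as defined above
torch.cuda.synchronize()   # wait for the forward pass to actually finish
end = time.time()
print(end - start)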

Any help is welcome,
thanks

@gchanan added the module: pruning and triaged labels on Apr 8, 2020
@gchanan
Contributor

gchanan commented Apr 8, 2020

Since this is a question, you might get more help on the forums (https://discuss.pytorch.org/).

CC @mickypaganini.

@mickypaganini
Contributor

Correct, it doesn't. See the conversation here as well: pytorch/tutorials#605 (comment)

@Cyril9227
Author

Cyril9227 commented Apr 9, 2020

> Correct, it doesn't. See the conversation here as well: pytorch/tutorials#605 (comment)

Thanks for the answer @mickypaganini, this is what I thought.

Any idea of the easiest way to sparsify the model once it's pruned?

Thanks

@zeng-hello-world

zeng-hello-world commented Apr 26, 2020

The PyTorch pruner just zeroes out entries in the weight tensor through a mask, so the number of computation ops is still the same.
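
For example, a minimal sketch with an arbitrary layer size: before prune.remove() is called, the module carries the original weights and a binary mask side by side, and weight is just their elementwise product, so the tensor stays dense.

import torch
import torch.nn as nn
import torch.nn.utils.prune as prune

layer = nn.Linear(4, 4)
prune.l1_unstructured(layer, name='weight', amount=0.5)

# pruning reparametrizes the module: the original weights and the binary
# mask are stored separately, and 'weight' is their elementwise product
print(sorted(n for n, _ in layer.named_parameters()))  # ['bias', 'weight_orig']
print(sorted(n for n, _ in layer.named_buffers()))     # ['weight_mask']
assert torch.equal(layer.weight, layer.weight_orig * layer.weight_mask)

# same shape as before pruning, hence the same number of multiply-adds
print(layer.weight.shape)  # torch.Size([4, 4])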

@mickypaganini
Contributor

As far as I know, for memory concerns, you can use .to_sparse().
Example:

import torch
import torch.nn.utils.prune as prune

t = torch.randn(1000, 1000)
torch.save(t, 'original.pth')

p = prune.L1Unstructured(amount=0.99)
pruned_t = p.prune(t)
torch.save(pruned_t, 'pruned.pth')

sparse_t = pruned_t.to_sparse()
torch.save(sparse_t, 'sparse.pth')

Then ls -lht *.pth should return:

196K sparse.pth
3.9M pruned.pth
3.9M original.pth

This only works if you prune your tensor by a large enough amount that representing it in coordinate space is actually memory efficient.

You can look at torch.sparse to see what ops are then supported on these sparse tensors. Note that this is marked as experimental and the API might change in the future: https://pytorch.org/docs/stable/sparse.html
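
For instance, a minimal sketch of one such op, torch.sparse.mm (whether it actually beats the dense equivalent depends on the sparsity level and the backend):

import torch

# build a ~99%-sparse matrix and convert it to COO format
dense = torch.randn(1000, 1000)
dense[torch.rand_like(dense) < 0.99] = 0.0
sparse = dense.to_sparse()

x = torch.randn(1000, 64)
y = torch.sparse.mm(sparse, x)  # sparse-dense matrix multiply
assert torch.allclose(y, dense @ x, atol=1e-4)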

I'm not sure about the computational speed-up concerns... @bwasti, maybe?

@albanD
Collaborator

albanD commented May 20, 2020

Corresponding PR has been merged.
