filter pruning error #86

cattpku · 2018-11-28T01:33:00Z

Hi Neta,
I met a an error when doing filter pruning, after debugging, I found it might because Distiller does not support concatenate operation.

The related layers of my network:
(aspp): ASPP_module(
(aspp0): Sequential(
(0): Conv2d(116, 256, kernel_size=(1, 1), stride=(1, 1), bias=False)
(1): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(aspp1): Sequential(
(0): Conv2d(116, 256, kernel_size=(3, 3), stride=(1, 1), padding=(6, 6), dilation=(6, 6), bias=False)
(1): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(aspp2): Sequential(
(0): Conv2d(116, 256, kernel_size=(3, 3), stride=(1, 1), padding=(12, 12), dilation=(12, 12), bias=False)
(1): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
(aspp3): Sequential(
(0): Conv2d(116, 256, kernel_size=(3, 3), stride=(1, 1), padding=(18, 18), dilation=(18, 18), bias=False)
(1): BatchNorm2d(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
)
)
(global_avg_pool): Sequential(
(0): AdaptiveAvgPool2d(output_size=(1, 1))
(1): Conv2d(116, 256, kernel_size=(1, 1), stride=(1, 1), bias=False)
)
(conv): Conv2d(1280, 256, kernel_size=(1, 1), stride=(1, 1), bias=False)

where input of layer 'conv' is concatenating outputs of aspp0. aspp1. aspp2, aspp3 and global_avg_pool (at dim = 1)
my configuration for pruning:

        module.aspp.aspp0.0.weight: [0.5, '3D']
        module.aspp.aspp1.0.weight: [0.5, '3D']
        module.aspp.aspp2.0.weight: [0.5, '3D']
        module.aspp.aspp3.0.weight: [0.5, '3D']
        module.global_avg_pool.1.weight: [0.5, '3D']

Then it is supposed that Distiller should prune the following layer 'conv' to be Conv2d(640, 256, kernel_size=(1, 1), stride=(1, 1), bias=False), but I got error ' Given groups=1, weight of size [256, 128, 1, 1], expected input[8, 640, 60, 80] to have 128 channels, but got 640 channels instead', which means Distiller does not recognise the concatenated inputs.
Please advise, thanks.

The text was updated successfully, but these errors were encountered:

nzmora · 2018-11-28T06:32:51Z

Hi @cattpku,

You probably missed issue #85, but I think that what you are seeing is this same bug.
If you can, please try using branch https://github.com/NervanaSystems/distiller/tree/issue_85 which fixes this issue (for filters; channels-pruning is WiP).

As issue #85 explains, some networks have complicated data-dependencies which means that we can't arbitrarily prune filters, but we have to group them so that they all follow the same pruning decision. I realize this is somewhat hard to follow w/o more details so I plan to write and post a loner explanation in a couple of days.

Until then, you can create a PNG graph of your model and I can use it to illustrate to you the dependencies and how to express them.

Another option: take a look at this example, specifically at low_pruner_2.

I hope this helps for now,
Neta

cattpku · 2018-11-28T06:37:26Z

Thanks for your kind explanation, Neta. I will try the branch later.

cattpku · 2018-11-28T12:33:19Z

Hi Neta,
I tried to use the recommended branch, but the error is still there and exactly the same.

nzmora · 2018-11-28T20:10:49Z

@cattpku - any chance you can provide more information? For example, the model definition and your YAML schedule file can help me understand, and hopefully recreate what you are seeing.
Thanks

cattpku · 2018-11-28T23:30:11Z

Hi Neta,
Sure. It is a DeepLabV3+ structure, definition is here:

`
class ASPP_module(nn.Module):
def init(self, inplanes, planes):
super(ASPP_module, self).init()

    self.aspp0 = nn.Sequential(nn.Conv2d(inplanes, planes, kernel_size=1,
                                         stride=1, padding=0, dilation=1, bias=False),
                               nn.BatchNorm2d(planes))
    self.aspp1 = nn.Sequential(nn.Conv2d(inplanes, planes, kernel_size=3,
                                         stride=1, padding=6, dilation=6, bias=False),
                               nn.BatchNorm2d(planes))
    self.aspp2 = nn.Sequential(nn.Conv2d(inplanes, planes, kernel_size=3,
                                         stride=1, padding=12, dilation=12, bias=False),
                               nn.BatchNorm2d(planes))
    self.aspp3 = nn.Sequential(nn.Conv2d(inplanes, planes, kernel_size=3,
                                         stride=1, padding=18, dilation=18, bias=False),
                               nn.BatchNorm2d(planes))

def forward(self, x):
    #print(x.size())
    x0 = self.aspp0(x)
    x1 = self.aspp1(x)
    x2 = self.aspp2(x)
    x3 = self.aspp3(x)

    return torch.cat((x0, x1, x2, x3), dim=1)

class DeepLab_v3_plus(nn.Module):
def init(self, nInputChannels=3, n_classes=2):
super(DeepLab_v3_plus, self).init()
# wrapped feature
self.wrapped_features = wrapper(1)
# ASPP
self.aspp = ASPP_module(stage_out_channels[2], 256)
# global pooling
self.global_avg_pool = nn.Sequential(nn.AdaptiveAvgPool2d((1, 1)),
nn.Conv2d(stage_out_channels[2], 256, 1, stride=1, bias=False))
self.conv = nn.Conv2d(1280, 256, 1, bias=False)
self.bn = nn.BatchNorm2d(256)
self.upsample_size = nn.Upsample(size = (60, 80),mode = 'bilinear', align_corners=None)
self.upsample = nn.Upsample(scale_factor=2, mode = 'bilinear', align_corners=None)
self.upsample_4 = nn.Upsample(scale_factor=4, mode = 'bilinear', align_corners=None)
self.last_conv = nn.Sequential(nn.Conv2d(256, n_classes, kernel_size=1, stride=1))

def forward(self, x):

    x = self.wrapped_features(x)
    x_aspp = self.aspp(x)
    x_ = self.global_avg_pool(x)
    x_ = self.upsample_size(x_)
    x_concat = torch.cat((x_aspp, x_), dim=1)
    x = self.conv(x_concat)
    x = self.bn(x)
    x = self.upsample(x)
    x = self.last_conv(x)
    x = self.upsample_4(x)

    return x

`

And the YAML schedule is

version: 1
pruners:
filter_pruner_50:
class: 'L1RankedStructureParameterPruner'
group_type: Filters
desired_sparsity: 0.5
weights: [
module.aspp.aspp0.0.weight,
module.aspp.aspp1.0.weight,
module.aspp.aspp2.0.weight,
module.aspp.aspp3.0.weight,
module.global_avg_pool.1.weight]
extensions:
net_thinner:
class: 'FilterRemover'
thinning_func_str: remove_filters
arch: 'DeepLab_v3_plus'
dataset: 'my_own_data'
policies:

pruner:
instance_name: filter_pruner_50
epochs: [46]
extension:
instance_name: net_thinner
epochs: [46]

The error comes from
x_concat = torch.cat((x_aspp, x_), dim=1) x = self.conv(x_concat)

I tried to manually modify 'module.conv' in 'thinning.py' by clearing the 'module.conv.weight', and then using 'append_module_directive' and 'append_param_directive' to assign the correct value, now my own experiment succeeded.

nzmora · 2018-12-04T23:30:29Z

Hi @cattpku
"I tried to manually modify 'module.conv' in 'thinning.py' by clearing the 'module.conv.weight', and then using 'append_module_directive' and 'append_param_directive' to assign the correct value, now my own experiment succeeded." ==> can you share the fix?
I tried creating an environment for running DeepLab, but this is taking me too much time.
Thanks!
Neta

cattpku · 2018-12-04T23:55:30Z

Sure.
All modifications are in 'def create_thinning_recipe_filters(sgraph, model, zeros_mask_dict):'
Firstly, I use 5 new variables to store the 'indices' of the 5 targeting layers (which are specified in my network definition above) before 'for successor in successors:':
`
if layer_name == 'module.aspp.aspp0.0':
indices_aspp0 = indices

    elif layer_name == 'module.aspp.aspp1.0':
        indices_aspp1 = torch.add(indices, 128)

    elif layer_name == 'module.aspp.aspp2.0':
        indices_aspp2 = torch.add(indices, 256)

    elif layer_name == 'module.aspp.aspp3.0':
        indices_aspp3 = torch.add(indices, 384)

    elif layer_name == 'module.global_avg_pool.1':
        indices_pool = torch.add(indices, 512)

`

Then before 'return thinning_recipe', I manually modify the 'module.conv' as following:
`
thinning_recipe.parameters['module.conv.weight'].clear()

append_module_directive(model, thinning_recipe, 'module.conv', key='in_channels', val=640)

append_param_directive(thinning_recipe, 'module.conv.weight', (1, torch.cat((indices_aspp0,  indices_aspp1,  indices_aspp2,  indices_aspp3,  indices_pool), 0)))

`
From my current experiment result, it seems worked for me.

nzmora mentioned this issue Dec 1, 2018

Structure pruning is broken for models with non-serial connections #85

Closed

nzmora self-assigned this Dec 4, 2018

cattpku closed this as completed Dec 4, 2018

soumendukrg mentioned this issue Nov 13, 2019

Thinning Issue on Networks with Concatenation in Forward Operation #424

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

filter pruning error #86

filter pruning error #86

cattpku commented Nov 28, 2018 •

edited

nzmora commented Nov 28, 2018

cattpku commented Nov 28, 2018

cattpku commented Nov 28, 2018

nzmora commented Nov 28, 2018

cattpku commented Nov 28, 2018 •

edited

nzmora commented Dec 4, 2018

cattpku commented Dec 4, 2018

filter pruning error #86

filter pruning error #86

Comments

cattpku commented Nov 28, 2018 • edited

nzmora commented Nov 28, 2018

cattpku commented Nov 28, 2018

cattpku commented Nov 28, 2018

nzmora commented Nov 28, 2018

cattpku commented Nov 28, 2018 • edited

nzmora commented Dec 4, 2018

cattpku commented Dec 4, 2018

cattpku commented Nov 28, 2018 •

edited

cattpku commented Nov 28, 2018 •

edited