Feat/unfreeze layers fpn backbone #2160
Conversation
If a pretrained backbone is not used and one intends to train the entire network from scratch, no layers should be frozen.
Depending on the size of the dataset, one might want to control the number of tunable parameters in the backbone and treat it as a hyperparameter to optimize for the dataset. It would be nice to have this function support this.
Handle backbone freezing in the fasterrcnn_resnet50_fpn function rather than in the resnet_fpn_backbone function that it uses to get the backbone.
Layer freezing code has been moved to the fasterrcnn_resnet50_fpn function, which consumes the resnet_fpn_backbone function.
Hi,
Thanks for the PR!
In its current state, it changes the behavior for Mask R-CNN and Keypoint R-CNN (which also build their backbones through resnet_fpn_backbone), so it can't be merged as is.
I propose that we keep the freezing logic inside resnet_fpn_backbone, but expose another argument to allow users to change it.
Let me know what you think
Moved layer freezing logic to resnet_fpn_backbone with an additional parameter.
Layer freezing logic has been moved to resnet_fpn_backbone. This function only ensures that all layers are made trainable if pretrained models are not used.
@fmassa oh, understood. I have made the requested changes and moved the freezing logic back into the resnet_fpn_backbone function. The current behavior is: layers are frozen only when a pretrained backbone or a pretrained model is used; otherwise all layers remain trainable.
Please let me know if you require any other changes.
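For reference, a minimal sketch of what the updated resnet_fpn_backbone could look like, assuming the new argument is named trainable_layers as in the diff below; everything outside the freezing block is paraphrased from the surrounding torchvision code, not from this PR:

from torchvision.models import resnet
from torchvision.ops import misc as misc_nn_ops

def resnet_fpn_backbone(backbone_name, pretrained, trainable_layers=3):
    backbone = resnet.__dict__[backbone_name](
        pretrained=pretrained, norm_layer=misc_nn_ops.FrozenBatchNorm2d)
    # select layers that won't be frozen; a caller training from scratch
    # is expected to pass trainable_layers=5 so that nothing is frozen
    assert 0 <= trainable_layers <= 5
    layers_to_train = ['layer4', 'layer3', 'layer2', 'layer1', 'conv1'][:trainable_layers]
    for name, parameter in backbone.named_parameters():
        if all([layer not in name for layer in layers_to_train]):
            parameter.requires_grad_(False)
    # the real function goes on to wrap the trunk in an FPN; elided here
    return backbone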
Thanks for the improvements and the documentation!
Can you add a test for resnet_fpn_backbone in test/test_detection_utils.py checking that the parameters are correctly set to require / not require gradient?
I think there could be an issue with the conv1 handling. It might just work as is, but it looks a bit fragile to me.
""" | ||
# select layers that wont be frozen | ||
assert trainable_layers<=5 and trainable_layers >=0 | ||
layers_to_train = ['layer4', 'layer3', 'layer2', 'layer1', 'conv1'][:trainable_layers] |
conv1 is part of every Bottleneck and BasicBlock, see vision/torchvision/models/resnet.py, line 91 in bd27e94:

self.conv1 = conv1x1(inplanes, width)

so I'm not sure that the conv1 part in here has the expected behavior (it might, but that could be by accident).
Sure, I will add test cases and update the PR.
Regarding the conv1 part, I understand that it looks hacky and is not intuitive, but it works in this case because of the way the freezing is done.
Because of the order of strings in layers_to_train, 'conv1' is only used when trainable_layers is set to 5 and the entire network should be unfrozen. So even if 'conv1' finds matches in other blocks, it's okay.
The fully qualified layer names that resnet_fpn_backbone observes for resnet50 are:
conv1.weight
layer1.0.conv1.weight
layer1.0.conv2.weight
layer1.0.conv3.weight
layer1.0.downsample.0.weight
layer1.1.conv1.weight
layer1.1.conv2.weight
layer1.1.conv3.weight
layer1.2.conv1.weight
layer1.2.conv2.weight
layer1.2.conv3.weight
layer2.0.conv1.weight
layer2.0.conv2.weight
layer2.0.conv3.weight
layer2.0.downsample.0.weight
layer2.1.conv1.weight
layer2.1.conv2.weight
layer2.1.conv3.weight
layer2.2.conv1.weight
layer2.2.conv2.weight
layer2.2.conv3.weight
layer2.3.conv1.weight
layer2.3.conv2.weight
layer2.3.conv3.weight
layer3.0.conv1.weight
layer3.0.conv2.weight
layer3.0.conv3.weight
layer3.0.downsample.0.weight
layer3.1.conv1.weight
layer3.1.conv2.weight
layer3.1.conv3.weight
layer3.2.conv1.weight
layer3.2.conv2.weight
layer3.2.conv3.weight
layer3.3.conv1.weight
layer3.3.conv2.weight
layer3.3.conv3.weight
layer3.4.conv1.weight
layer3.4.conv2.weight
layer3.4.conv3.weight
layer3.5.conv1.weight
layer3.5.conv2.weight
layer3.5.conv3.weight
layer4.0.conv1.weight
layer4.0.conv2.weight
layer4.0.conv3.weight
layer4.0.downsample.0.weight
layer4.1.conv1.weight
layer4.1.conv2.weight
layer4.1.conv3.weight
layer4.2.conv1.weight
layer4.2.conv2.weight
layer4.2.conv3.weight
fc.weight
fc.bias
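A small self-contained check of this claim, using a few of the names above (a sketch, not the PR's test code):

names = ['conv1.weight', 'layer1.0.conv1.weight', 'layer4.2.conv3.weight']

def frozen(name, trainable_layers):
    layers_to_train = ['layer4', 'layer3', 'layer2', 'layer1', 'conv1'][:trainable_layers]
    return all([layer not in name for layer in layers_to_train])

print([frozen(n, 3) for n in names])  # [True, True, False]
print([frozen(n, 5) for n in names])  # [False, False, False]: nothing is frozen,
                                      # so the extra 'conv1' substring matches are harmless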
One thing I can do to make this more intuitive is to replace:

if all([layer not in name for layer in layers_to_train]):

with

if all([not name.startswith(layer) for layer in layers_to_train]):

This will make sure that only clean matches are found. What are your thoughts on this?
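For example, with one of the nested names from the list above:

name = 'layer1.0.conv1.weight'
print('conv1' in name)           # True:  substring match hits the inner block's conv1
print(name.startswith('conv1'))  # False: only the top-level conv1 would match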
Here is a notebook which tests the desired functionality.
I will add the test cases based on it into test_models_detection_utils.py
One thing I can do to make this more intuitive is to replace:
if all([layer not in name for layer in layers_to_train]):
with
if all([not name.startswith(layer) for layer in layers_to_train]):
this will make sure that only clean matches are found. What are your thoughts on this

Sure, looks good to me!
Test case has been added.
This PR adds functionality to specify the number of trainable layers when initializing Faster R-CNN via the fasterrcnn_resnet50_fpn function. This commit adds a test case for this functionality.
More information in the PR.
Test cases have been added. Let me know if any other change is required.
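A minimal version of such a test might look like the sketch below; it assumes the returned module exposes the resnet trunk as body and that the new argument is named trainable_layers, as discussed above:

from torchvision.models.detection.backbone_utils import resnet_fpn_backbone

def test_resnet_fpn_backbone_frozen_layers():
    # with the default trainable_layers=3, only layer2, layer3 and layer4
    # should keep requires_grad=True; conv1 and layer1 should be frozen
    backbone = resnet_fpn_backbone('resnet50', pretrained=False, trainable_layers=3)
    for name, parameter in backbone.body.named_parameters():
        should_train = any(name.startswith(layer) for layer in ('layer2', 'layer3', 'layer4'))
        assert parameter.requires_grad == should_train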
Thanks a lot!
For a follow-up PR, can you add the same flags to Mask R-CNN and Keypoint R-CNN?
@fmassa absolutely. Will do that in a couple of days.
Currently, the fasterrcnn_resnet50_fpn function is used to create a Faster R-CNN with a resnet50 backbone and an FPN. The resnet_fpn_backbone function in backbone utils is used by this function. It freezes the backbone layers in resnet apart from layer2, layer3 and layer4. This freezing is hard-coded to mirror the Faster R-CNN paper, which froze the initial layers of the pretrained backbone.
If a pretrained backbone is not used and one intends to train the entire network from scratch, no layers should be frozen; otherwise the initial layers will always keep their randomly initialized weights. I think this can be considered a bug, because layer freezing is not even mentioned in the function docs, so users are not aware of it.
This resulted in poor AP when we were training Faster R-CNN with a resnet backbone from scratch on the Detrac dataset, and it took a while to figure out.
Furthermore, the number of resnet backbone layers that should be frozen is an important hyperparameter in my experience, and it needs to be tuned for each dataset. So adding an argument to control this enabled me to integrate it into hyperparameter tuning.
I have created this pull request so that others don't run into the same problem when conducting experiments similar to mine.
I have moved the layer freezing logic to fasterrcnn_resnet50_fpn so that layers are frozen if either a pretrained backbone or a pretrained Faster R-CNN is used, and are not frozen otherwise.
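As a usage sketch, and assuming the detection constructor exposes the knob under a name like trainable_backbone_layers (an assumption, not confirmed in this thread), training from scratch would look like:

import torchvision

# hypothetical usage: the argument names below are assumptions based on this thread
model = torchvision.models.detection.fasterrcnn_resnet50_fpn(
    pretrained=False,
    pretrained_backbone=False,    # training from scratch
    trainable_backbone_layers=5,  # 0 freezes the whole resnet; 5 unfreezes it fully
)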