Conversation

@fmassa fmassa commented Aug 1, 2019

This simplifies hacking on the code, as everything now lives in a single place.

I have not modified the functionality of the model (at least not on purpose).
I believe previous versions had a slight off-by-one in the number of conv_builder blocks for mc3_18 (5 instead of 4), but the last one was never used, so this didn't change anything in the generated model.
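
As a minimal illustration of why that off-by-one was harmless (a sketch, not the PR's code; the strings below are placeholders standing in for the real conv builder classes): each of the four residual stages consumes exactly one entry, and `zip` truncates to the shorter sequence.

```python
# Sketch only: placeholder strings stand in for the real conv builder classes.
conv_makers = ["Conv3DSimple"] + ["Conv3DNoTemporal"] * 4  # 5 entries, as in the old mc3_18
layers = [2, 2, 2, 2]  # four stages of two blocks each

# zip() stops at the shorter sequence, so the fifth maker is never consumed
# and the generated model is identical to the 4-entry version.
stages = list(zip(conv_makers, layers))
assert len(stages) == 4
```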

I still need to update the trained model weights, which will be done in a follow-up PR.

cc @bjuncek

```python
# init weights
self._initialize_weights()

if zero_init_residual:
```
@fmassa (Member Author)
Do we want to make this the default behavior, and remove the arg?

Do you know if this changes anything wrt the performance?

@bjuncek bjuncek (Contributor) Aug 1, 2019

> Do we want to make this the default behavior, and remove the arg?

Yeah, I think that's reasonable.

And no, I haven't tried without it.
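
For reference, a self-contained sketch of the `zero_init_residual` idiom under discussion, using a toy block rather than the PR's actual model class: zeroing the scale of the last BatchNorm in each residual branch makes the branch output zero at initialization, so every block starts out as an identity mapping.

```python
import torch
import torch.nn as nn

class ToyBasicBlock(nn.Module):
    """Toy residual block, used only to illustrate the idiom."""
    def __init__(self, planes):
        super().__init__()
        self.conv1 = nn.Conv3d(planes, planes, 3, padding=1, bias=False)
        self.bn1 = nn.BatchNorm3d(planes)
        self.conv2 = nn.Conv3d(planes, planes, 3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm3d(planes)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        out = self.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return self.relu(out + x)

def apply_zero_init_residual(model):
    # Zero the scale of the last BN in each residual branch: the branch then
    # contributes nothing at init, so the block reduces to relu(x).
    for m in model.modules():
        if isinstance(m, ToyBasicBlock):
            nn.init.constant_(m.bn2.weight, 0)

block = ToyBasicBlock(8)
apply_zero_init_residual(block)
x = torch.randn(1, 8, 4, 8, 8)
assert torch.allclose(block(x), torch.relu(x))  # identity mapping at init
```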

@bjuncek bjuncek (Contributor) left a comment

Looks good to me - could we just verify that the number of parameters is unchanged?

```python
pretrained, progress,
block=BasicBlock,
conv_makers=[Conv2Plus1D] * 4,
layers=[2, 2, 2, 2],
```
@bjuncek (Contributor)

We should probably add a guide on default configs?

@fmassa (Member Author)

This is how we currently implement it for the 2D ResNets, and it should be fine in most cases (see the sketch after this thread).

@bjuncek (Contributor)

I suppose that makes sense - this is easier to hack ;)
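
A hedged reconstruction of the pattern being discussed, built around the lines quoted above: per-model defaults live in a thin constructor rather than a separate config file. The `_video_resnet` helper, the exact signature, and the `stem=R2Plus1dStem` argument are assumptions based on the 2D ResNet convention, not a verbatim copy of the merged code.

```python
# Assumed shape of the constructor; _video_resnet, R2Plus1dStem, BasicBlock
# and Conv2Plus1D are taken from the PR's context, not reproduced verbatim.
def r2plus1d_18(pretrained=False, progress=True, **kwargs):
    return _video_resnet("r2plus1d_18",
                         pretrained, progress,
                         block=BasicBlock,
                         conv_makers=[Conv2Plus1D] * 4,
                         layers=[2, 2, 2, 2],
                         stem=R2Plus1dStem,
                         **kwargs)
```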


fmassa commented Aug 1, 2019

I just verified and the number of parameters is the same before and after the change.
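
One quick way to run that check (illustrative; not necessarily the command that was actually used):

```python
import torchvision.models.video as video_models

# Instantiate each refactored model and print its total parameter count,
# to be compared against the pre-refactor numbers.
for name in ("r3d_18", "mc3_18", "r2plus1d_18"):
    model = getattr(video_models, name)(pretrained=False)
    n_params = sum(p.numel() for p in model.parameters())
    print(f"{name}: {n_params:,} parameters")
```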

bjuncek commented Aug 1, 2019

Sounds good then :)

codecov-io commented Aug 4, 2019

Codecov Report

Merging #1190 into master will increase coverage by 0.01%.
The diff coverage is 79.68%.


```diff
@@            Coverage Diff             @@
##           master    #1190      +/-   ##
==========================================
+ Coverage   65.64%   65.65%   +0.01%     
==========================================
  Files          79       74       -5     
  Lines        5827     5780      -47     
  Branches      889      883       -6     
==========================================
- Hits         3825     3795      -30     
+ Misses       1731     1722       -9     
+ Partials      271      263       -8
```
| Impacted Files | Coverage Δ |
|----------------|------------|
| `torchvision/models/video/__init__.py` | 100% <100%> (ø) ⬆️ |
| `torchvision/models/video/resnet.py` | 79.52% <79.52%> (ø) |
| `torchvision/ops/boxes.py` | 94.73% <0%> (ø) ⬆️ |
| `torchvision/io/video.py` | 72% <0%> (ø) ⬆️ |
| `torchvision/transforms/transforms.py` | 81.53% <0%> (+0.58%) ⬆️ |
| `torchvision/transforms/functional.py` | 71.38% <0%> (+1.44%) ⬆️ |


Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

fmassa commented Aug 4, 2019

Training these models yielded the following accuracies, for a clip length of 16 frames:

| model | clip @ 1 |
|-------|----------|
| r3d_18 | 52.748 |
| mc3_18 | 53.898 |
| r2plus1d_18 | 57.498 |

These are pretty close to the reported results.

@fmassa fmassa merged commit 6a834e9 into pytorch:master Aug 4, 2019
@fmassa fmassa deleted the r3d-refactor branch August 4, 2019 22:07
