
Make initialization of GoogleNet / Inception faster #2166

Closed · fmassa opened this issue Apr 30, 2020 · 3 comments · Fixed by #2170

Comments

fmassa (Member) commented Apr 30, 2020

There have been several reports from users that GoogleNet and Inception are very slow to construct; see #1797, #1977, and #2145 for example.

The underlying issue is that these models use scipy.truncnorm, whose implementation was recently updated and became 100x slower than it was before; see scipy/scipy#11299 for reference. This slowdown has been fixed in scipy and will be present in the 1.5.0 release, but in the meantime, users of torchvision still face very long startup times.

I think the simplest alternative is to make init_weights default to False, and use a weight initialization from PyTorch instead. This is BC-breaking for users who want to train Inception from scratch, but I'm not sure how much it will affect users in general.
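For illustration, one possible scipy-free replacement for the initialization loop, assuming a PyTorch version that ships nn.init.trunc_normal_ (the function name fast_init and the fallback stddev of 0.1 are just for the sketch; older PyTorch releases would need a different initializer):

import torch.nn as nn

def fast_init(m):
    # Sketch only: mimics truncnorm(-2, 2, scale=stddev) with pure PyTorch.
    if isinstance(m, (nn.Conv2d, nn.Linear)):
        stddev = getattr(m, 'stddev', 0.1)
        # nn.init.trunc_normal_ takes absolute cut-offs, hence +/- 2 * stddev
        nn.init.trunc_normal_(m.weight, mean=0.0, std=stddev,
                              a=-2 * stddev, b=2 * stddev)
    elif isinstance(m, nn.BatchNorm2d):
        nn.init.constant_(m.weight, 1)
        nn.init.constant_(m.bias, 0)

Making init_weights default to False would then simply skip the slow scipy path and leave the layers with PyTorch's standard default initialization.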

bisakhmondal (Contributor) commented

Thanks for opening a separate issue for this. As you said, I was looking into the scipy issue and trying to understand the behaviour of

if init_weights:
    for m in self.modules():
        if isinstance(m, nn.Conv2d) or isinstance(m, nn.Linear):
            import scipy.stats as stats
            stddev = m.stddev if hasattr(m, 'stddev') else 0.1
            X = stats.truncnorm(-2, 2, scale=stddev)
            values = torch.as_tensor(X.rvs(m.weight.numel()), dtype=m.weight.dtype)
            values = values.view(m.weight.size())
            with torch.no_grad():
                m.weight.copy_(values)
        elif isinstance(m, nn.BatchNorm2d):
            nn.init.constant_(m.weight, 1)
            nn.init.constant_(m.bias, 0)
I think weight initialization with the PyTorch nn.init module can significantly improve the situation. I have tested with

modules = [
    nn.Conv2d(3, 512, 2),
    nn.BatchNorm2d(512),
    nn.Linear(512, 3),
] * 17  # the three layers repeated 17 times

For weight initialization, the current implementation takes around 1.21 sec, whereas the nn.init.uniform_ API takes 0.0048 sec. I think it is a far better option until the next scipy release.

import torch.nn as nn

model = nn.Sequential(*modules)

def weight_init(m):
    # Fast, scipy-free initialization using the nn.init API
    if isinstance(m, nn.Conv2d) or isinstance(m, nn.Linear):
        nn.init.uniform_(m.weight, -2, 2)
    elif isinstance(m, nn.BatchNorm2d):
        nn.init.constant_(m.weight, 1)
        nn.init.constant_(m.bias, 0)

model.apply(weight_init)
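For reference, a rough sketch of how such a comparison could be timed, reusing model and weight_init from the snippet above (truncnorm_init here is a stand-in for the scipy-based loop quoted earlier, and the exact numbers will vary by machine):

import time
import scipy.stats as stats
import torch

def truncnorm_init(m):
    # Stand-in for the current scipy-based initialization of Conv2d/Linear weights
    if isinstance(m, (nn.Conv2d, nn.Linear)):
        X = stats.truncnorm(-2, 2, scale=0.1)
        values = torch.as_tensor(X.rvs(m.weight.numel()), dtype=m.weight.dtype)
        with torch.no_grad():
            m.weight.copy_(values.view(m.weight.size()))

start = time.time()
model.apply(truncnorm_init)
print('scipy truncnorm:', time.time() - start)

start = time.time()
model.apply(weight_init)
print('nn.init.uniform_:', time.time() - start)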

Would you mind if I worked on it?

fmassa (Member, Author) commented Apr 30, 2020

Hi,

Sure, it would be great if you could work on it.

I think that we should still keep the old behavior if the user wants, and raise a warning to make users aware of it.

One option would be to change the default value of init_weights to be None, and raise a warning if it's None (forcing the users to be aware of it until they explicitly set the value to either True or False).
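A minimal sketch of what that could look like (the warning text, the FutureWarning category, and the stripped-down class below are placeholders for illustration, not the final change):

import warnings
import torch.nn as nn

class GoogLeNet(nn.Module):
    # Heavily stripped-down constructor, for illustration only.
    def __init__(self, num_classes=1000, init_weights=None):
        super().__init__()
        if init_weights is None:
            warnings.warn(
                "The default weight initialization of GoogleNet will be changed "
                "in future releases. If you wish to keep the old (slow) behavior, "
                "please set init_weights=True explicitly.",
                FutureWarning,
            )
            init_weights = True
        self.fc = nn.Linear(1024, num_classes)  # placeholder for the real layers
        if init_weights:
            self._initialize_weights()  # the existing scipy-based routine

    def _initialize_weights(self):
        for m in self.modules():
            if isinstance(m, (nn.Conv2d, nn.Linear)):
                nn.init.normal_(m.weight, 0, 0.01)  # stand-in for the truncnorm path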

bisakhmondal (Contributor) commented

Hi,

One option would be to change the default value of init_weights to be None, and raise a warning if it's None (forcing the users to be aware of it until they explicitly set the value to either True or False).

Sure. That would be better.
Thanks.
