
Some comments about code of PartialConv2d #28

Open · shaibagon opened this issue Nov 2, 2020 · 4 comments

shaibagon commented Nov 2, 2020

Hi,
I really like your work - thank you for sharing the code.

I have some remarks on the code of PartialConv2d:

1. Using buffers instead of simple "member tensors":

The class attribute self.weight_maskUpdater is defined as a "plain" tensor:

if self.multi_channel:
    self.weight_maskUpdater = torch.ones(self.out_channels, self.in_channels, self.kernel_size[0], self.kernel_size[1])
else:
    self.weight_maskUpdater = torch.ones(1, 1, self.kernel_size[0], self.kernel_size[1])

As a result, when the model is transferred to a GPU, or its data type changes, you need to explicitly check for this and convert the tensor:

if self.weight_maskUpdater.type() != input.type():
    self.weight_maskUpdater = self.weight_maskUpdater.to(input)

A more elegant way is to use non-persistent buffers:

        if self.multi_channel:
            self.register_buffer(name='weight_maskUpdater', persistent=False,
                                 tensor=torch.ones(self.out_channels, self.in_channels,
                                                   self.kernel_size[0], self.kernel_size[1]))
        else:
            self.register_buffer(name='weight_maskUpdater', persistent=False,
                                 tensor=torch.ones(1, 1, self.kernel_size[0], self.kernel_size[1]))

This way self.weight_maskUpdater is moved by any .to(...) / .cuda() / .cpu() invoked on the model hosting this layer, making the condition on line 49 redundant.
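For illustration, here is a minimal standalone sketch (the MaskUpdaterDemo class is made up for this example, it is not part of the repo; the persistent flag requires PyTorch >= 1.5) showing that a non-persistent buffer follows the hosting module across dtypes/devices and stays out of the state_dict:

    import torch
    import torch.nn as nn

    class MaskUpdaterDemo(nn.Module):
        def __init__(self):
            super().__init__()
            # Non-persistent buffer: follows .to()/.cuda()/.cpu() on the module,
            # but is excluded from the state_dict.
            self.register_buffer('weight_maskUpdater', torch.ones(1, 1, 3, 3), persistent=False)

    m = MaskUpdaterDemo()
    m.to(torch.float64)  # the buffer's dtype follows the module
    assert m.weight_maskUpdater.dtype == torch.float64
    assert 'weight_maskUpdater' not in m.state_dict()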

2. Use of torch.ones_like

Instead of torch.ones(...).to(...):

if self.multi_channel:
    mask = torch.ones(input.data.shape[0], input.data.shape[1], input.data.shape[2], input.data.shape[3]).to(input)
else:
    mask = torch.ones(1, 1, input.data.shape[2], input.data.shape[3]).to(input)

You can use torch.ones_like, which is simpler and easier to read:

                    if self.multi_channel:
                        mask = torch.ones_like(input)
                    else:
                        mask = torch.ones_like(input[:1, :1, ...])
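As a quick sanity check (standalone snippet, assuming a 4D NCHW input), both forms agree on shape, dtype and device:

    import torch

    input = torch.randn(2, 3, 8, 8, dtype=torch.float64)

    # ones_like inherits shape, dtype and device from its argument.
    mask = torch.ones_like(input[:1, :1, ...])
    assert mask.shape == (1, 1, 8, 8)
    assert mask.dtype == input.dtype and mask.device == input.device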

3. No need to import Variable

You import Variable, but never use it.

from torch.autograd import Variable

BTW, all these comments apply to the code of PartialConv3d as well.

@liuguilin1225 (Contributor)

Thanks for your great suggestions. Will update them accordingly after testing.

ivanstepanovftw commented Mar 3, 2024

torch.ones_like(input[:1, :1, ...])

Does input[:1, :1, ...] take a slice of the tensor without making a copy?

@shaibagon (Author)

@ivanstepanovftw I don't think it makes a copy. However, since it is only used as an argument for torch.ones_like, the values of the tensor are not used at all; only the shape, dtype and device matter.
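A quick standalone check (not from the repo) that basic slicing returns a view sharing the original storage:

    import torch

    x = torch.randn(2, 3, 8, 8)
    view = x[:1, :1, ...]

    # Basic slicing returns a view: both tensors start at the same memory address.
    assert view.data_ptr() == x.data_ptr()

    # Writes through the view are visible in the original tensor.
    view.zero_()
    assert (x[0, 0] == 0).all()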

@ivanstepanovftw

Also, I think this operation is useless:

output = torch.mul(output, self.update_mask)

because the input will be multiplied by the mask anyway here:
raw_out = super(PartialConv2d, self).forward(torch.mul(input, mask) if mask_in is not None else input)
