Optimizer initialization issue in DeepLabv3+ #21

Closed · SunnerLi opened this issue Jul 21, 2018 · 4 comments

@SunnerLi commented Jul 21, 2018

Sorry to bother you!
Recently I tried to use DeepLabv3+ to train a new model.
Thank you very much for providing the model code.
However, the following error occurs:

TypeError: optimizer can only optimize Tensors, but one of the params is NoneType

I think the issue is that the bias term of the first convolution layer is set to False, which is the default setting in the standard ResNet.
However, the parameter collection still yields this bias term into the SGD constructor, so SGD raises an exception because the param is NoneType.
Here is the relevant part of the SGD source:

for param in param_group['params']:
    if not isinstance(param, Variable):
        raise TypeError("optimizer can only optimize Variables, "
                        "but one of the params is " + torch.typename(param))
    if not param.requires_grad:
        raise ValueError("optimizing a parameter that doesn't require gradients")
    if not param.is_leaf:
        raise ValueError("can't optimize a non-leaf Variable")

Let me offer a suggestion at the end!
Maybe we can add a check in train.py to skip bias terms that are None, like the following:

def get_lr_params(model, key):
    # For Dilated FCN
    if key == "1x":
        for m in model.named_modules():
            if "layer" in m[0]:
                if isinstance(m[1], nn.Conv2d):
                    for p in m[1].parameters():
                        yield p
    # For conv weight in the ASPP module
    if key == "10x":
        for m in model.named_modules():
            if "aspp" in m[0]:
                if isinstance(m[1], nn.Conv2d):
                    yield m[1].weight
    # For conv bias in the ASPP module
    if key == "20x":
        for m in model.named_modules():
            if "aspp" in m[0]:
                if isinstance(m[1], nn.Conv2d):
                    if m[1].bias is not None:    # Add this line
                        yield m[1].bias

After this small revision, the code runs normally.
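
For context, train.py consumes these generators roughly as below; the base learning rate, the 10x/20x multipliers, and the momentum here are my assumptions, not necessarily the exact values in the script:

base_lr = 2.5e-4  # assumed base learning rate

# Three param groups with different learning rates, fed by the generators above;
# model is assumed to be the DeepLab network
optimizer = torch.optim.SGD(
    params=[
        {"params": get_lr_params(model, key="1x"), "lr": base_lr},
        {"params": get_lr_params(model, key="10x"), "lr": 10 * base_lr},
        {"params": get_lr_params(model, key="20x"), "lr": 20 * base_lr},
    ],
    lr=base_lr,
    momentum=0.9,
)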

@kazuto1011 (Owner) commented Jul 23, 2018

Thank you for suggesting the revision. As far as I can see from the last snippet, I think the issue is related to the improved ASPP module in v3+ rather than the bias-free conv in ResNet. The ResNet part is yielded in the "1x" scope without causing the NoneType error. The script train.py is made just for parsing and training the params of the v2 model. The reported error is due to the fact that the v3+ ASPP does not have biases, while the v2 one does.
Anyway, I think we need a stricter modification to adapt it to v3/v3+; e.g., the batch norms should also be observed/trained (see the sketch below). I'm sorry, but this codebase does not assume v3/v3+ training for now.
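
(For illustration only, collecting the batch-norm parameters might look like the following hypothetical helper; it is not part of the current codebase:)

import torch.nn as nn

def get_bn_params(model):
    # Hypothetical helper: yield the affine parameters (weight and bias)
    # of every batch-norm layer so they can be trained as well
    for m in model.modules():
        if isinstance(m, nn.BatchNorm2d):
            for p in m.parameters():
                yield p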

@SunnerLi (Author) commented Jul 23, 2018

I think you are right!
Still, I hope the training code for v3/v3+ can be available soon.
Thank you very much for your contribution!

@Ericargus commented

class _ConvBatchNormReLU(nn.Sequential):
    def __init__(
        self,
        in_channels,
        out_channels,
        kernel_size,
        stride,
        padding,
        dilation,
        relu=True,
    ):
        super(_ConvBatchNormReLU, self).__init__()
        self.add_module(
            "conv",
            nn.Conv2d(
                in_channels=in_channels,
                out_channels=out_channels,
                kernel_size=kernel_size,
                stride=stride,
                padding=padding,
                dilation=dilation,
                bias=False,
            ),
        )
I think the problem is in your _ConvBatchNormReLU class, because you set the conv's bias to False.

@kazuto1011 (Owner) commented

Do you mean the _ConvBatchNormReLU in the v3+ ASPP? I have already mentioned this above:

The reported error is due to the fact that the v3+ ASPP does not have biases, while the v2 one does.

The bias-free conv follows the official implementation: a conv immediately followed by batch norm does not need a bias anyway, since the normalization subtracts the per-channel mean and cancels it out (see the sketch below). And the init part here is just for v2.
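
(A quick self-contained check of this point, assuming PyTorch with the batch norm in training mode; the shapes are arbitrary:)

import torch
import torch.nn as nn

x = torch.randn(1, 3, 8, 8)
conv = nn.Conv2d(3, 16, kernel_size=3, padding=1, bias=True)
bn = nn.BatchNorm2d(16)

# In training mode, BN normalizes each channel by the batch mean/variance,
# so a constant per-channel bias added by the conv is subtracted right back out.
with torch.no_grad():
    y_with_bias = bn(conv(x))
    conv.bias.zero_()
    y_without_bias = bn(conv(x))

print(torch.allclose(y_with_bias, y_without_bias, atol=1e-6))  # True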
