[feature]requesting for pretrained weights #51

jianyin2016 · 2020-04-26T03:13:36Z

Is your feature request related to a problem? Please describe.

I would like to train a CNN classifier on my custom data using widely used models such as the ResNet series. I found it useful to initialize the model with ImageNet-pretrained weights, and this is easy to do with the torch::load API when my dataset has 3 image channels, the same as ImageNet, in which case the conv1 layer needs no change.
Things are different when I try to train on grayscale images, because the pretrained conv1 weights expect in_channels=3. In the Python frontend, I guess this can be solved by immediately replacing model.conv1 like this:

model.conv1 = nn.Conv2d(in_channels=1, out_channels=64, kernel_size=7, stride=2, padding=3, bias=False)

But in the C++ frontend, replacing the layer does not seem to work:

model->conv1 = torch::nn::Conv2d(torch::nn::Conv2dOptions(in_channels, 64, 7).stride(2).padding(3).bias(false).dilation(1));

Describe the solution you'd like

The correct API usage for loading the pretrained weights in this case.

Describe alternatives you've considered

Maybe a model pretrained on a grayscale image dataset would bypass the problem.

Additional context

An exception occurs during the model's forward pass.


mfl28 · 2020-04-26T07:43:23Z

Hi @jianyin2016 !
I think the easiest way to achieve what you want to do - if I understand correctly - is:

Create your model as you want it in Python:

model = torchvision.models.resnet...(pretrained=True)
model.conv1 = nn.Conv2d(in_channels=1, out_channels=64, kernel_size=7, stride=2, padding=3, bias=False)

Save it as a ScriptModule:

example = torch.rand(1, 1, 224, 224)
traced_script_module = torch.jit.trace(model, example)
traced_script_module.save("my_resnet.pt")

Load it in C++ via auto model = torch::jit::load("path/to/my_resnet.pt")

There is a nice and much more detailed description of this process in the official PyTorch tutorials:
Loading a TorchScript Model in C++
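
The steps above can be sketched end to end in Python. This is only a rough illustration under stated assumptions: `TinyNet` is a hypothetical stand-in for a torchvision ResNet so the snippet runs without downloading weights, and the helper names `to_grayscale_input` and `export_traced` are made up for this example. Note that a freshly created `nn.Conv2d` is randomly initialized, so the pretrained conv1 filters are lost by a plain replacement; summing the old filters over the RGB axis, as done here, is one possible way to keep them and is an assumption not discussed in this thread.

```python
import os
import tempfile

import torch
import torch.nn as nn


class TinyNet(nn.Module):
    """Hypothetical stand-in for a torchvision ResNet (same conv1 layout)."""

    def __init__(self, num_classes: int = 10):
        super().__init__()
        self.conv1 = nn.Conv2d(3, 64, kernel_size=7, stride=2, padding=3, bias=False)
        self.pool = nn.AdaptiveAvgPool2d(1)
        self.fc = nn.Linear(64, num_classes)

    def forward(self, x):
        x = torch.relu(self.conv1(x))
        x = torch.flatten(self.pool(x), 1)
        return self.fc(x)


def to_grayscale_input(model: nn.Module) -> nn.Module:
    """Swap conv1 for a 1-channel layer, reusing the old RGB filters (assumption)."""
    old = model.conv1
    new = nn.Conv2d(1, old.out_channels, kernel_size=old.kernel_size,
                    stride=old.stride, padding=old.padding, bias=False)
    with torch.no_grad():
        # Summing over the input-channel axis makes the new layer respond to a
        # gray image as the old one did to that image copied into R, G and B.
        new.weight.copy_(old.weight.sum(dim=1, keepdim=True))
    model.conv1 = new
    return model


def export_traced(model: nn.Module, path: str, example: torch.Tensor) -> None:
    """Trace the model with a 1-channel example input and save it as TorchScript."""
    model.eval()
    traced = torch.jit.trace(model, example)
    traced.save(path)


model = TinyNet().eval()
gray = torch.rand(1, 1, 64, 64)

# Reference output of the original 3-channel model on the replicated gray image.
with torch.no_grad():
    rgb_out = model(gray.repeat(1, 3, 1, 1))

model = to_grayscale_input(model)
path = os.path.join(tempfile.mkdtemp(), "my_resnet.pt")
export_traced(model, path, gray)

# Reloading in Python mirrors what torch::jit::load does on the C++ side.
reloaded = torch.jit.load(path)
with torch.no_grad():
    gray_out = model(gray)
    reloaded_out = reloaded(gray)
```

With a real torchvision model, `TinyNet()` would be replaced by the pretrained ResNet and the example input by a tensor of the size used for training, such as the `torch.rand(1, 1, 224, 224)` shown above.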

jianyin2016 · 2020-04-26T08:12:59Z

Hi @mfl28, thanks for your nice advice.

I have no doubt that your solution will work, but it seems to bypass the problem rather than solve it. I don't think it is right to use the C++ API while still relying so heavily on the Python side. So I am still asking for a solution entirely within libtorch.

Thanks!

mfl28 · 2020-04-26T09:09:23Z

To my knowledge, it is currently not possible to do this completely within C++, as torch::load() cannot load pickled weight files and there is also no load_state_dict() function for models in libtorch (see this recent issue in the official PyTorch repo). Maybe you can find more information on this in the PyTorch forum or the official repo's issues.

jianyin2016 added the "enhancement" (New feature or request) label Apr 26, 2020

jianyin2016 closed this as completed Apr 27, 2020
