mismatch with torchvision resnets #228

Closed · CarloLucibello opened this issue Apr 24, 2023 · 1 comment · Fixed by #229

CarloLucibello (Member) commented Apr 24, 2023

I'm using the script
https://github.com/FluxML/Metalhead.jl/blob/master/scripts/port_torchvision.jl
to load torchvision's models and copy their weights into Metalhead's ones.

With the vggX models everything works fine.

With the resnets, however, I get the following mismatches (a sketch of the size check involved follows the list):

  • ResNet18:
    flux_key = "model.layers[1].layers[3].layers[1].layers[1].layers[1].conv_weight"
    size(flux_param) = (1, 1, 64, 128)
    pytorch_key = "layer2.0.conv1.weight"
    size(pytorch_param) = (3, 3, 64, 128)
    
  • ResNet34:
    flux_key = "model.layers[1].layers[3].layers[1].layers[1].layers[1].conv_weight"
    size(flux_param) = (1, 1, 64, 128)
    pytorch_key = "layer2.0.conv1.weight"
    size(pytorch_param) = (3, 3, 64, 128)
    
  • ResNet50:
    flux_key = "model.layers[1].layers[2].layers[1].layers[1].layers[1].conv_weight"
    size(flux_param) = (1, 1, 64, 256)
    pytorch_key = "layer1.0.conv1.weight"
    size(pytorch_param) = (1, 1, 64, 64)
    
  • ResNet101:
    flux_key = "model.layers[1].layers[2].layers[1].layers[1].layers[1].conv_weight"
    size(flux_param) = (1, 1, 64, 256)
    pytorch_key = "layer1.0.conv1.weight"
    size(pytorch_param) = (1, 1, 64, 64)
    
  • ResNet152:
    flux_key = "model.layers[1].layers[2].layers[1].layers[1].layers[1].conv_weight"
    size(flux_param) = (1, 1, 64, 256)
    pytorch_key = "layer1.0.conv1.weight"
    size(pytorch_param) = (1, 1, 64, 64)
    
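For context, a minimal sketch (in Julia) of the kind of size check that produces these reports; the helper name is hypothetical, not the port script's actual API, and it assumes the usual layout difference between the frameworks: Flux stores conv weights as (kW, kH, in, out) while PyTorch uses (out, in, kH, kW), so the dimensions are reversed before comparing.

    # Hypothetical sketch: copy one PyTorch tensor into the matching Flux array.
    # Reversing the dimension order maps PyTorch's (out, in, kH, kW) layout
    # onto Flux's (kW, kH, in, out) layout before the sizes are compared.
    function copy_weight!(flux_param, pytorch_param, flux_key, pytorch_key)
        permuted = permutedims(pytorch_param, ndims(pytorch_param):-1:1)
        size(flux_param) == size(permuted) ||
            error("mismatch: $flux_key $(size(flux_param)) vs $pytorch_key $(size(permuted))")
        copyto!(flux_param, permuted)
    end

Note that the permuted torchvision sizes above already look sensible (a 3×3 conv shows up as (3, 3, 64, 128)), which suggests the failures are in the pairing of keys rather than in the permutation itself: a Flux key for a 1×1 shortcut conv is being matched against a different torchvision conv.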

@theabhirath (Member) commented:

It's because Parallel gives the layers in a different order than the one in torchvision. Enumerating over reverse(node.layers) at

    for (i, n) in enumerate(node.layers)

should fix this.
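
A minimal sketch of the suggested change; the surrounding walker and the visit call are hypothetical paraphrases, not the script's exact code:

    # Before the fix the walker enumerated node.layers directly. Reversing
    # the children makes Flux's Parallel branch order (e.g. shortcut vs.
    # main path) line up with torchvision's module order.
    for (i, n) in enumerate(reverse(node.layers))
        visit(i, n)  # hypothetical recursion into each child node
    end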
