Mismatch in loading model #21

Closed
munirfarzeen opened this issue Dec 22, 2020 · 11 comments

@munirfarzeen

munirfarzeen commented Dec 22, 2020

Hi,
@jackroos I am trying to run the code using the provided model weights. I used r50_deformable_detr_plus_iterative_bbox_refinement_plus_plus_two_stage-checkpoint.pth and the corresponding config.
When loading the model, it reports a shape mismatch in the transformer.
Is the checkpoint correct?

@azamshoaib

@likui01 @jackroos @daijifeng001 Hi, I am having the same problem. Kindly help me in this regard. Thank you

@jackroos
Member

Could you provide the full command you ran? @likui01 @ayberksener
And are you using the latest code and the latest model?

@munirfarzeen
Author

@jackroos
GPUS_PER_NODE=8 ./tools/run_dist_launch.sh 8 ./config/r50_deformable_detr_plus_iterative_bbox_refinement_plus_plus_two_stage.sh

resuming from the provided checkpoint.

@jackroos
Member

@likui01 What is the detailed error message?

@munirfarzeen
Author

@jackroos
RuntimeError: Error(s) in loading state_dict for DeformableDETR:
size mismatch for class_embed.0.weight: copying a param with shape torch.Size([91, 256]) from checkpoint, the shape in current model is torch.Size([4, 256]).
size mismatch for class_embed.0.bias: copying a param with shape torch.Size([91]) from checkpoint, the shape in current model is torch.Size([4]).
size mismatch for class_embed.1.weight: copying a param with shape torch.Size([91, 256]) from checkpoint, the shape in current model is torch.Size([4, 256]).
size mismatch for class_embed.1.bias: copying a param with shape torch.Size([91]) from checkpoint, the shape in current model is torch.Size([4]).
size mismatch for class_embed.2.weight: copying a param with shape torch.Size([91, 256]) from checkpoint, the shape in current model is torch.Size([4, 256]).
size mismatch for class_embed.2.bias: copying a param with shape torch.Size([91]) from checkpoint, the shape in current model is torch.Size([4]).
size mismatch for class_embed.3.weight: copying a param with shape torch.Size([91, 256]) from checkpoint, the shape in current model is torch.Size([4, 256]).
size mismatch for class_embed.3.bias: copying a param with shape torch.Size([91]) from checkpoint, the shape in current model is torch.Size([4]).
size mismatch for class_embed.4.weight: copying a param with shape torch.Size([91, 256]) from checkpoint, the shape in current model is torch.Size([4, 256]).
size mismatch for class_embed.4.bias: copying a param with shape torch.Size([91]) from checkpoint, the shape in current model is torch.Size([4]).
size mismatch for class_embed.5.weight: copying a param with shape torch.Size([91, 256]) from checkpoint, the shape in current model is torch.Size([4, 256]).
size mismatch for class_embed.5.bias: copying a param with shape torch.Size([91]) from checkpoint, the shape in current model is torch.Size([4]).

@jackroos
Member

Did you change any code? It seems you changed num_classes to 4 here, but it should be 91 (set here).
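
For illustration, here is a minimal sketch (not code from this repository) of why those shapes disagree: the class_embed head is a linear layer whose output dimension is num_classes, so weights saved for 91 COCO classes cannot be strictly copied into a head built for 4 classes.

```python
import torch.nn as nn

# Minimal illustration (not code from this repo): the classification head is a
# linear layer whose output size follows num_classes, so its weight shape
# changes with the dataset.
hidden_dim = 256
coco_head = nn.Linear(hidden_dim, 91)    # head shape in the released COCO checkpoint
custom_head = nn.Linear(hidden_dim, 4)   # head shape in a model built with num_classes=4

print(coco_head.weight.shape)    # torch.Size([91, 256])
print(custom_head.weight.shape)  # torch.Size([4, 256])
# Strictly loading the 91-class weights into the 4-class layer raises exactly
# the size-mismatch error shown above.
```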

@munirfarzeen
Author

I have 4 classes, so I changed it to 4.

@jackroos
Member

jackroos commented Dec 30, 2020

The checkpoint is trained on COCO detection. If you want to train on your custom dataset, you should train it by yourself and you don't need to resume the checkpoint. Thanks!

@1757525671 mentioned this issue Aug 24, 2021 (closed)
@ducvuuit

ducvuuit commented Sep 10, 2021

Hi @likui01 @azamshoaib, has anyone fixed the mismatch problem?

@amirhesamyazdi

> The checkpoint is trained on COCO detection. If you want to train on your custom dataset, you should train it by yourself and you don't need to resume the checkpoint. Thanks!

In DETR you can resume (transfer learn) from any checkpoint and still change the number of classes. That is an obvious requirement for supporting transfer learning. If D-DETR doesn't support that, something is wrong with it. Also, the shape mismatch might not be the only problem here. Could you please follow up on this?

@nwoyecid

nwoyecid commented Mar 2, 2023

You can change line 239 of main.py from:

`missing_keys, unexpected_keys = model_without_ddp.load_state_dict(checkpoint['model'], strict=False)`

to

`missing_keys, unexpected_keys = model_without_ddp.load_state_dict({k: v for k, v in checkpoint['model'].items() if "class_embed" not in k}, strict=False)`
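
Spelled out a bit more, the same idea looks like the sketch below. It assumes the standard training-script setup, where model_without_ddp is the already-built model and the checkpoint stores its weights under the 'model' key; the file name follows this thread, and the variable names are illustrative.

```python
import torch

# Load the released COCO checkpoint (file name taken from this thread).
checkpoint = torch.load(
    'r50_deformable_detr_plus_iterative_bbox_refinement_plus_plus_two_stage-checkpoint.pth',
    map_location='cpu',
)

# Drop the classification head, whose shape depends on num_classes, and load
# everything else non-strictly; the new head keeps its random initialization
# and is trained on the custom dataset.
filtered_state = {
    k: v for k, v in checkpoint['model'].items() if 'class_embed' not in k
}
missing_keys, unexpected_keys = model_without_ddp.load_state_dict(
    filtered_state, strict=False
)
print('missing keys:', missing_keys)  # expect only the class_embed.* entries here
```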
