
size mismatch for transformer_decoder.layers.0.0.fn.fn.to_q.weight: copying a param with shape torch.Size([512, 32]) from checkpoint, the shape in current model is torch.Size([64, 32]). #27

Open
stiwfmjX opened this issue Aug 2, 2022 · 12 comments

stiwfmjX commented Aug 2, 2022

I trained a model on the LEVIR dataset, and testing it raises this error:
Traceback (most recent call last):
File "eval_cd.py", line 58, in
main()
File "eval_cd.py", line 54, in main
model.eval_models(checkpoint_name=args.checkpoint_name)
File "/tmp/pycharm_project_668/models/evaluator.py", line 158, in eval_models
self._load_checkpoint(checkpoint_name)
File "/tmp/pycharm_project_668/models/evaluator.py", line 70, in _load_checkpoint
self.net_G.load_state_dict(checkpoint['model_G_state_dict'])
File "/root/miniconda3/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1497, in load_state_dict
raise RuntimeError('Error(s) in loading state_dict for {}:\n\t{}'.format(
RuntimeError: Error(s) in loading state_dict for BASE_Transformer:
size mismatch for transformer_decoder.layers.0.0.fn.fn.to_q.weight: copying a param with shape torch.Size([512, 32]) from checkpoint, the shape in current model is torch.Size([64, 32]).
size mismatch for transformer_decoder.layers.0.0.fn.fn.to_k.weight: copying a param with shape torch.Size([512, 32]) from checkpoint, the shape in current model is torch.Size([64, 32]).
size mismatch for transformer_decoder.layers.0.0.fn.fn.to_v.weight: copying a param with shape torch.Size([512, 32]) from checkpoint, the shape in current model is torch.Size([64, 32]).
size mismatch for transformer_decoder.layers.0.0.fn.fn.to_out.0.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([32, 64]).
(identical size mismatches for to_q.weight, to_k.weight, to_v.weight, and to_out.0.weight are reported for transformer_decoder.layers.1 through layers.7)
The dataset images are 256×256, and both training and testing followed the steps in the README. Does anyone know how to fix this?
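
A quick way to narrow this down is to load the checkpoint directly and print the shapes it was saved with. A minimal sketch, assuming the key 'model_G_state_dict' from the traceback above; 'best_ckpt.pt' is a placeholder for whatever checkpoint file you pass to eval_cd.py:

```python
import torch

# Load the checkpoint on CPU; 'model_G_state_dict' is the key used by
# evaluator.py in the traceback above. 'best_ckpt.pt' is a placeholder.
checkpoint = torch.load('best_ckpt.pt', map_location='cpu')
state_dict = checkpoint['model_G_state_dict']

# Print the shapes of the decoder attention projections that fail to load.
for name, param in state_dict.items():
    if 'transformer_decoder' in name and 'to_q' in name:
        print(name, tuple(param.shape))
```

Since nn.Linear stores its weight as [out_features, in_features], a to_q weight of [512, 32] means the attention inner dimension (heads × dim_head) was 512 when the checkpoint was saved, while the network built at test time uses 64; the two scripts are constructing different architectures.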

autumoon commented Sep 2, 2022

After a lot of debugging and testing, it took me a whole day to find the cause... Since this may be a problem the author left in deliberately, I won't post the solution here for now. If you need it, contact me privately: 9506#163.com


stiwfmjX commented Sep 5, 2022

> After a lot of debugging and testing, it took me a whole day to find the cause... Since this may be a problem the author left in deliberately, I won't post the solution here for now. If you need it, contact me privately: 9506#163.com

I've contacted you!


autumoon commented Sep 5, 2022

Replied; please check your email.

@ChengxiHAN

> Replied; please check your email.

Hello, my QQ is 1121399040. I'd like to ask you about the visualization.

@wuwuevan

> Replied; please check your email.

Could you add me as well? My QQ is 1261311737.


autumoon commented Oct 11, 2022 via email

@sun321123

The author changed the model used for testing; just make the test model the same as the training model and it works. I fiddled with it for ages thinking the model itself was broken, and only found the problem after printing out both the saved model and the current model.
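
That diagnosis matches the error: the checkpoint and the freshly built test network disagree on the decoder attention shapes. A minimal sketch of the "print both models and compare" approach described above; the function name and usage comment are illustrative, not from the repo:

```python
import torch
from torch import nn

def diff_shapes(checkpoint_path: str, model: nn.Module) -> None:
    """Print every parameter whose shape differs between a saved
    checkpoint and the model currently built by the eval script."""
    checkpoint = torch.load(checkpoint_path, map_location='cpu')
    saved_state = checkpoint['model_G_state_dict']  # key from the traceback
    current_state = model.state_dict()
    for name, saved in saved_state.items():
        current = current_state.get(name)
        if current is None:
            print(f'{name}: missing from the current model')
        elif saved.shape != current.shape:
            print(f'{name}: checkpoint {tuple(saved.shape)} '
                  f'vs model {tuple(current.shape)}')

# Illustrative usage inside the eval script, before load_state_dict:
#   diff_shapes(path_to_checkpoint, self.net_G)
```

Once the mismatched layers are visible, the fix is the one described above: construct the network in eval_cd.py with exactly the same architecture arguments used during training (e.g. the same --net_G value, assuming your scripts select the architecture that way).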

@stiwfmjX

> The author changed the model used for testing; just make the test model the same as the training model and it works. I fiddled with it for ages thinking the model itself was broken, and only found the problem after printing out both the saved model and the current model.

Thanks a lot! The problem is solved now.

@autumoon

> The author changed the model used for testing; just make the test model the same as the training model and it works. I fiddled with it for ages thinking the model itself was broken, and only found the problem after printing out both the saved model and the current model.

Yes, that's exactly the problem. It took me ages to figure out too...


zdw9915 commented Nov 26, 2022

> The author changed the model used for testing; just make the test model the same as the training model and it works. I fiddled with it for ages thinking the model itself was broken, and only found the problem after printing out both the saved model and the current model.

Thank you very much! The problem is solved.

@dpt000121

(image attachment)

@shizizuoing

> The author changed the model used for testing; just make the test model the same as the training model and it works. I fiddled with it for ages thinking the model itself was broken, and only found the problem after printing out both the saved model and the current model.
> Thank you very much! The problem is solved.

Could you share your hyperparameters? My test results on LEVIR-CD come out 5-10 points higher than the paper's. Looking forward to your reply.
