New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ValueError: Cannot match one checkpoint key to multiple keys in the model. #6
Comments
您好,与其他半监督检测工作相同,resume是为了让模型起初有更好的预测能力。我们是采用全监督训练做warmup,6000次迭代结束后进行半监督训练。 |
感谢您的回复!我理解的是在半监督训练前会有一个burn-in stage(全监督),burn-in阶段随机初始化两个相同的model,我使用命令python3 train_net.py --num-gpus 8 --config configs/voc/voc07_voc12.yaml进行初始化训练(voc07_voc12.yaml中WEIGHTS被注释掉),但是训练6个iter后会报错“FloatingPointError: Predicted boxes or scores contain Inf/NaN. Training has diverged.” 此外在trainer.py文件的def run_step_full_semisup(self):中if....else.....只在else......中定义了record_dict,没在if.....中定义,所以运行时报错“UnboundLocalError: local variable 'record_dict' referenced before assignment” 想向您请教一下
|
全监督的warmup需要resume MSRA/R-50.pkl(实际上就是backbone参数),以保证模型初始的特征提取能力。 |
太感谢您的解答了,我现在还存在一点问题
|
我刚刚上传了一个merge_two_ckpts脚本,可以将两个checkpoint融合起来用于resume,您可以参考下 |
非常感谢您,问题解决了! |
您好,我在使用detectron2进行cross-training时,没有使用resume,直接从头训练,会报错“ValueError: Cannot match one checkpoint key to multiple keys in the model.”
请问cross-training必须resume,您运行命令中的output/model_0005999.pth是怎么得到的呢,期待您的回复
The text was updated successfully, but these errors were encountered: