Error Loading a custom trained model or resume training from a check point. #44

abhineet-pandey · 2022-07-14T18:24:38Z

File "/mnt/disk/code/AdelaiDepth/LeReS/Train/lib/utils/net_tools.py", line 44, in load_ckpt
checkpoint_state_dict_noprefix = strip_prefix_if_present(checkpoint['model_state_dict'], "module.")
KeyError: 'model_state_dict'

This happens when resuming training from a checkpoint or when loading a self-trained model.

A similar KeyError occurs when loading a demo-trained model in the Minist_Test scripts (test_depth.py or test_shape.py).

To reproduce the error, train the demo model (..Train/scripts/train_demo.py) and try loading it with the inference code in Minist_Test.

Please let me know if you need more details.
Any help would be much appreciated.

guangkaixu · 2022-07-19T09:30:21Z

@abhineet-pandey Hi, if you would like to load a self-trained model and run inference, you can edit this line:

AdelaiDepth/LeReS/Minist_Test/lib/net_tools.py

Line 40 in c5370f1

    
           depth_model.load_state_dict(strip_prefix_if_present(checkpoint['depth_model'], "module."),

Change checkpoint['depth_model'] to checkpoint['model_state_dict'] and make sure strict=False is passed to load_state_dict (or remove the weights of depth_model.auxi_modules from the checkpoint).

Resuming training from self-trained weights works without errors for me. Thanks for your interest, and feel free to comment if you still run into problems.
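
The change described above can be sketched roughly as follows. This is a minimal illustration, not the repository's code: `strip_prefix` is a hypothetical stand-in for `strip_prefix_if_present`, `extract_state_dict` is a made-up helper, and the toy checkpoint only mimics the layout the thread implies (released weights under 'depth_model', self-trained weights under 'model_state_dict').

```python
from collections import OrderedDict


def strip_prefix(state_dict, prefix="module."):
    """Hypothetical stand-in for the repo's strip_prefix_if_present helper."""
    return OrderedDict(
        (key[len(prefix):] if key.startswith(prefix) else key, value)
        for key, value in state_dict.items()
    )


def extract_state_dict(checkpoint, candidate_keys=("model_state_dict", "depth_model")):
    """Return the weights stored under the first candidate key that exists."""
    for key in candidate_keys:
        if key in checkpoint:
            return strip_prefix(checkpoint[key])
    raise KeyError(f"none of {candidate_keys} found; checkpoint has {list(checkpoint)}")


# Toy checkpoint with plain floats instead of tensors; a real one would
# come from torch.load(path, map_location="cpu").
checkpoint = {
    "model_state_dict": {
        "module.encoder.weight": 1.0,
        "module.auxi_modules.bias": 2.0,
    }
}
state_dict = extract_state_dict(checkpoint)
print(list(state_dict))

# With a real model, the load call would then be:
# depth_model.load_state_dict(state_dict, strict=False)
```

With strict=False, PyTorch's load_state_dict skips keys that are missing from or unexpected in the model rather than raising, and returns both lists so they can be inspected.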

abhineet-pandey · 2022-07-26T21:47:37Z

Thanks a lot.

777-en · 2022-12-28T13:19:38Z


Hi! Thank you for your great work. I ran into the same problem. Could you explain "ensure strict=False (or remove the weights of depth_model.auxi_modules)" in more detail? What exactly should I do? @guangkaixu @abhineet-pandey
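
For reference, one possible reading of the "remove the weights" option is to filter the auxiliary entries out of the state dict before loading. The sketch below is only an illustration under assumptions: the helper `drop_prefixed_keys` is hypothetical, and it assumes the auxiliary weights show up under keys starting with "auxi_modules." once the "module." prefix has been stripped; the real key names should be checked by printing the checkpoint's keys.

```python
def drop_prefixed_keys(state_dict, prefix="auxi_modules."):
    """Return a copy of state_dict without the entries whose key starts with prefix."""
    return {
        key: value
        for key, value in state_dict.items()
        if not key.startswith(prefix)
    }


# Toy state dict with plain floats instead of tensors (assumed key names).
state_dict = {
    "encoder.weight": 1.0,
    "auxi_modules.head.weight": 2.0,
    "auxi_modules.head.bias": 3.0,
}
filtered = drop_prefixed_keys(state_dict)
print(sorted(filtered))

# With a real model, the filtered dict would then be passed on:
# depth_model.load_state_dict(filtered)
```

The other option, passing strict=False to load_state_dict, leaves the checkpoint untouched and simply lets PyTorch ignore keys that do not match the model.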

abhineet-pandey closed this as completed Jul 26, 2022

guangkaixu mentioned this issue Aug 31, 2022

RuntimeError: stack expects a non-empty TensorList #47

Closed
