load_state_dict does not return the model #1503

konstantinklemmer · 2023-07-31T22:51:38Z

Fixed an error in the state dict loading of the tutorial

adamjstewart · 2023-08-01T02:25:29Z

Not sure why tests are failing but it's clearly unrelated to this PR. Will try to investigate in a separate PR.

adamjstewart · 2023-09-07T17:07:28Z

Fixed an error in the state dict loading of the turorial

What's the error?

calebrob6 · 2023-09-25T15:49:54Z

@konstantinklemmer ping

adamjstewart · 2023-09-25T16:17:51Z

P.S. I fixed the failing test, tests should pass after updating your branch. I'm just curious why this PR is needed since it doesn't fail during testing.

docs/tutorials/pretrained_weights.ipynb

adamjstewart · 2023-12-18T10:13:38Z

It's still unclear to me the purpose of this PR. What is the bug it is trying to solve?

calebrob6 · 2023-12-18T16:57:44Z

There isn't a bug, this is just an example of how to load pre-trained models differently.

adamjstewart · 2023-12-19T18:43:27Z

That's not what the PR descriptions says...

isaaccorley · 2024-01-03T18:10:51Z

The error that's being fixed is that model.load_state_dict() returns either None or a list of incompatible keys. Our example incorrectly does model = model.load_state_dict which overrides the model itself to None or a list.

isaaccorley · 2024-01-03T18:11:18Z

This looks okay to me

Fixed an error in the state dict loading of the turorial and added a comment on the num_classes parameter when creating timm models.

adamjstewart · 2024-01-07T10:49:06Z

Thanks @isaaccorley, I understand the error now.

It looks like we actually make this same mistake in the README. And we define our own custom torchgeo.trainers.utils.load_state_dict that behaves differently from the builtin one. We should fix all of these at the same time.

I'm not sure about the num_classes comment. We aren't trying to teach people how to use timm, just how to use TorchGeo. I don't disagree that it's useful, just that it's in the wrong place.

@konstantinklemmer let me know if you want me to make these changes myself. If I don't hear back I'll assume this PR has been abandoned and take over.

konstantinklemmer · 2024-01-15T04:14:26Z

Sorry, no idea why I am not getting notifications for this PR...

Yes, I am happy to remove that comment and the example with num_classes=0. Does that sound alright? Basically just change model = model.load_state_dict() to model.load_state_dict().

adamjstewart · 2024-01-15T08:12:25Z

Yes, and update the README and our builtin load_state_dict as well. Let me know if you want help with the latter.

konstantinklemmer · 2024-01-15T17:48:21Z

Yes please, I am not sure how to tackle the builtin load_state_dict problem (or what it even is exactly).

adamjstewart · 2024-01-16T12:24:24Z

or what it even is exactly

It's not really a problem per se, just that we define a wrapper around load_state_dict that returns the model instead of returning hits/misses like the builtin PyTorch method. It's kind of confusing to have different behavior, so I think we should match the behavior of the builtin method. Luckily, this method isn't public facing, so this change won't be backwards incompatible.

konstantinklemmer · 2024-01-17T18:34:22Z

Okay! So there are two functions that I found that, if I understand correctly, are relevant:

torchgeo/torchgeo/trainers/utils.py

Line 74 in 436baa9

def load_state_dict(model: Module, state_dict: "OrderedDict[str, Tensor]") -> Module:
torchgeo/tests/trainers/test_utils.py

Line 42 in 436baa9

def test_load_state_dict(checkpoint: str, model: Module) -> None:

Is that correct?

In both cases, if we want to keep them as standalone functions that return a model with weights loaded, we should probably rename them?

adamjstewart · 2024-01-18T12:14:00Z

torchgeo.trainers.utils.load_state_dict is the one you would change. The test_load_state_dict is just to test the method, the inputs don't matter.

We can either A) change the name, or B) change the return value to match torch.nn.Module.load_state_dict. If the latter works, I would actually prefer the latter.

konstantinklemmer · 2024-01-21T01:20:36Z

Well, load_state_dict does not have a return value, no? So for B) we'd just need to get rid of the return call, i.e. this line here (unless I misunderstand): https://github.com/microsoft/torchgeo/blob/436baa9773977c789152854ac7b4eff90e0d9e95/torchgeo/trainers/utils.py#L119C5-L119C17

Then load_state_dict(model, ckpt['state_dict']) should be equivalent to model.load_state_dict(ckpt['state_dict']).

adamjstewart · 2024-01-21T08:06:18Z

The builtin load_state_dict returns missing and unexpected keys: https://pytorch.org/docs/stable/generated/torch.nn.Module.html#torch.nn.Module.load_state_dict

So you'll just return the output of that call so that our wrapper matches.

konstantinklemmer · 2024-01-22T14:33:22Z

Okay after some more digging, the nn.Module.load_state_dict returns an object _IncompatibleKeys (https://github.com/pytorch/pytorch/blob/c3780010a58a84920335296ee5f091a0db18259f/torch/nn/modules/module.py#L29). This object uses missing and incompatible keys. However, when you check the load_state_dict and _load_from_state_dict (https://github.com/pytorch/pytorch/blob/c3780010a58a84920335296ee5f091a0db18259f/torch/nn/modules/module.py#L1953) functions, it seems that both will be empty dicts:

missing_keys: List[str] = []
unexpected_keys: List[str] = []

unless the strict=True flag is on. The torchgeo.trainers.utils.load_state_dict does not use that flag. So should I just return _IncompatibleKeys with those empty dicts?

adamjstewart · 2024-01-22T14:56:34Z

Let's just return the output of nn.Module.load_state_dict. For type hints, try:

-> Tuple[List[str], List[str]]

I think that will work, and be correct. The builtin method has no return type annotations so it shouldn't complain that we aren't using a NamedTuple. If you have any trouble with mypy, let me know and I can hack on it.

konstantinklemmer · 2024-01-22T14:59:31Z

Got it! Will try if that works and report back.

* Import Tuple from typing * Change return of `load_state_dict` from `model` to `Tuple[List[str], List[str]]`, matching the return of the standard PyTorch builtin function.

Remove example of loading pretrained model without prediction head (`num_classes=0`).

Adapt new `load_state_dict` function.

konstantinklemmer · 2024-01-30T16:19:57Z

Ok I think I updated all files (the README, the notebook, the utils.py) but tests are failing.

konstantinklemmer · 2024-01-31T17:10:56Z

Yay! Thanks for helping with this - seems though that the original problem with the notebook test still persists?

adamjstewart · 2024-01-31T17:31:39Z

Completely unrelated problem, fixed in #1838

* Update pretrained_weights.ipynb Fixed an error in the state dict loading of the turorial and added a comment on the num_classes parameter when creating timm models. * Update docs/tutorials/pretrained_weights.ipynb * Update utils.py * Import Tuple from typing * Change return of `load_state_dict` from `model` to `Tuple[List[str], List[str]]`, matching the return of the standard PyTorch builtin function. * Update pretrained_weights.ipynb Remove example of loading pretrained model without prediction head (`num_classes=0`). * Update README.md Adapt new `load_state_dict` function. * Mimic return type of builtin load_state_dict * Modern type hints * Blacken * Try being explicit --------- Co-authored-by: Caleb Robinson <calebrob6@gmail.com> Co-authored-by: Adam J. Stewart <ajstewart426@gmail.com>

github-actions bot added the documentation Improvements or additions to documentation label Jul 31, 2023

adamjstewart requested a review from isaaccorley August 1, 2023 02:20

adamjstewart added this to the 0.4.2 milestone Aug 1, 2023

adamjstewart removed this from the 0.4.2 milestone Sep 28, 2023

calebrob6 force-pushed the patch-1 branch from 4de023c to 1e19d7d Compare December 2, 2023 06:03

calebrob6 previously approved these changes Dec 18, 2023

View reviewed changes

calebrob6 reviewed Dec 18, 2023

View reviewed changes

docs/tutorials/pretrained_weights.ipynb Outdated Show resolved Hide resolved

calebrob6 dismissed their stale review via 31709ac December 18, 2023 05:31

calebrob6 force-pushed the patch-1 branch from 31709ac to d9e08be Compare December 18, 2023 05:31

clkruse mentioned this pull request Jan 3, 2024

pretrained_weights example will not work with the timm model #1643

Closed

konstantinklemmer and others added 2 commits January 3, 2024 12:11

Update pretrained_weights.ipynb

7cd3f61

Fixed an error in the state dict loading of the turorial and added a comment on the num_classes parameter when creating timm models.

Update docs/tutorials/pretrained_weights.ipynb

1f976cf

isaaccorley force-pushed the patch-1 branch from d9e08be to 1f976cf Compare January 3, 2024 18:11

konstantinklemmer added 2 commits January 30, 2024 10:43

Merge branch 'microsoft:main' into patch-1

46a8756

Update utils.py

392b305

* Import Tuple from typing * Change return of `load_state_dict` from `model` to `Tuple[List[str], List[str]]`, matching the return of the standard PyTorch builtin function.

github-actions bot added the trainers PyTorch Lightning trainers label Jan 30, 2024

konstantinklemmer added 2 commits January 30, 2024 11:16

Update pretrained_weights.ipynb

46afcd7

Remove example of loading pretrained model without prediction head (`num_classes=0`).

Update README.md

2324488

Adapt new `load_state_dict` function.

Mimic return type of builtin load_state_dict

53885dc

github-actions bot added the testing Continuous integration testing label Jan 31, 2024

adamjstewart added 2 commits January 31, 2024 15:18

Modern type hints

fd19ab1

Blacken

35b914c

adamjstewart added this to the 0.5.2 milestone Jan 31, 2024

adamjstewart changed the title ~~Update pretrained_weights.ipynb~~ load_state_dict does not return the model Jan 31, 2024

Try being explicit

b5a9007

Merge branch 'main' into patch-1

f952bab

adamjstewart approved these changes Feb 6, 2024

View reviewed changes

adamjstewart merged commit 55b3c50 into microsoft:main Feb 6, 2024
20 of 21 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

load_state_dict does not return the model #1503

load_state_dict does not return the model #1503

konstantinklemmer commented Jul 31, 2023 •

edited by adamjstewart

adamjstewart commented Aug 1, 2023

adamjstewart commented Sep 7, 2023

calebrob6 commented Sep 25, 2023

adamjstewart commented Sep 25, 2023

adamjstewart commented Dec 18, 2023

calebrob6 commented Dec 18, 2023

adamjstewart commented Dec 19, 2023

isaaccorley commented Jan 3, 2024

isaaccorley commented Jan 3, 2024

adamjstewart commented Jan 7, 2024

konstantinklemmer commented Jan 15, 2024

adamjstewart commented Jan 15, 2024

konstantinklemmer commented Jan 15, 2024

adamjstewart commented Jan 16, 2024

konstantinklemmer commented Jan 17, 2024

adamjstewart commented Jan 18, 2024

konstantinklemmer commented Jan 21, 2024 •

edited

adamjstewart commented Jan 21, 2024 •

edited

konstantinklemmer commented Jan 22, 2024

adamjstewart commented Jan 22, 2024

konstantinklemmer commented Jan 22, 2024

konstantinklemmer commented Jan 30, 2024

konstantinklemmer commented Jan 31, 2024

adamjstewart commented Jan 31, 2024

load_state_dict does not return the model #1503

load_state_dict does not return the model #1503

Conversation

konstantinklemmer commented Jul 31, 2023 • edited by adamjstewart

adamjstewart commented Aug 1, 2023

adamjstewart commented Sep 7, 2023

calebrob6 commented Sep 25, 2023

adamjstewart commented Sep 25, 2023

adamjstewart commented Dec 18, 2023

calebrob6 commented Dec 18, 2023

adamjstewart commented Dec 19, 2023

isaaccorley commented Jan 3, 2024

isaaccorley commented Jan 3, 2024

adamjstewart commented Jan 7, 2024

konstantinklemmer commented Jan 15, 2024

adamjstewart commented Jan 15, 2024

konstantinklemmer commented Jan 15, 2024

adamjstewart commented Jan 16, 2024

konstantinklemmer commented Jan 17, 2024

adamjstewart commented Jan 18, 2024

konstantinklemmer commented Jan 21, 2024 • edited

adamjstewart commented Jan 21, 2024 • edited

konstantinklemmer commented Jan 22, 2024

adamjstewart commented Jan 22, 2024

konstantinklemmer commented Jan 22, 2024

konstantinklemmer commented Jan 30, 2024

konstantinklemmer commented Jan 31, 2024

adamjstewart commented Jan 31, 2024

konstantinklemmer commented Jul 31, 2023 •

edited by adamjstewart

konstantinklemmer commented Jan 21, 2024 •

edited

adamjstewart commented Jan 21, 2024 •

edited