
Test fewer models in trainers to avoid exceeding RAM #1377

Merged
merged 5 commits into main from reduce_byol_tests on May 29, 2023

Conversation

calebrob6
Member

We don't need to test the BYOL trainer with every set of pretrained weights (we don't actually need to involve pretrained weights at all).

@github-actions github-actions bot added the testing Continuous integration testing label May 28, 2023
@adamjstewart
Collaborator

From #1376:

We might be able to only load 1 ResNet, but we'll still have to load all of them on the release branch, so we can't just avoid the problem.

We want to make sure our weights actually load in a model correctly. All of them. Every time someone adds a new one.

@calebrob6
Member Author

calebrob6 commented May 28, 2023

We certainly don't need to check the cross product of pretrained weights with every trainer, as that means a lot of duplicate work (e.g. we have several different sets of ResNet50 weights). We need to check that the pretrained weights are valid, and separately check that our trainers work with ResNets, ViTs, etc.
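A back-of-the-envelope sketch of the duplication argument, with made-up counts for illustration only: testing every (trainer, weight) pair grows multiplicatively, while testing weights once and trainers per architecture grows much more slowly.

```python
# Hypothetical counts, for illustration only.
trainers = 4       # e.g. BYOL, classification, regression, segmentation
weights = 10       # pretrained weight sets (several per architecture)
architectures = 3  # e.g. resnet18, resnet50, vit

cross_product = trainers * weights             # every pair: 40 model loads
separate = weights + trainers * architectures  # 10 + 12 = 22 model loads
assert separate < cross_product
```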

We want to make sure our weights actually load in a model correctly.

Right, but the BYOL trainer is not the right place for this. We can check that the weights load in the model correctly with:

from torchgeo.models import get_model, get_model_weights, list_models

# Instantiate every registered model with each of its pretrained weights.
for model_name in list_models():
    for weights in get_model_weights(model_name):
        model = get_model(model_name, weights=weights)

@calebrob6
Member Author

However, I would also say we should switch to strict=True in the model factory methods -- https://github.com/microsoft/torchgeo/blob/main/torchgeo/models/resnet.py#L223
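For context, here is a simplified pure-Python stand-in (not the real `torch.nn.Module.load_state_dict` implementation) showing why `strict=True` matters: mismatched keys raise an error instead of being silently ignored.

```python
# Simplified stand-in for torch.nn.Module.load_state_dict; illustration only.
def load_state_dict(model_keys, state_dict, strict=True):
    missing = [k for k in model_keys if k not in state_dict]
    unexpected = [k for k in state_dict if k not in model_keys]
    if strict and (missing or unexpected):
        raise RuntimeError(f"missing keys: {missing}, unexpected keys: {unexpected}")
    return missing, unexpected

model_keys = ["conv1.weight", "fc.weight", "fc.bias"]
ckpt = {"conv1.weight": ..., "fc.weight": ...}  # "fc.bias" is absent

# strict=False only reports the mismatch; strict=True fails loudly.
assert load_state_dict(model_keys, ckpt, strict=False) == (["fc.bias"], [])
try:
    load_state_dict(model_keys, ckpt, strict=True)
except RuntimeError:
    pass  # a broken weight file surfaces as an error instead of passing silently
```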

@adamjstewart
Collaborator

Can you make the same change to the other trainers?

@adamjstewart adamjstewart added this to the 0.4.2 milestone May 29, 2023
@adamjstewart adamjstewart changed the title Alternative fix for test RAM blowing up Test fewer models in trainers to avoid exceeding RAM May 29, 2023
@adamjstewart
Collaborator

Not only does this decrease memory usage, it also shaves off 1/3 of the time our tests take to run!

It's unclear whether this is a real solution or whether we're just kicking the can down the road. We could find that if we double the number of models, the model tests start to fail instead of the trainer tests. But I guess we'll find out when we add all of our Landsat weights.

@adamjstewart adamjstewart merged commit 108c94b into main May 29, 2023
18 checks passed
@adamjstewart adamjstewart deleted the reduce_byol_tests branch May 29, 2023 16:28
@isaaccorley
Collaborator

If we are using dummy models, does it even make sense to test that all the model weights work in a trainer other than just testing that the weights load properly into the specified backbone?

@adamjstewart
Collaborator

At a bare minimum, we need these tests for test coverage. But we also want to make sure that enums, strings, and paths all work correctly.
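A hypothetical sketch (names invented; not torchgeo's actual resolution logic) of what "enums, strings, and paths all work" means for a trainer's `weights` argument: the same option can arrive as an enum member, the enum's string name, or a checkpoint path, and should be normalized internally.

```python
from enum import Enum
from pathlib import Path

class ResNet18_Weights(Enum):  # stand-in for a torchgeo weights enum
    SENTINEL2_ALL_MOCO = "sentinel2_all_moco"

def resolve_weights(weights):
    """Normalize a user-supplied weights argument (illustrative helper)."""
    if isinstance(weights, ResNet18_Weights):
        return weights
    if isinstance(weights, str) and weights in ResNet18_Weights.__members__:
        return ResNet18_Weights[weights]
    return Path(weights)  # treat anything else as a checkpoint path

assert resolve_weights(ResNet18_Weights.SENTINEL2_ALL_MOCO) is ResNet18_Weights.SENTINEL2_ALL_MOCO
assert resolve_weights("SENTINEL2_ALL_MOCO") is ResNet18_Weights.SENTINEL2_ALL_MOCO
assert resolve_weights("/tmp/ckpt.pth") == Path("/tmp/ckpt.pth")
```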

@adamjstewart adamjstewart modified the milestones: 0.4.2, 0.5.0 Sep 28, 2023