SeCo/BYOL: add datamodule, RandomSeasonContrast #1168

adamjstewart · 2023-03-09T20:59:30Z

This PR contains the following changes:

SeCo: faster initialization
SeCo: add datamodule
SeCo: add RandomSeasonContrast
BYOL: add RandomSeasonContrast

RandomSeasonContrast is the idea from SeCo and SSL4EO where images taken of the same location at different points in time are used as inputs to a contrastive SSL model instead of taking two random crops of the same image.

adamjstewart · 2023-03-09T21:03:52Z

torchgeo/datasets/seco.py

        self.transforms = transforms
        self.download = download
        self.checksum = checksum

        self._verify()

-        # TODO: This is slow, I think this should be generated on download and then


There's no reason we need to os.walk down 5M directories and 65M files, this is just silly. We know the total number of directories to expect (100k or 1m / 5) and can find subdirectories on the fly.

adamjstewart · 2023-03-09T21:04:59Z

torchgeo/datasets/seco.py

+            self.root, self.metadata[self.version]["directory"], f"{index:06}"
+        )
+        patch_dirs = glob.glob(os.path.join(directory, "*"))
+        patch_dirs = random.sample(patch_dirs, self.seasons)


RandomSeasonContrast. User defines how many seasons (patches) they want per location, and we randomly return them. Note that returning all 5 seasons will return them in a random order, but this was already the case in the previous implementation.

adamjstewart · 2023-03-09T21:05:49Z

torchgeo/datasets/seco.py

+            sample with an "image" in SCxHxW format where S is the number of seasons
+
+        .. versionchanged:: 0.5
+           Image shape changed from 5xCxHxW to SCxHxW


Kornia requires samples to be B x C x H x W, it doesn't support B x T x C x H x W.

adamjstewart · 2023-03-09T21:06:21Z

torchgeo/trainers/byol.py

+
+        in_channels = self.hyperparams["in_channels"]
+        assert x.size(1) == in_channels or x.size(1) == 2 * in_channels
+        if x.size(1) == in_channels:


This trainer now supports both datasets with and without RandomSeasonContrast

adamjstewart · 2023-03-09T21:07:44Z

torchgeo/trainers/byol.py

@@ -401,33 +409,10 @@ def training_step(self, *args: Any, **kwargs: Any) -> Tensor:
        return loss

    def validation_step(self, *args: Any, **kwargs: Any) -> None:


I don't think it makes sense to include validation/test/predict in an SSL trainer. For datasets without labels, there is no easy way to evaluate performance. For datasets with labels, there is no way to know which task we are trying to evaluate (classification, regression, semantic segmentation, etc.)

I think this is fine. There could be an infinite number of downstream tasks

https://pytorch-lightning.readthedocs.io/en/stable/notebooks/course_UvA-DL/13-contrastive-learning.html defines a validation_step but that's because their trainer only supports validation of classification datasets.

adamjstewart · 2023-03-09T21:08:54Z

tests/trainers/test_byol.py

-        "name,classname",
-        [
-            ("chesapeake_cvpr_7", ChesapeakeCVPRDataModule),
-            ("chesapeake_cvpr_prior", ChesapeakeCVPRDataModule),


We could test any of our 50+ supervised datasets, but I think it makes more sense to test our unsupervised datasets. Right now, we only have SeCo and BYOL, but I'm working on adding SSL4EO and SimCLR/MoCo which will also be tested in a similar fashion.

isaaccorley

LGTM

* SeCo/BYOL: add datamodule, RandomSeasonContrast * black * Fix length, mypy * Fix tests * Fix float length * Simplify length logic * Simpler plotting * Fix axes indexing * Increase coverage * Increase coverage * CVPR prior not compatible with segmentation, but is with BYOL * Increase coverage * isort fix * mypy fix

SeCo/BYOL: add datamodule, RandomSeasonContrast

e5fa20c

adamjstewart added this to the 0.5.0 milestone Mar 9, 2023

github-actions bot added datamodules PyTorch Lightning datamodules datasets Geospatial or benchmark datasets documentation Improvements or additions to documentation testing Continuous integration testing trainers PyTorch Lightning trainers labels Mar 9, 2023

adamjstewart commented Mar 9, 2023

View reviewed changes

adamjstewart added 7 commits March 9, 2023 15:54

black

366d545

Fix length, mypy

22308ac

Fix tests

4577201

Fix float length

f67b91c

Simplify length logic

eebe673

Simpler plotting

0a2b243

Fix axes indexing

2c51585

adamjstewart mentioned this pull request Mar 10, 2023

SSL4EO-S12: add new dataset/datamodule #1151

Merged

adamjstewart added 6 commits March 9, 2023 21:11

Increase coverage

b4711bb

Increase coverage

b0ee496

CVPR prior not compatible with segmentation, but is with BYOL

187da38

Increase coverage

387c059

isort fix

4cc5c6d

mypy fix

f838861

isaaccorley approved these changes Mar 10, 2023

View reviewed changes

adamjstewart merged commit c1a6fb1 into main Mar 17, 2023

adamjstewart deleted the datamodules/seco branch March 17, 2023 19:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SeCo/BYOL: add datamodule, RandomSeasonContrast #1168

SeCo/BYOL: add datamodule, RandomSeasonContrast #1168

adamjstewart commented Mar 9, 2023 •

edited

Loading

adamjstewart Mar 9, 2023 •

edited

Loading

adamjstewart Mar 9, 2023

adamjstewart Mar 9, 2023

adamjstewart Mar 9, 2023

isaaccorley Mar 10, 2023

adamjstewart Mar 9, 2023

isaaccorley Mar 10, 2023

adamjstewart Mar 10, 2023 •

edited

Loading

adamjstewart Mar 9, 2023

isaaccorley left a comment

		@@ -401,33 +409,10 @@ def training_step(self, args: Any, *kwargs: Any) -> Tensor:
		return loss

		def validation_step(self, args: Any, *kwargs: Any) -> None:

SeCo/BYOL: add datamodule, RandomSeasonContrast #1168

SeCo/BYOL: add datamodule, RandomSeasonContrast #1168

Conversation

adamjstewart commented Mar 9, 2023 • edited Loading

adamjstewart Mar 9, 2023 • edited Loading

Choose a reason for hiding this comment

adamjstewart Mar 9, 2023

Choose a reason for hiding this comment

adamjstewart Mar 9, 2023

Choose a reason for hiding this comment

adamjstewart Mar 9, 2023

Choose a reason for hiding this comment

isaaccorley Mar 10, 2023

Choose a reason for hiding this comment

adamjstewart Mar 9, 2023

Choose a reason for hiding this comment

isaaccorley Mar 10, 2023

Choose a reason for hiding this comment

adamjstewart Mar 10, 2023 • edited Loading

Choose a reason for hiding this comment

adamjstewart Mar 9, 2023

Choose a reason for hiding this comment

isaaccorley left a comment

Choose a reason for hiding this comment

adamjstewart commented Mar 9, 2023 •

edited

Loading

adamjstewart Mar 9, 2023 •

edited

Loading

adamjstewart Mar 10, 2023 •

edited

Loading