Remove dataset-specific trainers #286

adamjstewart · 2021-12-15T02:47:23Z

Closes #205
Depends on #329

References:

isaaccorley · 2021-12-17T22:49:51Z

Just a note: I've recently been using kornia augmentations in the datamodule on a side project and one of the things I ran into was how to let the datamodule know if I'm loading data for a train/val/test set so that I can choose to augment or not. Found that you can access a bool attr self.trainer.training in the datamodule so you can do something like:

def on_after_batch_transfer(self, batch, dataloader_idx):
   if self.trainer.training:
      # Augment only if loading for train_step
      batch = augmentations(batch)
    return batch

torchgeo/datasets/landcoverai.py

adamjstewart · 2021-12-17T23:06:21Z

Note that none of this code currently gets hit by our tests. We aren't using a pl.Trainer and so things like self.trainer are None. Still trying to figure out the best way to test this.

adamjstewart · 2021-12-31T04:36:45Z

Note: I don't think we're adding the predictions to the batch before plotting, we probably should

adamjstewart · 2021-12-31T17:30:21Z

tests/trainers/test_utils.py

@@ -17,26 +16,6 @@
 )


-class FakeExperiment(object):


Forgot to remove these in #329

adamjstewart · 2021-12-31T18:13:44Z

Test failure is because the So2Sat dataset doesn't know how to plot any of the datamodule reduced band set options. We should probably move these to the dataset level.

adamjstewart · 2021-12-31T19:19:26Z

Another hiccup: self.trainer.datamodule.val_dataset.plot(...) doesn't work for datamodules that use Subset or random_split. One possible solution would be to use self.trainer.datamodule.plot(...) and add a plot(...) method to every DataModule that passes all args to the Dataset plot(...) method.

isaaccorley · 2021-12-31T20:05:36Z

You can access the plot method for Subset datasets like self.trainer.datamodule.val_dataset.dataset.plot. Not sure what workaround we should make for this.

Edit: I think adding a plot method to each datamodule that just calls the dataset plot method is a decent solution.

adamjstewart · 2022-01-01T02:56:21Z

I believe the failing unit tests for ClassificationTask are because VisionClassificationDataset is overwriting self.classes and our fake data only has 2 classes. Simple fix would be to add more fake data.

Still haven't investigated the failing unit tests for SemanticSegmentationTask. Will do so tomorrow.

torchgeo/trainers/classification.py

torchgeo/datamodules/landcoverai.py

conf/task_defaults/eurosat.yaml

torchgeo/datamodules/landcoverai.py

torchgeo/trainers/classification.py

* Remove dataset-specific trainers * Collation functions will be new in 0.2.0 * Clarify arg docstring * Style fixes * Remove files forgotten in rebase * Fix bug in unbind_samples, add tests * Fix bugs in datamodule augmentations * Increase coverage for datamodules * Fix bugs in logger plotting, properly test * Fix tests * Increase coverage of trainers * Use datamodule plot instead of dataset plot * Skip datamodules without tests * Plot predictions * Fix ClassificationTask tests * Fix SemanticSegmentationTask tests * EAFP -> LBYL * Ensure that tensors are on the CPU before plotting

adamjstewart added the trainers PyTorch Lightning trainers label Dec 15, 2021

adamjstewart added this to the 0.2.0 milestone Dec 15, 2021

adamjstewart commented Dec 17, 2021

View reviewed changes

torchgeo/datasets/landcoverai.py Outdated Show resolved Hide resolved

adamjstewart mentioned this pull request Dec 22, 2021

Move DataModules to torchgeo.datamodules #321

Merged

adamjstewart force-pushed the trainers/dataset-specific branch 3 times, most recently from c3c69bc to 9657a67 Compare December 24, 2021 22:37

github-actions bot added datasets Geospatial or benchmark datasets documentation Improvements or additions to documentation testing Continuous integration testing datamodules PyTorch Lightning datamodules labels Dec 24, 2021

adamjstewart mentioned this pull request Dec 25, 2021

Refactor datamodule/model testing #329

Merged

adamjstewart force-pushed the trainers/dataset-specific branch from 9657a67 to 76f24fa Compare December 30, 2021 20:59

adamjstewart commented Dec 31, 2021

View reviewed changes

tests/trainers/test_utils.py

@@ -17,26 +16,6 @@

)

class FakeExperiment(object):

Copy link

Collaborator Author

adamjstewart Dec 31, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Forgot to remove these in #329

adamjstewart force-pushed the trainers/dataset-specific branch from 6bf2b19 to 1035d5e Compare December 31, 2021 20:46

adamjstewart marked this pull request as ready for review December 31, 2021 22:09

adamjstewart added 8 commits December 31, 2021 20:16

Remove dataset-specific trainers

b3440d4

Collation functions will be new in 0.2.0

76ae155

Clarify arg docstring

8a81aa0

Style fixes

0a9daca

Remove files forgotten in rebase

c489de0

Fix bug in unbind_samples, add tests

b701827

Fix bugs in datamodule augmentations

27b390c

Increase coverage for datamodules

6c66ea1

adamjstewart added 5 commits December 31, 2021 20:16

Fix bugs in logger plotting, properly test

73898c2

Fix tests

b02dc9d

Increase coverage of trainers

0e66aab

Use datamodule plot instead of dataset plot

bc5ae8a

Skip datamodules without tests

e4f08d3

adamjstewart force-pushed the trainers/dataset-specific branch from ccfe068 to e4f08d3 Compare January 1, 2022 02:16

adamjstewart marked this pull request as draft January 1, 2022 02:38

Plot predictions

8687382

adamjstewart added 2 commits January 1, 2022 10:38

Fix ClassificationTask tests

f1f81ed

Fix SemanticSegmentationTask tests

22dd647

adamjstewart marked this pull request as ready for review January 1, 2022 16:52

isaaccorley reviewed Jan 1, 2022

View reviewed changes

torchgeo/trainers/classification.py Show resolved Hide resolved

torchgeo/datamodules/landcoverai.py Outdated Show resolved Hide resolved

conf/task_defaults/eurosat.yaml Show resolved Hide resolved

calebrob6 reviewed Jan 1, 2022

View reviewed changes

torchgeo/datamodules/landcoverai.py Outdated Show resolved Hide resolved

calebrob6 reviewed Jan 1, 2022

View reviewed changes

torchgeo/trainers/classification.py Show resolved Hide resolved

adamjstewart added 2 commits January 1, 2022 13:39

EAFP -> LBYL

c4eefb2

Ensure that tensors are on the CPU before plotting

dea5b9a

calebrob6 mentioned this pull request Jan 1, 2022

Re-think how configs are handled in train.py #227

Closed

calebrob6 approved these changes Jan 1, 2022

View reviewed changes

adamjstewart merged commit 42b9a6d into main Jan 1, 2022

adamjstewart deleted the trainers/dataset-specific branch January 1, 2022 20:14

adamjstewart added utilities Utilities for working with geospatial data and removed utilities Utilities for working with geospatial data labels Jan 2, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove dataset-specific trainers #286

Remove dataset-specific trainers #286

adamjstewart commented Dec 15, 2021 •

edited

isaaccorley commented Dec 17, 2021

adamjstewart commented Dec 17, 2021

adamjstewart commented Dec 31, 2021

adamjstewart Dec 31, 2021

adamjstewart commented Dec 31, 2021

adamjstewart commented Dec 31, 2021

isaaccorley commented Dec 31, 2021 •

edited

adamjstewart commented Jan 1, 2022

Remove dataset-specific trainers #286

Remove dataset-specific trainers #286

Conversation

adamjstewart commented Dec 15, 2021 • edited

isaaccorley commented Dec 17, 2021

adamjstewart commented Dec 17, 2021

adamjstewart commented Dec 31, 2021

adamjstewart Dec 31, 2021

Choose a reason for hiding this comment

adamjstewart commented Dec 31, 2021

adamjstewart commented Dec 31, 2021

isaaccorley commented Dec 31, 2021 • edited

adamjstewart commented Jan 1, 2022

adamjstewart commented Dec 15, 2021 •

edited

isaaccorley commented Dec 31, 2021 •

edited