CDL/NLCD/SSL4EO: allow selection of classes #1392

adamjstewart · 2023-06-03T04:28:30Z

There are too many CDL classes. #1389 revealed that the top 3 classes cover more area than the bottom 130, and only 17 classes occupy more than 1% of land. We need a way to specify a smaller set of classes in order for benchmarking to be computationally tractable.

This PR introduces a classes parameter to all 3 datasets that lets users specify which classes they actually care about. In combination with the class_weights and ignore_index parameters of SemanticSegmentationTask, this will allow us to weight these remaining classes to fight class imbalance.

@nilsleh apologies, but this undoes a lot of your hard work and changes the cmaps to their previous values (albeit without unused classes). I only did NLCD to start, but see what you think of this implementation and we can update CDL and SSL4EO-L Benchmark to match. I also haven't tested any of this so it's very likely there are bugs. We should test both the getitem and plot methods. I also don't know what will happen if the user doesn't include 0 in classes.

All of the following plots have been validated in QGIS.

NLCD

All classes:

Few classes:

CDL

All classes:

Few classes:

calebrob6 · 2023-06-03T23:01:45Z

There are too many CDL classes. #1389 revealed that the top 3 classes cover more area than the bottom 130, and only 17 classes occupy more than 1% of land. We need a way to specify a smaller set of classes in order for benchmarking to be computationally tractable.

I don't understand this motivation:

the number of CDL classes is fixed, there aren't "too many". Long-tailed class distributions are something that needs to be dealt with in modeling.
<256 classes is definitely not computationally intractable (e.g. imagenet has 1,000)
Regardless, if a user doesn't want some classes, then they can remap them to 0 in a transform (see below). I don't see this as a feature that torchgeo needs for one particular dataset.

def preprocess(sample):
  mask = sample["mask"]
  mask[mask == 42] = 0
  sample["mask"] = mask
  return sample

Do you mind expanding on your thinking here?

adamjstewart · 2023-06-04T00:23:25Z

<256 classes is definitely not computationally intractable (e.g. imagenet has 1,000)

It isn't for classification, but it is for semantic segmentation. Our CDL benchmarks take 60x longer than our NLCD benchmarks. At the moment, it's basically impossible for us to finish all CDL benchmarks before the deadline. This is what we decided in the last meeting.

calebrob6 · 2023-06-04T05:29:15Z

I can't reproduce this behavior

calebrob6 · 2023-06-04T05:31:52Z

It is identical with 20 and 130 output classes, the last classification layer should not be a large fraction of the computation done in a UNet.

nilsleh · 2023-06-04T12:58:30Z

I think the story changes when you also include the backward pass, that is also what we found when using the pytorch Profiler.

calebrob6 · 2023-06-04T15:04:27Z

There's some difference but not double

adamjstewart · 2023-06-04T15:10:47Z

@yichiac tried training on our cluster and each epoch took 60x longer for CDL than NLCD. @nilsleh tried on his cluster and it was 10x. @calebrob6 if you want to train on your GPUs that's fine, but it's not feasible on our systems. This is what we spent a large portion of the last meeting discussing and this is the decision that we made. We've been trying to figure this out for days and no one has solved it and we're rapidly running out of time to start benchmarking. We have 65 CDL benchmarks to fill in. If each CDL benchmark takes > 24 hrs to complete, we're going to need quite a lot of GPUs...

calebrob6 · 2023-06-04T15:20:45Z

That's fine, but I don't think it needs to be baked into the library is what I'm saying.

adamjstewart · 2023-06-04T15:22:19Z

Writing transforms is a little painful at the moment but we can think about moving this into a datamodule transform before the next release.

adamjstewart · 2023-06-04T16:22:11Z

https://openaccess.thecvf.com/content/ICCV2021/papers/Jain_Scaling_Semantic_Segmentation_Beyond_1K_Classes_on_a_Single_GPU_ICCV_2021_paper.pdf

calebrob6 · 2023-06-04T20:27:04Z

Yes, it will take more memory, but the computation cost is linear in the number of classes.

See how the linear function fit on "time for inference + backprop" from 20 to 500 classes predicts the "time for inference + backprop" for 1000 classes:

CDL/NLCD/SSL4EO: allow selection of classes

64f5562

adamjstewart requested a review from nilsleh June 3, 2023 04:28

adamjstewart added this to In progress in SSL4EO-L via automation Jun 3, 2023

adamjstewart marked this pull request as draft June 3, 2023 04:28

github-actions bot added the datasets Geospatial or benchmark datasets label Jun 3, 2023

adamjstewart added 5 commits June 2, 2023 23:33

0 is already 0

a9903d1

Get NLCD tests to pass

1064591

Search recursively for NLCD files

1f3654a

Update CDL

c9b9f04

Update SSL4EO-L Benchmark

7d0c2e0

github-actions bot added the testing Continuous integration testing label Jun 3, 2023

adamjstewart added 6 commits June 3, 2023 17:27

Passing tests

52643ea

Test SSL4EO-L Benchmark

6818229

Test CDL

9e470d1

Test NLCD

2d273dd

Mypy fix

eb80453

Remove debugging code

5ab10e8

adamjstewart marked this pull request as ready for review June 3, 2023 22:53

calebrob6 approved these changes Jun 4, 2023

View reviewed changes

calebrob6 merged commit 9e57f27 into microsoft:main Jun 4, 2023
21 checks passed

SSL4EO-L automation moved this from In progress to Done Jun 4, 2023

adamjstewart deleted the datasets/cdl-nlcd-classes branch June 4, 2023 15:21

adamjstewart added this to the 0.5.0 milestone Sep 29, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CDL/NLCD/SSL4EO: allow selection of classes #1392

CDL/NLCD/SSL4EO: allow selection of classes #1392

adamjstewart commented Jun 3, 2023 •

edited

Loading

calebrob6 commented Jun 3, 2023 •

edited

Loading

adamjstewart commented Jun 4, 2023 •

edited

Loading

calebrob6 commented Jun 4, 2023

calebrob6 commented Jun 4, 2023

nilsleh commented Jun 4, 2023

calebrob6 commented Jun 4, 2023

adamjstewart commented Jun 4, 2023

calebrob6 commented Jun 4, 2023

adamjstewart commented Jun 4, 2023

adamjstewart commented Jun 4, 2023

calebrob6 commented Jun 4, 2023

CDL/NLCD/SSL4EO: allow selection of classes #1392

CDL/NLCD/SSL4EO: allow selection of classes #1392

Conversation

adamjstewart commented Jun 3, 2023 • edited Loading

NLCD

CDL

calebrob6 commented Jun 3, 2023 • edited Loading

adamjstewart commented Jun 4, 2023 • edited Loading

calebrob6 commented Jun 4, 2023

calebrob6 commented Jun 4, 2023

nilsleh commented Jun 4, 2023

calebrob6 commented Jun 4, 2023

adamjstewart commented Jun 4, 2023

calebrob6 commented Jun 4, 2023

adamjstewart commented Jun 4, 2023

adamjstewart commented Jun 4, 2023

calebrob6 commented Jun 4, 2023

adamjstewart commented Jun 3, 2023 •

edited

Loading

calebrob6 commented Jun 3, 2023 •

edited

Loading

adamjstewart commented Jun 4, 2023 •

edited

Loading