Vaihingen datamodule #851

nilsleh · 2022-10-15T19:08:53Z

Description

I would expect that with a existing Vaihingen datamodule, I only need to define a segmentation task and a pl.Trainer to train a model on this dataset (but maybe this expectation is wrong). However, the Vaihingen dataset has variable sized image dimensions, and therefore one cannot specify a batch_size>1 because otherwise tensors cannot be stacked. So either there should be a collate function for the dataloaders in the datamodule or some comment in the documentation because the default batch_size of the datamodule is 64.

Steps to reproduce

from torchgeo.datamodules import Vaihingen2DDataModule
from torchgeo.trainers import SemanticSegmentationTask
import pytorch_lightning as pl

datamodule = Vaihingen2DDataModule(root="./data/Vaihingen")

task = SemanticSegmentationTask(
    segmentation_model="unet",
    encoder_name="resnet18",
    encoder_weights="imagenet",
    in_channels=3,
    num_classes=6,
    loss="jaccard",
    ignore_index=None,
    learning_rate=0.001,
    learning_rate_schedule_patience=5
)

trainer = pl.Trainer(
    fast_dev_run=True,
    enable_progress_bar=False
)

trainer.fit(
    model=task,
    datamodule=datamodule
)

Version

0.4.0.dev0

The text was updated successfully, but these errors were encountered:

adamjstewart · 2022-10-15T19:12:48Z

My vote is for data augmentation that pads or crops to a consistent size. How much do image sizes vary? A lot of other datasets have image sizes that vary by ± 1 px, so those are much easier to take care of.

calebrob6 · 2022-10-15T19:18:18Z

There are 16 samples in the training dataset and they are more like "tiles" or "scenes". I think the datamodule should randomly sample fixed size crops from them.

The sizes:

torch.Size([3, 2569, 1919])
torch.Size([3, 2566, 1893])
torch.Size([3, 2558, 2818])
torch.Size([3, 2565, 1919])
torch.Size([3, 1281, 2336])
torch.Size([3, 2546, 1903])
torch.Size([3, 2546, 1903])
torch.Size([3, 1783, 2995])
torch.Size([3, 2567, 1917])
torch.Size([3, 3007, 2006])
torch.Size([3, 2563, 1934])
torch.Size([3, 2555, 1980])
torch.Size([3, 2555, 1388])
torch.Size([3, 1995, 1996])
torch.Size([3, 2557, 1887])
torch.Size([3, 2557, 1887])

adamjstewart · 2022-10-15T19:20:53Z

In that case, we should convert Vaihingen2D from a NonGeoDataset to a GeoDataset and use a GeoSampler like we do in NAIPChesapeakeDataModule.

calebrob6 · 2022-10-15T19:29:50Z

They aren't georeferenced

adamjstewart · 2022-10-15T19:40:31Z

Guess we can do something like this then: https://kornia-tutorials.readthedocs.io/en/latest/geometry_generate_patch.html

isaaccorley · 2022-10-16T05:49:59Z

OSCDDataModule is a good reference. It also has variable sized images and we take random crops during training.

adamjstewart added the datamodules PyTorch Lightning datamodules label Oct 15, 2022

This was referenced Oct 16, 2022

Fix Vaihingen datamodule #853

Merged

Random sized patches support for other non-geospatial datamodules #855

Closed

adamjstewart closed this as completed in #853 Dec 30, 2022

adamjstewart added this to the 0.4.0 milestone Jan 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vaihingen datamodule #851

Vaihingen datamodule #851

nilsleh commented Oct 15, 2022

adamjstewart commented Oct 15, 2022

calebrob6 commented Oct 15, 2022

adamjstewart commented Oct 15, 2022

calebrob6 commented Oct 15, 2022

adamjstewart commented Oct 15, 2022

isaaccorley commented Oct 16, 2022

Vaihingen datamodule #851

Vaihingen datamodule #851

Comments

nilsleh commented Oct 15, 2022

Description

Steps to reproduce

Version

adamjstewart commented Oct 15, 2022

calebrob6 commented Oct 15, 2022

adamjstewart commented Oct 15, 2022

calebrob6 commented Oct 15, 2022

adamjstewart commented Oct 15, 2022

isaaccorley commented Oct 16, 2022