Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add SKIPP'D Dataset #1267

Merged
merged 8 commits into from
Apr 19, 2023
Merged

Add SKIPP'D Dataset #1267

merged 8 commits into from
Apr 19, 2023

Conversation

nilsleh
Copy link
Collaborator

@nilsleh nilsleh commented Apr 19, 2023

This PR adds the SKIPPD dataset and datamodule with additional information available in their repo. The task is to predict power output from fish-eye images of the sky. They have done two approaches from my understanding:

  1. "Now Cast": Predict the power output from a single RGB image
  2. "Short-term Forecast": from time-series images make a forecast

This dataset version implements approach 1. Once we support Forecasting in torchgeo, I would add the "forecast" version of this dataset.

Dataset Format:

  • .hdf5 file containing images and labels
  • .npy files with corresponding datetime timestamps

Dataset Features:

  • fish-eye RGB images (64x64px)
  • power output measurements from 30-kW rooftop PV array
  • 1-min interval across 3 years (2017-2019)

The dataset comes with a "trainval" and "test" split:

  • For the datamodule I implemented a 80/20 random split on the "trainval" split to generate a train and validation set

Examples:

@github-actions github-actions bot added datamodules PyTorch Lightning datamodules datasets Geospatial or benchmark datasets documentation Improvements or additions to documentation testing Continuous integration testing labels Apr 19, 2023
@nilsleh nilsleh added this to the 0.5.0 milestone Apr 19, 2023
docs/api/datasets.rst Outdated Show resolved Hide resolved
docs/api/non_geo_datasets.csv Outdated Show resolved Hide resolved
torchgeo/datamodules/skippd.py Outdated Show resolved Hide resolved
torchgeo/datamodules/skippd.py Outdated Show resolved Hide resolved
torchgeo/datamodules/skippd.py Outdated Show resolved Hide resolved
torchgeo/datasets/skippd.py Outdated Show resolved Hide resolved
torchgeo/datasets/skippd.py Outdated Show resolved Hide resolved
@adamjstewart adamjstewart changed the title Add SKIPPD Dataset Add SKIPP'D Dataset Apr 19, 2023
torchgeo/datamodules/skippd.py Outdated Show resolved Hide resolved
torchgeo/datasets/skippd.py Outdated Show resolved Hide resolved
@adamjstewart adamjstewart enabled auto-merge (squash) April 19, 2023 19:17
@adamjstewart adamjstewart merged commit c608045 into microsoft:main Apr 19, 2023
17 checks passed
yichiac pushed a commit to yichiac/torchgeo that referenced this pull request Apr 29, 2023
* add dataset

* add datamodule

* add unit tests

* add docs

* requested changes

* requested changes

* wrong args in docs

* args order
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
datamodules PyTorch Lightning datamodules datasets Geospatial or benchmark datasets documentation Improvements or additions to documentation testing Continuous integration testing
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants