Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add instructions on downloading the DeepGlobeLandCover dataset #1797

Closed
robmarkcole opened this issue Jan 4, 2024 · 5 comments · Fixed by #1921
Closed

Add instructions on downloading the DeepGlobeLandCover dataset #1797

robmarkcole opened this issue Jan 4, 2024 · 5 comments · Fixed by #1921
Labels
documentation Improvements or additions to documentation
Milestone

Comments

@robmarkcole
Copy link
Contributor

robmarkcole commented Jan 4, 2024

Issue

The dataset docs state The dataset that we use with a custom train/test split can be downloaded from Kaggle - however this is a necessity as you cannot pass download=True

Fix

Suggest documenting the steps using kaggle CLI (below), or just to state that this must be performed? Alternatively host on Huggingface and automate the download

pip install kaggle # place api key at ~/.kaggle/kaggle.json
cd data
kaggle datasets download -d geoap96/deepglobe2018-landcover-segmentation-traindataset
unzip deepglobe2018-landcover-segmentation-traindataset.zip
@robmarkcole robmarkcole added the documentation Improvements or additions to documentation label Jan 4, 2024
@robmarkcole robmarkcole changed the title Add instructions on downloading the deepglobe dataset Add instructions on downloading the DeepGlobeLandCover dataset Jan 4, 2024
@adamjstewart
Copy link
Collaborator

Alternatively host on Huggingface and automate the download

We haven't yet found the license for the dataset, so I'm not sure if we can do this. Want to try to reach out to the authors of the dataset to see? If the license permits redistribution, I'll add it to our Hugging Face account.

@robmarkcole
Copy link
Contributor Author

Reading the paper, appears the imagery is from deepglobe/maxar, will check my network for contacts there

@adamjstewart
Copy link
Collaborator

Any updates on this?

@robmarkcole
Copy link
Contributor Author

RE license, I came to a dead end - the people I spoke to who had worked on the dataset have all moved on

As to updating the docs, I think it could be useful if you could clarify what level of detail we should be providing?

@adamjstewart
Copy link
Collaborator

https://torchgeo.readthedocs.io/en/stable/api/datasets.html#torchgeo.datasets.SSL4EOS12 is an example of a similar dataset where users have to manually download it. So a similar level of detail is fine. Want to submit a PR?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants