Add DOTA dataset #2551

nilsleh · 2025-01-31T15:15:45Z

This PR adds the DOTA dataset for object detection.

Dataset rehosted on HF for faster and comprehensive download in one place.

Dataset features:

multi-class object detection (15 classes in V1 and 18 classes in V2)
horizontal and oriented bounding boxes

Dataset format:

images are three channel PNGs with various pixel sizes
annotations are text files with one line per bounding box

Horizontal BBox example:

Oriented BBox example:

@dingjiansw101 as inquired in CAPTAIN-WHU/DOTA#29, this is the PR that aims to make the DOTA datset more accessible. The dataset structure in the google and baidu drives was a bit confusing and I restructured it on HF which is hopefully more clear. If you would take a look and have any comments, that would be helpful.

Also @robmarkcole perhaps interesting for you.

Closes #1
Closes #2599

docs/api/datasets/non_geo_datasets.csv

tests/datasets/test_dota.py

torchgeo/datasets/dota.py

nilsleh · 2025-03-10T13:29:24Z

I tediously reorganized the original data to now extract into separate annotation version directories, so they can be used interchangeably.

docs/api/datasets/non_geo_datasets.csv

torchgeo/datasets/dota.py

tests/datasets/test_dota.py

torchgeo/datasets/dota.py

adamjstewart · 2025-03-12T13:57:24Z

torchgeo/datasets/dota.py

+            image: image tensor
+        """
+        image = Image.open(os.path.join(self.root, path)).convert('RGB')
+        return torch.from_numpy(np.array(image).transpose(2, 0, 1)).float()


Could use einops.rearrange to make this more clear

docs/api/datasets/non_geo_datasets.csv

dota

bf92489

github-actions bot added documentation datasets testing labels Jan 31, 2025

nilsleh marked this pull request as draft January 31, 2025 15:16

nilsleh added this to the 0.7.0 milestone Jan 31, 2025

nilsleh and others added 4 commits February 3, 2025 09:48

ruff

d2a4ff4

ruff

e5d3de8

Merge branch 'main' into dota

7d1f77e

dataset test

0acd6b0

nilsleh marked this pull request as ready for review February 3, 2025 17:12

adamjstewart requested changes Feb 9, 2025

View reviewed changes

stach

843bec2

danphan mentioned this pull request Feb 21, 2025

Issue loading images from DOTA dataset on Huggingface #2599

Closed

nilsleh and others added 3 commits March 10, 2025 10:26

Merge branch 'main' into dota

6195033

merge main

a9d6396

update

4098d45

nilsleh and others added 5 commits March 10, 2025 14:32

ruff

30e30a9

Merge branch 'main' into dota

f69430e

mypy

3a1841a

codecov

5ee08e8

codecov

646445e

adamjstewart requested changes Mar 12, 2025

View reviewed changes

requests

57c43dd

adamjstewart previously approved these changes Mar 12, 2025

View reviewed changes

adamjstewart reviewed Mar 12, 2025

View reviewed changes

docs/api/datasets/non_geo_datasets.csv Outdated Show resolved Hide resolved

Hyphen -> en dash

fe1101d

adamjstewart dismissed their stale review via fe1101d March 12, 2025 16:26

Merge branch 'main' into dota

51f75ff

adamjstewart approved these changes Mar 12, 2025

View reviewed changes

adamjstewart enabled auto-merge (squash) March 12, 2025 16:27

adamjstewart merged commit 6067656 into microsoft:main Mar 12, 2025
22 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add DOTA dataset #2551

Add DOTA dataset #2551

nilsleh commented Jan 31, 2025 •

edited by adamjstewart

Loading

nilsleh commented Mar 10, 2025

adamjstewart Mar 12, 2025

Add DOTA dataset #2551

Add DOTA dataset #2551

Conversation

nilsleh commented Jan 31, 2025 • edited by adamjstewart Loading

nilsleh commented Mar 10, 2025

adamjstewart Mar 12, 2025

Choose a reason for hiding this comment

nilsleh commented Jan 31, 2025 •

edited by adamjstewart

Loading