Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add DIOR dataset #2572

Merged
merged 14 commits into from
Mar 12, 2025
Merged

Add DIOR dataset #2572

merged 14 commits into from
Mar 12, 2025

Conversation

nilsleh
Copy link
Collaborator

@nilsleh nilsleh commented Feb 10, 2025

This PR adds the DIOR dataset. License found here. Dataset rehosted on Huggingface based on this google drive link.

Dataset features:

* 20 classes
* 192,472 manually annotated bounding box instances

Dataset format:

* Images are three channel .jpg files.
* Annotations are in xml format

dior_0

@nilsleh nilsleh added this to the 0.7.0 milestone Feb 10, 2025
@nilsleh nilsleh marked this pull request as draft February 10, 2025 17:41
@github-actions github-actions bot added documentation Improvements or additions to documentation datasets Geospatial or benchmark datasets testing Continuous integration testing labels Feb 10, 2025
@nilsleh nilsleh marked this pull request as ready for review February 12, 2025 07:43
@adamjstewart adamjstewart mentioned this pull request Feb 25, 2025
Copy link
Collaborator

@adamjstewart adamjstewart left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just minor documentation things remaining

array: np.typing.NDArray[np.int_] = np.array(img.convert('RGB'))
tensor: Tensor = torch.from_numpy(array)
# Convert from HxWxC to CxHxW
tensor = tensor.permute((2, 0, 1))
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could use einops.rearrange here to make this easier to read

@adamjstewart adamjstewart merged commit f0f8a00 into microsoft:main Mar 12, 2025
22 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
datasets Geospatial or benchmark datasets documentation Improvements or additions to documentation testing Continuous integration testing
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants