xView2: deterministic dataset order #1918

Merged
merged 1 commit into from
Mar 2, 2024

Conversation

adamjstewart
Collaborator

This is the only other place I found where we're using a set. @RitwikGupta maybe you can confirm that dataset order is not deterministic (even with setting a random seed) and confirm that this fixes it?
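For readers following along, here is a minimal sketch of the general pattern being discussed. It is not the actual torchgeo xView2 code; the function name `collect_files` and the file names are hypothetical. Iteration order over a set of strings is not guaranteed to be stable across Python processes, because string hashing is randomized per process unless `PYTHONHASHSEED` is fixed, so seeding a random number generator does not help. Sorting the deduplicated result gives a stable order.

```python
def collect_files(paths):
    """Deduplicate paths and return them in a stable, sorted order.

    Iterating over set(paths) directly may yield a different order in
    each Python process; sorted() removes that dependence.
    """
    return sorted(set(paths))


# Hypothetical file names, with one duplicate.
paths = ["b_post.png", "a_pre.png", "b_post.png", "c_pre.png"]
files = collect_files(paths)
```

The set is still used for deduplication here; only the final ordering changes.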

@adamjstewart adamjstewart added this to the 0.5.2 milestone Feb 28, 2024
@github-actions github-actions bot added the datasets Geospatial or benchmark datasets label Feb 28, 2024
@isaaccorley
Collaborator

I think this isn't as big of an issue in this case because the train/test sets are in separate folders. But if you randomly split the train set into train/val sets, you would run into determinism issues due to the set operation.
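To illustrate the random split concern, here is a small sketch using only the standard library. The helper `split_files` and the single-letter file names are hypothetical and stand in for whatever split utility is actually used. A seeded split picks the same indices every time, but which files those indices refer to depends on the order of the underlying list.

```python
import random


def split_files(files, val_fraction=0.25, seed=0):
    """Split files into (train, val) using a seeded index permutation."""
    rng = random.Random(seed)
    indices = list(range(len(files)))
    rng.shuffle(indices)
    n_val = int(len(files) * val_fraction)
    val = [files[i] for i in indices[:n_val]]
    train = [files[i] for i in indices[n_val:]]
    return train, val


order_a = ["a", "b", "c", "d"]
order_b = ["d", "c", "b", "a"]  # same files, different listing order

train_a, val_a = split_files(order_a)
train_b, val_b = split_files(order_b)  # same seed, different val file
train_c, val_c = split_files(sorted(order_b))  # sorting restores the split
```

With the same seed, the two listing orders put a different file in the validation set, while sorting first makes the split depend only on the seed and the set of files.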

@adamjstewart
Collaborator Author

You still have issues with determinism even without random split. The order you iterate over the dataset changes, so you can't reproduce training results.

@isaaccorley
Collaborator

Train dataloaders shuffle randomly by default on each epoch, but yeah, if you want to guarantee that the randomness is identical per seed, this definitely needs to be fixed.
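A rough sketch of the per-epoch shuffling point, again with the standard library rather than a real dataloader. The function `epoch_orders` and the file names are hypothetical. Each epoch gets a fresh shuffle, yet two runs with the same seed visit samples in the same order, provided the base order is stable to begin with.

```python
import random


def epoch_orders(files, seed, num_epochs):
    """Return the shuffled visiting order for each epoch, given a seed."""
    rng = random.Random(seed)
    orders = []
    for _ in range(num_epochs):
        order = list(files)
        rng.shuffle(order)  # new permutation every epoch, same RNG stream
        orders.append(order)
    return orders


# Sorting gives a stable base order regardless of set iteration order.
files = sorted({"c.png", "a.png", "b.png"})
run_1 = epoch_orders(files, seed=42, num_epochs=3)
run_2 = epoch_orders(files, seed=42, num_epochs=3)
```

If `files` were built by iterating over the set directly, the two runs could start from different base orders in different processes, and the seeded shuffles would no longer match.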

@isaaccorley isaaccorley merged commit faa312d into microsoft:main Mar 2, 2024
24 checks passed
@adamjstewart adamjstewart deleted the datasets/xview2 branch March 2, 2024 21:04
isaaccorley pushed a commit that referenced this pull request Mar 2, 2024

2 participants