Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PIL.UnidentifiedImageError when downloading dataset with missing image #69

Closed
stevensagaert opened this issue May 9, 2023 · 0 comments

Comments

@stevensagaert
Copy link

stevensagaert commented May 9, 2023

Due to the fact that currently there is no transactional protection between metadata en image data it's possible to get a corrupted dataset where the metadata has an image path and marks a sample as annotated & reviewed but there is no actual image. When you try to download such a dataset you get a PIL.UnidentifiedImageError.

The following modified dataset.py has the necessary minimal changes to be able to download such a dataset but of course this solution isn't fully satisfactory because the client still has to deal with corrupt data (in this case missing images). The real good solution would be to prevent such corruption in the first place. Note that even in this case some changes to dataset.p error handling are still useful because some setting of dummy metadata in case of an error just didn't work (in wrong branch).
Note that the changes also include the change for issue #68
dataset.zip

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants