New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cannot load image from CC3M #13
Comments
Hi, I used this code to download the dataset: |
Thanks. How about the first question? The code cannot identify CC3M images (the path is correct, and images do exists), while it can identify images from other datasets. |
It could be because that the image is not downloaded correctly, so PIL cannot load it. |
Thanks for your help! |
How to down CC12M dataset ? can you share the download tool? |
Hi, I simply modified the download code for CC3M, the format between the two is very similar. |
This is for CC3M rather than cc12. |
Yes, you can slightly modify the code to download cc12. |
Get the following error:
PIL.UnidentifiedImageError: cannot identify image file '/home/ubuntu/data/CC3M/DownloadConceptualCaptions/validation/10481_3355970027'
The error is generated by this code in
caption_dataset.py
:image = Image.open(ann['image']).convert('RGB')
BTW, I can only download 2.4M images from CC3M/training, how did you download 2.95M images? Thanks.
The text was updated successfully, but these errors were encountered: