
Can you upload resources to another cloud like Gdrive, Onedrive or Dropbox? #17

Closed
khiemledev opened this issue Oct 17, 2021 · 5 comments


@khiemledev

In my country, the download speed from Baidu is very slow and I can't download the needed resources. Could you please upload them to GDrive, OneDrive, or Dropbox? Thank you!

@luo3300612
Owner

Sorry, the whole file is ~70 GB, so I can't afford to upload it to GDrive/OneDrive. But you can follow the Data preparation step. There are 5 keys in my hdf5 feature file: the first three can be obtained when extracting region features with extract_region_feature.py, the fourth when extracting grid features with the code in grid-feats-vqa, and the last with align.ipynb.

@khiemledev
Author

khiemledev commented Oct 18, 2021

I used some tricks and successfully downloaded the files. Thank you for your reply!

I have another question. Can you please tell me how to produce the coco_train_ids.npy, coco_test_ids.npy, and coco_restval_ids.npy files for my own dataset, which is already in COCO format?

@luo3300612
Owner

coco_train_ids.npy is a (N,)-shaped numpy array, where N is the number of images for training. Each entry is an annotation id identifying an image-text pair in captions_train2014.json:

>>> import json
>>> info = json.load(open('captions_train2014.json'))
>>> annotations = info['annotations']
>>> print(annotations[0])
{'image_id': 318556, 'id': 48, 'caption': 'A very clean and well decorated empty bathroom'}

so the image features are stored under the keys 318556_features/boxes/size/grids/mask, and the corresponding caption is 'A very clean and well decorated empty bathroom'.

However, since the code is tightly coupled to the COCO dataset, it is recommended to rewrite dataset.py for your own dataset.

Alternatively, you can create the hdf5 file, the train/val/test _ids.npy files, and the captions_train2014.json/captions_val2014.json files for your own dataset.
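A minimal sketch of that alternative path: assuming your annotations follow the COCO captions schema shown above, each *_ids.npy file is just the annotation ids of that split collected into a (N,) array. The dict below is a hypothetical stand-in for `json.load(open('captions_train2014.json'))`:

```python
import json
import numpy as np

# Stand-in for the parsed captions_train2014.json (same schema;
# the second entry is a made-up placeholder).
info = {
    'annotations': [
        {'image_id': 318556, 'id': 48,
         'caption': 'A very clean and well decorated empty bathroom'},
        {'image_id': 123456, 'id': 49,
         'caption': 'a placeholder caption for a second image'},
    ]
}

# Collect the annotation ids of this split into a (N,) array and save it.
train_ids = np.array([ann['id'] for ann in info['annotations']], dtype=np.int64)
np.save('coco_train_ids.npy', train_ids)

# Round-trip check: the saved file holds exactly the ids of the split.
loaded = np.load('coco_train_ids.npy')
print(loaded.shape, loaded.tolist())  # (2,) [48, 49]
```

For the val/test/restval files, repeat this with disjoint subsets of your annotations; every id you save must also exist in the captions JSON the dataloader reads.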

Hope it helps.

@khiemledev
Author

It's very helpful. Thank you very much!

@YuigaWada

For those who don't have a Baidu account, I created a mirror of the data distributed on Baidu Pan. You can download the data from this link without logging in.
Use at your own risk :)
(also related to #36 issue)
