Question about switching datasets #3

Closed
Leng-bingo opened this issue Nov 26, 2022 · 4 comments

@Leng-bingo

If I want to switch to my own dataset, do I only need to follow the format below?

MSCOCO_Caption/
├──annotations/
│  ├── captions_train2014.json
│  ├── captions_val2014.json
├──train2014/
│  ├── COCO_train2014_000000000009.jpg
│  ├── ......
├──val2014/ 
│  ├── COCO_val2014_000000000042.jpg
│  ├── ......
@Leng-bingo
Author

Also, where can I download the pre-split JSON files listed below, and what does their format look like? Thanks again.

├──annotations/
│  ├── captions_train2014.json
│  ├── captions_val2014.json

@buxiangzhiren
Owner

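You need to generate the .pkl file yourself; any data made of paired images and text will do. As for the JSON files, you can load them yourself and take a look.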

@buxiangzhiren
Owner

buxiangzhiren commented Nov 26, 2022

The JSON files are provided by the COCO dataset itself.
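
For anyone who wants to see the format before building their own dataset, here is a minimal sketch of loading a COCO caption annotation file and grouping captions by image. It relies on the standard COCO caption layout (a top-level "images" list with "id" and "file_name", and an "annotations" list with "image_id" and "caption"). The function name `load_caption_pairs` is made up for illustration and is not part of this repository.

```python
import json
from collections import defaultdict


def load_caption_pairs(annotation_path):
    """Return {image_id: {"file_name": str, "captions": [str, ...]}}.

    Hypothetical helper: assumes the standard COCO caption JSON layout.
    """
    with open(annotation_path, "r", encoding="utf-8") as f:
        data = json.load(f)

    # "images" entries carry the file name; "annotations" carry one caption each.
    file_names = {img["id"]: img["file_name"] for img in data["images"]}

    pairs = defaultdict(lambda: {"file_name": None, "captions": []})
    for ann in data["annotations"]:
        entry = pairs[ann["image_id"]]
        entry["file_name"] = file_names.get(ann["image_id"])
        entry["captions"].append(ann["caption"].strip())
    return dict(pairs)
```

Calling `load_caption_pairs("annotations/captions_train2014.json")` and printing one entry should show which fields a custom dataset needs to supply: an image identifier, a file name, and one or more captions per image.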

@rongtongxueya

Hi, I ran into the same problem, but I don't know how to generate the .pkl file. I printed its contents and saw that, compared with the JSON file, the .pkl file has an additional clip_embedding field. I don't know how to obtain this embedding. Could someone kindly give me some advice?
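
One possible way to build such a file is sketched below, under stated assumptions. The thread only confirms that the .pkl holds image-text pairs plus a clip_embedding field, so everything else here is assumed: the exact layout (a top-level "clip_embedding" list plus a "captions" list whose entries store an index into it), the function name `build_caption_pkl`, and the `embed_image` callable. That callable stands in for whatever CLIP image encoder you use and is deliberately left abstract. Print the repository's own .pkl to confirm the real keys before relying on this.

```python
import pickle


def build_caption_pkl(pairs, embed_image, out_path):
    """Write image-text pairs plus one embedding per pair to a pickle file.

    pairs:       iterable of (image_path, caption) tuples.
    embed_image: hypothetical callable mapping an image path to an embedding
                 (for example, a wrapper around a CLIP image encoder).
    out_path:    destination .pkl path.
    The layout below is an assumption, not the repository's confirmed format.
    """
    embeddings = []
    captions = []
    for image_path, caption in pairs:
        vector = embed_image(image_path)
        captions.append({
            "image_path": image_path,
            "caption": caption,
            # Index of this pair's vector in the "clip_embedding" list.
            "clip_embedding": len(embeddings),
        })
        embeddings.append(vector)

    payload = {"clip_embedding": embeddings, "captions": captions}
    with open(out_path, "wb") as f:
        pickle.dump(payload, f)
    return payload
```

Keeping the encoder behind a plain callable means the same function works whether the embeddings are tensors, arrays, or lists, and it can be checked with a dummy encoder before running a real model over the whole dataset.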
