Skip to content

li-xirong/flickr8kcn

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 

Repository files navigation

Flickr8K-CN

Flickr8K-CN is a bilingual (English-to-Chinese) extension of the popular Flickr8K set, used for evaluating image captioning in a cross-lingual setting.

Chinese sentences Flickr8k-train Flickr8k-val Flickr8k-test
human written
human translation
machine translation (baidu)
machine translation (google)

Data

Sentences

  1. Original English sentences
  2. Chinese sentences written by native Chinese speakers
  3. Chinese sentences generated by Baidu translation (icmr2016 version, version 20160815)
  4. Chinese sentences generated by Google translation (icmr2016 version, version 20160816)
  5. Chinese sentences generated by human translation (only the test set is covered)

Dataset split

Image features

  1. 1,024-dim GoogleNet pool5, read by bigfile.py

Citations

  1. Xirong Li, Weiyu Lan, Jianfeng Dong, Hailong Liu, Adding Chinese Captions to Images, ACM ICMR 2016

About

A bilingual dataset for image captioning

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published