Skip to content
Switch branches/tags
Go to file

Visually29K: a large-scale curated infographics dataset

This code is associated with the following project page:

In this repo, we provide metadata and annotations for thousands of infographics, for various computer vision and natural language tasks. We used this data in the reports: and

To learn how to use the data: howto.ipynb

If you use the data or code in this git repo, please consider citing:

    author    = {Spandan Madan*, Zoya Bylinskii*, Matthew Tancik*, Adrià Recasens, Kimberli Zhong, Sami Alsheikh, Hanspeter Pfister, Aude Oliva, Fredo Durand}
    title     = {Synthetically Trained Icon Proposals for Parsing and Summarizing Infographics},
    booktitle = {arXiv preprint arXiv:1807.10441},
    url       = {},
    year      = {2018}
    author    = {Zoya Bylinskii*, Sami Alsheikh*, Spandan Madan*, Adria Recasens*, Kimberli Zhong, Hanspeter Pfister, Fredo Durand, Aude Oliva}
    title     = {Understanding infographics through textual and visual tag prediction},
    booktitle = {arXiv preprint arXiv:1709.09215},
    url       = {},
    year      = {2017}


A large-scale infographics dataset from with metadata and additional crowdsourced annotations




No releases published


No packages published