Few-shot Image Generation with Reptile: the dataset
Clone or download
Latest commit 4ac686f Jan 9, 2019
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
Data Last batch Jan 4, 2019
.gitattributes First batch Jan 3, 2019
LICENSE Create LICENSE Jan 3, 2019
README.md updated info Jan 9, 2019
data.csv First batch Jan 3, 2019
dataset_explanation.png Last batch Jan 4, 2019

README.md

FIGR-8

Notice: This dataset is still under construction. The total number of images and classes is not definitive.

The FIGR-8 database is a dataset containing 18409 classes of 1548944 images representing pictograms, ideograms, icons, emoticons or object or conception depictions. Its aim is to set a benchmark for Few-shot Image Generation tasks, albeit not being limited to it. Each image is represented by 192x192 pixels with grayscale value of 0-255. Classes are not balanced (they do not all contain the same number of elements), but they all do contain at the very least 8 images.

The main contribution of this dataset to the community is the abundance of image classes, which is valuable for tasks in which encompassing an abundance of concepts can be useful or even necessary. This was the case in our paper, FIGR: Few-shot Image Generation with Reptile.

The dataset is readily available for people who want to give a shot at few-shot image generation techniques, while spending minimal effort on gathering a full-size database. We also encourage this dataset to enable reproduction efforts from the community.

Contents

  • Data/
    • All images are contained in subfolders inside Data folder.
  • data.csv with the following information in order (see below for example):
    • Category number. All items from the same category name have the same category number. One category number per category name.
    • Category name. Also called "class". Represents what is depicted in the image.
    • Path. Represented as Category name/image_id. All images are in 192x192 with .png extension.
    • Artist. The artist who designed the logo, the icon or the pictogram.
    • Format. 192x192 for all images.
    • License. The license type which the image is subject to. Refer to section License for more information.

Sample from data.csv:

Category number Category name Path Artist Format License
14019 Caballo Caballo/141612-200.png Ximena 192x192 Creative Commons
15020 Irrigation water Irrigation water/1667588-200.png Ronan Bolaños 192x192 Creative Commons
... ... ... ... ... ...
1566 Store Hot Coffee Store Hot Coffee/1320869-200.png Mat fine 192x192 Creative Commons
... ... ... ... ... ...

License

Most images in the dataset have been licensed under a Creative Commons License by their author, indicating that their reproduction on any material intended to be sold or to be made profit from is strictly prohibited. However, use in which the author's name is indicated is permitted. More details can be found here.

The dataset itself (FIGR-8) is protected under the MIT License.

Alternate Download

Alternatively, you can download the dataset from here.

Acknowledgement

Images were gathered from The Nounji App, available on the App Store.

If you use this database for your own projects, please consider citing the following paper:

@article{FIGR2019,
author = {Louis Clouâtre and Marc Demers},
title = {FIGR: Few-shot Image Generation with Reptile},
journal = {CoRR},
volume = {abs/1901.02199},
year = 2019,
ee = {http://arxiv.org/abs/1901.02199},
month = jan,
archiveprefix = “arXiv”,
number = “1901.02199v1”,
eprint = “1901.02199v1”,
primaryclass = “cs.CV”,
nonrefereed = “true”
}