GitHub - LadyDarwe/ImageDB: AI-Generated Images as Data Source: The Dawn of Synthetic Era

This project is associated with our survey paper which comprehensively contextualizes the advance of the recent AI-Generated Images as Data Source (AIGS) and visual AIGC by formulating taxonomies according to methodologies and applications.

AI-Generated Images as Data Source: The Dawn of Synthetic Era [Paper]
Zuhao Yang, Fangneng Zhan, Kunhao Liu, Muyu Xu, Shijian Lu
arXiv, 2023

You are welcome to promote papers via pull request.
The process to submit a pull request:

a. Fork the project into your own repository.
b. Add the Title, Author, Conference, Paper link, Project link, and Code link in README.md with below format:

**Title**<br>
*Author*<br>
Conference
[[Paper](Paper link)]
[[Project](Project link)]
[[Code](Code link)]
[[Video](Video link)]

c. Submit the pull request to this branch.

Related Surveys & Projects

Machine Learning for Synthetic Data Generation: A Review
Yingzhou Lu, Minjie Shen, Huazheng Wang, Wenqi Wei
arXiv 2023 [Paper]

Synthetic Data in Human Analysis: A Survey
Indu Joshi, Marcel Grimmer, Christian Rathgeb, Christoph Busch, Francois Bremond, Antitza Dantcheva
arXiv 2022 [Paper]

A Review of Synthetic Image Data and Its Use in Computer Vision
Keith Man, Javaan Chahl
J. Imaging 2022 [Paper]

Survey on Synthetic Data Generation, Evaluation Methods and GANs
Alvaro Figueira, Bruno Vaz
Mathematics 2022 [Paper]

Table of Contents (Work in Progress)

Synthetic Data for Training:

Dataset Generation
Data Augmentation

Synthetic Data Application:

Visual Understanding
Visual Generation
Self-supervised Learning
Robotics
Autonomous Driving

Datasets:

Datasets

Dataset Generation

DatasetGAN: Efficient Labeled Data Factory with Minimal Human Effort
Yuxuan Zhang, Huan Ling, Jun Gao, Kangxue Yin, Jean-Francois Lafleche, Adela Barriuso, Antonio Torralba, Sanja Fidler
CVPR 2021 [Paper][Project][Code]

BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations
Daiqing Li, Huan Ling, Seung Wook Kim, Karsten Kreis, Adela Barriuso, Sanja Fidler, Antonio Torralba
CVPR 2022 [Paper][Project][Code]

HandsOff: Labeled Dataset Generation With No Additional Human Annotations

On the generation of realistic synthetic petrographic datasets using a style-based GAN

Learning to Annotate Part Segmentation with Gradient Matching

Histopathology DatasetGAN: Synthesizing Large-Resolution Histopathology Datasets

Application of DatasetGAN in medical imaging: preliminary studies

Medical Image Segmentation Using Deep Learning: A Survey

Transformation-Consistent Self-Ensembling Model for Semisupervised Medical Image Segmentation

A Data Augmentation Pipeline to Generate Synthetic Labeled Datasets of 3D Echocardiography Images Using a GAN

Data Augmentation

A data augmentation perspective on diffusion models and retrieval
Max F. Burg, Florian Wenzel, Dominik Zietlow, Max Horn, Osama Makansi, Francesco Locatello, Chris Russell
arXiv 2023 [Paper]

Effective Data Augmentation With Diffusion Models
Brandon Trabucco, Kyle Doherty, Max Gurinas, Ruslan Salakhutdinov
arXiv 2023 [Paper][Project]

Skin Lesion Classification Using GAN based Data Augmentation
Rashid Haroon, Tanveer M. Asjid, Aqeel Khan Hassan
EMBC 2019 [Paper]

GAN-based synthetic medical image augmentation for increased CNN performance in liver lesion classification

Data augmentation generative adversarial networks

Gan augmentation: Augmenting training data using generative adversarial networks

Enhancement of Image Classification Using Transfer Learning and GAN-Based Synthetic Data Augmentation

Diversify your vision datasets with automatic diffusion-based augmentation

Visual Understanding

Classification

Is synthetic data from generative models ready for image recognition?
Ruifei He, Shuyang Sun, Xin Yu, Chuhui Xue, Wenqing Zhang, Philip Torr, Song Bai, Xiaojuan Qi
ICLR 2023 [Paper][Code]

Synthetic Data from Diffusion Models Improves ImageNet Classification
Shekoofeh Azizi, Simon Kornblith, Chitwan Saharia, Mohammad Norouzi, David J. Fleet
arXiv 2023 [Paper]

Adapting Pretrained Vision-Language Foundational Models to Medical Imaging Domains
Pierre Chambon, Christian Bluethgen, Curtis P. Langlotz, Akshay Chaudhari
NeurIPS 2022 [Paper]

OpenGAN: Open-Set Recognition via Open Data Generation
Shu Kong, Deva Ramanan
ICCV 2021 [Paper][Project][Code][Video]

Image Captions are Natural Prompts for Text-to-Image Models

Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels

Segmentation

Learning Semantic Segmentation from Synthetic Data: A Geometrically Guided Input-Output Adaptation Approach
Yuhua Chen, Wen Li, Xiaoran Chen, Luc Van Gool
CVPR 2019 [Paper]

Semantic Segmentation with Generative Models: Semi-Supervised Learning and Strong Out-of-Domain Generalization
Daiqing Li, Junlin Yang, Karsten Kreis, Antonio Torralba, Sanja Fidler
CVPR 2021 [Paper][Project][Code]

Repurposing GANs for One-shot Semantic Part Segmentation
Nontawat Tritrong, Pitchaporn Rewatbowornwong, Supasorn Suwajanakorn
CVPR 2021 [Paper][Project][Code]

Diffusion Models for Zero-Shot Open-Vocabulary Segmentation
Laurynas Karazija, Iro Laina, Andrea Vedaldi, Christian Rupprecht
arXiv 2023 [Paper]

DifFSS: Diffusion Model for Few-Shot Semantic Segmentation
Weimin Tan, Siyuan Chen, Bo Yan
arXiv 2023 [Paper]

Semantic Segmentation with Generative Models: Semi-Supervised Learning and Strong Out-of-Domain Generalization

Few-shot 3D Multi-modal Medical Image Segmentation using Generative Adversarial Learning

Segmentation in Style: Unsupervised Semantic Image Segmentation with Stylegan and CLIP

ODISE

Detection

DALL-E for Detection: Language-driven Compositional Image Synthesis for Object Detection
Yunhao Ge, Jiashu Xu, Brian Nlong Zhao, Neel Joshi, Laurent Itti, Vibhav Vineet
arXiv 2022 [Paper]

Explore the Power of Synthetic Data on Few-shot Object Detection
Shaobo Lin, Kun Wang, Xingyu Zeng, Rui Zhao
CVPR 2023 [Paper]

The Big Data Myth: Using Diffusion Models for Dataset Generation to Train Deep Detection Models
Roy Voetman, Maya Aghaei, Klaas Dijkstra
arXiv 2023 [Paper]

IMAGINARYNET: LEARNING OBJECT DETECTORS WITHOUT REAL IMAGES AND ANNOTATIONS

Integrating Geometric Control into Text-to-Image Diffusion Models for High-Quality Detection Data Generation via Text Prompt

Visual Generation

Re-Aging GAN: Toward Personalized Face Age Transformation
Farkhod Makhmudkhujaev, Sungeun Hong, and In Kyu Park
ICCV 2021 [Paper][Video]

Only a Matter of Style: Age Transformation Using a Style-Based Regression Model
Yuval Alaluf, Or Patashnik, Daniel Cohen-Or
SIGGRAPH 2021 [Paper][Project][Code][Video]

Production-Ready Face Re-Aging for Visual Effects
Gaspard Zoss, Prashanth Chandran, Eftychios Sifakis, Markus Gross, Paulo Gotardo, Derek Bradley
TOG 2021 [Paper][Project][Video]

Zero-1-to-3: Zero-shot One Image to 3D Object
Ruoshi Liu, Rundi Wu, Basile Van Hoorick, Pavel Tokmakov, Sergey Zakharov, Carl Vondrick
ICCV 2023 [Paper][Project][Code]

DreamBooth3D: Subject-Driven Text-to-3D Generation
Amit Raj, Srinivas Kaza, Ben Poole, Michael Niemeyer, Nataniel Ruiz, Ben Mildenhall, Shiran Zada, Kfir Aberman, Michael Rubinstein, Jonathan Barron, Yuanzhen Li, Varun Jampani
arXiv 2023 [Paper][Project][Video]

StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation
Chi Zhang, Yiwen Chen, Yijun Fu, Zhenglin Zhou, Gang YU, Billzb Wang, Bin Fu, Tao Chen, Guosheng Lin, Chunhua Shen
arXiv 2023 [Paper]

Self-supervised Learning

Generative Models as a Data Source for Multiview Representation Learning
Ali Jahanian, Xavier Puig, Yonglong Tian, Phillip Isola
ICLR 2022 [Paper][Project][Code][Video]

StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners
Yonglong Tian, Lijie Fan, Phillip Isola, Huiwen Chang, Dilip Krishnan
arXiv 2023 [Paper]

Ensembling with Deep Generative Views

DreamTeacher: Pretraining Image Backbones with Deep Generative Models

Robotics

NeRF-Supervision: Learning Dense Object Descriptors from Neural Radiance Fields
Thomas Lips, Victor-Louis De Gusseme, Francis wyffels
ICRA 2022 [Paper][Project][Code][Video]

INeRF: Inverting Neural Radiance Fields for Pose Estimation

VMRF: View Matching Neural Radiance Fields

LENS: Localization enhanced by NeRF synthesis

Gan-based neural radiance field without posed camera

NeRF-Pose: A First-Reconstruct-Then-Regress Approach for Weakly-supervised 6D Object Pose Estimation

Vision-only robot navigation in a neural radiance world

Event-based Camera Tracker by ∇t NeRF

Autonomous Driving

Lift3D: Synthesize 3D Training Data by Lifting 2D GAN to 3D Generative Radiance Field

3D Data Augmentation for Driving Scenes on Camera

UniSim: A Neural Closed-Loop Sensor Simulator

MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving

Datasets

DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative Models
Zijie J. Wang, Evan Montoya, David Munechika, Haoyang Yang, Benjamin Hoover, Duen Horng Chau
ACL 2023 [Paper]Project][Code]

JourneyDB: A Benchmark for Generative Image Understanding
Junting Pan, Keqiang Sun, Yuying Ge, Hao Li, Haodong Duan, Xiaoshi Wu, Renrui Zhang, Aojun Zhou, Zipeng Qin, Yi Wang, Jifeng Dai, Yu Qiao, Hongsheng Li
arXiv 2023 [Paper][Project][Code]

GenImage: A Million-Scale Benchmark for Detecting AI-Generated Image
Mingjian Zhu, Hanting Chen, Qiangyu Yan, Xudong Huang, Guanyu Lin, Wei Li, Zhijun Tu, Hailin Hu, Jie Hu, Yunhe Wang
arXiv 2023 [Paper][Project][Code]

DiffusionDB (https://huggingface.co/datasets/poloclub/diffusiondb)

JourneyDB (https://docs.google.com/forms/d/e/1FAIpQLSeiciK0g0IA46_hFaitRhdpihhpjqt3helJNT68y-C8MfKhiQ/viewform?pli=1)

GenImage (https://pan.baidu.com/share/init?surl=i0OFqYN5i6oFAxeK6bIwRQ#list/path=%2F)

Citation

If you use this code for your research, please cite our papers.

@article{yang2023aigs,
  title={AI-Generated Images as Data Source: The Dawn of Synthetic Era},
  author={Zuhao Yang and Fangneng Zhan and Kunhao Liu and Muyu Xu and Shijian Lu},
  journal={arXiv preprint arXiv:2310.01830},
  year={2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
README.md		README.md
cite.bib		cite.bib
teaser.png		teaser.png
title.png		title.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Related Surveys & Projects

Table of Contents (Work in Progress)

Dataset Generation

Data Augmentation

Visual Understanding

Classification

Segmentation

Detection

Visual Generation

Self-supervised Learning

Robotics

Autonomous Driving

Datasets

Citation

About

Uh oh!

Releases

Packages

Languages

LadyDarwe/ImageDB

Folders and files

Latest commit

History

Repository files navigation

Related Surveys & Projects

Table of Contents (Work in Progress)

Dataset Generation

Data Augmentation

Visual Understanding

Classification

Segmentation

Detection

Visual Generation

Self-supervised Learning

Robotics

Autonomous Driving

Datasets

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages