A collection of arbitrary text to image papers with code (constantly updating)


A collection of papers on arbitrary kinds of text-to-image generation, organized by Tzu-Heng Lin and Haoran Mo.

Papers are ordered by the submission date of their first arXiv version (if applicable).

Feel free to open a PR or an issue.


general text to image

| Note | Model | Paper | Conference | paper link | code link |
|---|---|---|---|---|---|
| | GAN-INT-CLS | Generative Adversarial Text to Image Synthesis | ICML 2016 | 1605.05396 | reedscot/icml2016 |
| | StackGAN | StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks | ICCV 2017 | 1612.03242 | hanzhanggit/StackGAN |
| | StackGAN++ | StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks | TPAMI 2018 | 1710.10916 | hanzhanggit/StackGAN-v2 |
| | AttnGAN | AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks | CVPR 2018 | 1711.10485 | taoxugit/AttnGAN |
| | HD-GAN | Photographic Text-to-Image Synthesis with a Hierarchically-nested Adversarial Network | CVPR 2018 | 1802.09178 | ypxie/HDGan |
| | StoryGAN | StoryGAN: A Sequential Conditional GAN for Story Visualization | CVPR 2019 | 1812.02784 | yitong91/StoryGAN |
| | MirrorGAN | MirrorGAN: Learning Text-to-image Generation by Redescription | CVPR 2019 | 1903.05854 | |
| | DM-GAN | DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-to-Image Synthesis | CVPR 2019 | 1904.01310 | |
| | SD-GAN | Semantics Disentangling for Text-to-Image Generation | CVPR 2019 | 1904.01480 | |

scene graph/layout to image

| Note | Model | Paper | Conference | paper link | code link |
|---|---|---|---|---|---|
| | GAWWN | Learning What and Where to Draw | NIPS 2016 | 1610.02454 | reedscot/nips2016 |
| | | Inferring Semantic Layout for Hierarchical Text-to-Image Synthesis | CVPR 2018 | 1801.05091 | |
| | sg2im | Image Generation from Scene Graphs | CVPR 2018 | 1804.01622 | google/sg2im |
| | Text2Scene | Text2Scene: Generating Abstract Scenes from Textual Descriptions | CVPR 2019 | 1809.01110 | uvavision/Text2Image |
| | Layout2Im | Image Generation from Layout | CVPR 2019 | 1811.11389 | |
| | LayoutGAN | LayoutGAN: Generating Graphic Layouts with Wireframe Discriminator | ICLR 2019 | openreview | |
| | Object Pathways | Generating Multiple Objects at Spatially Distinct Locations | ICLR 2019 | 1901.00686 | tohinz/multiple-objects-gan |
| | | Using Scene Graph Context to Improve Image Generation | | 1901.03762 | |
| | Obj-GAN | Object-driven Text-to-Image Synthesis via Adversarial Training | CVPR 2019 | 1902.10740 | jamesli1618/Obj-GAN |

dialog to image

| Note | Model | Paper | Conference | paper link | code link |
|---|---|---|---|---|---|
| | CoDraw | CoDraw: Visual Dialog for Collaborative Drawing | | 1712.05558 | CoDraw dataset |
| | ChatPainter | ChatPainter: Improving Text to Image Generation using Dialogue | ICLRW 2018 | 1802.08216 | |
| | | Keep Drawing It: Iterative language-based image generation and editing | NIPSW 2018 | 1811.09845 | CLEVR dataset |
| | Chat-crowd | Chat-crowd: A Dialog-based Platform for Visual Layout Composition | | 1812.04081 | uvavision/chat-crowd |
| | SeqAttnGAN | Sequential Attention GAN for Interactive Image Editing via Dialogue | | 1812.08352 | |