Skip to content

Paper: "From Text to Pose to Image: Improving Diffusion Model Control and Quality"

License

Notifications You must be signed in to change notification settings

clement-bonnet/text-to-pose

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

From Text to Pose to Image: Improving Diffusion Model Control and Quality

This repository contains the code for the paper From Text to Pose to Image: Improving Diffusion Model Control and Quality, published at the NeurIPS 2024 Workshop on Compositional Learning: Perspectives, Methods, and Paths Forward (link to workshop).

Standard text-to-image generation Ours: text-to-pose-to-image generation

Text To Pose

Text-to-pose transformer architecture

Pose Adapter

Generated poses using the Tencent pose adapter and ours

Citation

If you use this paper in your work, please cite the paper using the following BibTeX entry:

@misc{bonnet2024textposeimageimproving,
      title={From Text to Pose to Image: Improving Diffusion Model Control and Quality}, 
      author={Clément Bonnet and Ariel N. Lee and Franck Wertel and Antoine Tamano and Tanguy Cizain and Pablo Ducru},
      year={2024},
      eprint={2411.12872},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2411.12872}, 
}

About

Paper: "From Text to Pose to Image: Improving Diffusion Model Control and Quality"

Resources

License

Stars

Watchers

Forks