This is the repository for Separate-and-Enhance: Compositional Finetuning for Text2Image Diffusion Models, published at SIGGRAPH 2024.
[Project Page] [Paper]
Build the conda environment by running:

```
conda create -n sepen python=3.10
conda activate sepen
pip install -r requirement.txt
```
See src/run_individual.sh for a sample training script, and src/run_large.sh for a sample large-scale training script.
See src/sample.py for reference.
Install clean-fid via pip install clean-fid, then refer to src/eval/fid/eval_fid.py for FID evaluation.
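Under the hood, FID is the Fréchet distance between two Gaussians fitted to Inception features of the real and generated image sets; clean-fid handles the feature extraction, but the distance itself reduces to a short formula. A minimal numpy sketch (illustrative, not the repo's implementation):

```python
import numpy as np

def frechet_distance(mu1, sigma1, mu2, sigma2):
    """Frechet distance between Gaussians N(mu1, sigma1) and N(mu2, sigma2).

    FID = ||mu1 - mu2||^2 + Tr(sigma1 + sigma2 - 2 * sqrtm(sigma1 @ sigma2))
    """
    diff = mu1 - mu2
    # Matrix square root of sigma1 @ sigma2 via eigendecomposition
    # (assumes the product is diagonalizable with non-negative eigenvalues,
    # which holds for the covariance products seen in practice).
    prod = sigma1 @ sigma2
    eigvals, eigvecs = np.linalg.eig(prod)
    covmean = (eigvecs * np.sqrt(np.maximum(eigvals.real, 0))) @ np.linalg.inv(eigvecs)
    return float((diff @ diff + np.trace(sigma1 + sigma2 - 2 * covmean)).real)
```

For example, two identical Gaussians give a distance of 0, and shifting the mean by a unit vector under identity covariances gives exactly 1.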
We adopt the implementation from Attend-and-Excite (A&E). See src/eval/blip/eval_blip.py for BLIP similarity score evaluation.
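The BLIP similarity score boils down to the cosine similarity between the BLIP image embedding and the text embedding of the prompt. A minimal sketch of that final step (the embeddings come from the BLIP model; the function name here is illustrative):

```python
import numpy as np

def blip_similarity(image_emb, text_emb):
    """Cosine similarity between an image embedding and a text embedding."""
    # L2-normalize both embeddings, then take their dot product.
    image_emb = image_emb / np.linalg.norm(image_emb)
    text_emb = text_emb / np.linalg.norm(text_emb)
    return float(image_emb @ text_emb)
```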
Clone and build Detic from its official repo, then move the Python files under src/eval/detic into the cloned folder. See src/eval/detic/eval_detic.py for details.
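Detic is used to check whether the prompted objects actually appear in each generated image. A hypothetical sketch of how per-image detections could be aggregated into a compositional success rate (the function and data layout are assumptions for illustration, not the repo's API):

```python
def detection_success_rate(required_pairs, detections):
    """Fraction of images in which both required concepts were detected.

    required_pairs: list of (concept_a, concept_b) tuples, one per image.
    detections: list of sets of class names detected in each image.
    """
    hits = sum(
        1 for (a, b), found in zip(required_pairs, detections)
        if a in found and b in found
    )
    return hits / len(required_pairs)
```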
The 220 concepts used for the large-scale experiment are listed in src/concepts/large_scale.py, and the 200 evaluation prompts are in src/concepts/large_test.txt.
Part of our code is inspired by Custom Diffusion and Attend-and-Excite. We leverage Detic and clean-fid for our evaluation.
```
@inproceedings{bao2024sepen,
  Author = {Bao, Zhipeng and Li, Yijun and Singh, Krishna Kumar and Wang, Yu-Xiong and Hebert, Martial},
  Title = {Separate-and-Enhance: Compositional Finetuning for Text2Image Diffusion Models},
  Booktitle = {SIGGRAPH},
  Year = {2024},
}
```