

Concept Decomposition for Visual Exploration and Inspiration

Yael Vinker, Andrey Voynov, Daniel Cohen-Or, Ariel Shamir
Our method provides a tree-structured visual exploration space for a given unique concept. The nodes of the tree ("v_i") are newly learned textual vector embeddings, injected into the latent space of a pretrained text-to-image model. The nodes encode different aspects of the subject of interest. By examining combinations within and across trees, the different aspects can inspire the creation of new designs and concepts, as shown below.

Setup

git clone https://github.com/yael-vinker/inspiration_tree.git

Environment (with pip)

Our code relies on the environment of the official Stable Diffusion repository. To set it up with pip, run:

python -m venv .tree_venv
source .tree_venv/bin/activate
pip install -r requirements.txt

Technical details:

  • CUDA 11.6
  • torch 1.7.1+cu110
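To quickly verify that the environment sees the GPU, you can run the following sanity check (our suggestion, not part of the repository):

import torch
# Expect versions consistent with the pins above, and True on a CUDA machine.
print(torch.__version__, torch.version.cuda)
print(torch.cuda.is_available())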

Hugging Face Diffusers Library
Our code relies on the diffusers library and the official Stable Diffusion v1.4 model.
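For reference, a minimal sketch of loading this model with diffusers (the model id is the one published on the Hugging Face Hub; the dtype and device choices are ours, not prescribed by the repository):

import torch
from diffusers import StableDiffusionPipeline

# Load the official Stable Diffusion v1.4 weights from the Hugging Face Hub.
pipe = StableDiffusionPipeline.from_pretrained(
    "CompVis/stable-diffusion-v1-4",
    torch_dtype=torch.float16,  # half precision to fit smaller GPUs (our choice)
)
pipe = pipe.to("cuda")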

Usage


This code allows you to generate a tree per concept and to experiment with different prompts and combinations for the generated tree (see "inspiration_tree_playground.ipynb").

Pretrained Models and Datasets

As part of our code release and to assist with comparisons, we have also provided some of the trained models and datasets used in the paper.

All of our models (the learned tokens from the paper) can be found under the "learned_tokens" directory. The notebook inspiration_tree_playground.ipynb shows how to load them and reproduce the results from the paper (under "Play with learned aspects from the paper"). All datasets used from Textual Inversion can be found under "datasets".
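As a rough sketch of what loading a learned token can look like (continuing from the pipeline loaded above; the file path, and the assumption that each checkpoint is a torch-saved dict mapping a placeholder string to its embedding, are ours; the notebook contains the authoritative loading code):

import torch

# Hypothetical checkpoint path; see learned_tokens/ and the notebook for actual names.
ckpt = torch.load("learned_tokens/cat_sculpture/v1.bin", map_location="cpu")
token, embedding = next(iter(ckpt.items()))

# Register the placeholder token and inject its learned embedding into the
# text encoder of the pipeline ("pipe") loaded in the snippet above.
pipe.tokenizer.add_tokens(token)
pipe.text_encoder.resize_token_embeddings(len(pipe.tokenizer))
token_id = pipe.tokenizer.convert_tokens_to_ids(token)
pipe.text_encoder.get_input_embeddings().weight.data[token_id] = embedding.to(
    pipe.text_encoder.dtype
)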

Generate your tree

The logic for generating the tree is in "main_multiseed.py", which runs the framework for a single node. You can generate the tree by passing your own parameters to main_multiseed.py. An example is given in "run_decompose.sh":

python main_multiseed.py --parent_data_dir "cat_sculpture/" --node v0 --test_name "v0" --GPU_ID "${GPU_ID}" --multiprocess 0

Notes:

  • The "test_name" should be identical to the chosen node Results:
  • The results will be saved to "outputs/<parent_data_dir>/"
  • In "final_samples.jpg" you can see a batch of random samples of the learned nodes
  • Under "consistency_test" we save the results of the seed selection procedure
  • Once training is finished, you can continue with splitting the generated nodes as well
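For example, to split one of the learned child nodes further, the same script can be invoked again (a sketch reusing the flags from "run_decompose.sh"; the assumption that child nodes keep the v1, v2, ... naming is ours):

python main_multiseed.py --parent_data_dir "cat_sculpture/" --node v1 --test_name "v1" --GPU_ID "${GPU_ID}" --multiprocess 0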

Inference

We provide a notebook ("inspiration_tree_playground.ipynb") for playing with the results; this should be the most convenient option. Notes:

  • The notebook shows the concepts learned in each node
  • You can also load checkpoints from early iterations
  • You can generate new concepts using natural language sentences (a sketch follows below)
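As a rough illustration of combining learned aspects in a sentence (assuming "pipe" and the placeholder tokens were loaded as in the snippets above; the prompt and the "<v1>"/"<v2>" token names are hypothetical):

# "<v1>" and "<v2>" stand for learned placeholder tokens (hypothetical names;
# use the tokens actually stored under learned_tokens/).
prompt = "a plush toy in the style of <v1>, decorated with <v2> patterns"
image = pipe(prompt, num_inference_steps=50, guidance_scale=7.5).images[0]
image.save("combined_aspects.png")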

Acknowledgements

Our code builds on the diffusers implementation of textual inversion.

Citation

If you find this useful for your research, please cite the following:

@article{vinker2023concept,
  title={Concept Decomposition for Visual Exploration and Inspiration},
  author={Yael Vinker and Andrey Voynov and Daniel Cohen-Or and Ariel Shamir},
  journal={arXiv preprint arXiv:2305.18203},
  year={2023}
}

Disclaimer

This is not an officially supported Google product.
