This repository provides a framework for generating large numbers of AI images with the Stable Diffusion model. The model is integrated into a Metaflow workflow that can scale horizontally or vertically to quickly produce as many images as you need. To run the code in this repository you will need access to a Metaflow deployment configured with S3 storage. If you want to learn more about Metaflow or need help getting set up, find us on Slack!
Stable Diffusion Interpretations of Einstein
Before running the flow, ensure that the Metaflow-related infrastructure is deployed and configured on your AWS account and that GPUs are configured for the compute environment (AWS Batch / EKS).
If you don't have the infrastructure set up yet, you can create it with this CloudFormation template. To deploy GPU infrastructure on AWS, change the ComputeEnvInstanceTypes in the CloudFormation template or the CloudFormation UI. More detailed instructions on setting up the infrastructure can be found here.
We have included a conda environment in the form of an env.yml file for you to use. You can install and activate the environment by running the following commands from your terminal:
conda install mamba -n base -c conda-forge
mamba env create -f env.yml
conda activate meta-diffusion
If you prefer to use venv, you can create and activate a new environment by running the following commands from your terminal:
python -m venv ./meta-diffusion
source ./meta-diffusion/bin/activate
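Unlike the conda route above, venv does not install the project's dependencies for you; assuming they are covered by the requirements.txt referenced in the Running Locally notes below, install them with:
pip install -r requirements.txt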
Note: if you get an error installing the transformers library, you may need to install the Rust compiler. You can do so like this:
curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
Before running the flow, ensure you have the necessary AWS infrastructure set up for Metaflow. These flows require S3 and one or more GPUs.
- Ensure that you have signed the waiver for the CompVis/stable-diffusion-v-1-4-original model on the Huggingface hub.
- Create a Huggingface hub access token
- Run the command below after replacing <myhuggingfacetoken> with the Huggingface hub token created in the previous step. Run this command only once to download the model to the local machine:
HF_TOKEN=<myhuggingfacetoken> python model_download.py
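For reference, model_download.py boils down to something like the following sketch, assuming it uses the diffusers pipeline API and the ./models default shown in the options below; the actual script may differ:

import os
from diffusers import StableDiffusionPipeline

# Download the gated CompVis/stable-diffusion-v1-4 weights using the
# Huggingface token and save them locally so the flow can upload them to S3.
pipe = StableDiffusionPipeline.from_pretrained(
    "CompVis/stable-diffusion-v1-4",
    use_auth_token=os.environ["HF_TOKEN"],
)
pipe.save_pretrained("./models/stable-diffusion-v1-4")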
Source File : meta_diffusers_text.py
Run Command :
python meta_diffusers_text.py run \
--max-parallel 10 \
--num-images 40 \
--prompt "Autumn inside the mars dome, ornate, beautiful, atmosphere, vibe, mist, smoke, fire, chimney, rain, wet, pristine, puddles by stanley artgerm lau, greg rutkowski, thomas kindkade, alphonse mucha, loish, norman rockwell" \
--prompt "alan turing by pablo piccasso" \
--num-steps 50 \
--seed 9332
Options:
--model-path PATH          Path to the downloaded model on the local machine.
                           [default: ./models]
--force-upload             Force upload the model from the local machine.
                           [default: False]
--s3-prefix TEXT           Prefix of the path where models are stored in S3.
                           [default: models/diffusion-models/]
--model-version TEXT       [default: stable-diffusion-v1-4]
--batch-size INTEGER       Number of images to send to the GPU per batch.
                           [default: 4]
--width INTEGER            Width of the output image. [default: 512]
--height INTEGER           Height of the output image. [default: 512]
--num-steps INTEGER        Number of inference steps to run. [default: 60]
--prompt TEXT              [default: mahatma gandhi, tone mapped, shiny,
                           intricate, cinematic lighting, highly detailed,
                           digital painting, artstation, concept art,
                           smooth, sharp focus, illustration]
--num-images INTEGER       Number of images to create per prompt.
                           [default: 10]
--seed INTEGER             [default: 42]
--max-parallel INTEGER     Upper bound on parallelism: the foreach fans
                           out to at most this many workers. [default: 4]
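To make the --max-parallel semantics concrete, here is a minimal sketch of how a Metaflow foreach can bound the fan-out by chunking prompts into at most max-parallel groups. This is not the repository's actual flow; all names and the chunking scheme are hypothetical:

from metaflow import FlowSpec, Parameter, step

class FanoutSketch(FlowSpec):
    max_parallel = Parameter("max_parallel", default=4, type=int)

    @step
    def start(self):
        prompts = ["prompt-%d" % i for i in range(40)]  # stand-in work items
        # Split the prompts into at most max_parallel chunks; each chunk
        # becomes one foreach task, i.e. one worker.
        self.chunks = [prompts[i::self.max_parallel]
                       for i in range(self.max_parallel)]
        self.next(self.generate, foreach="chunks")

    @step
    def generate(self):
        # self.input is one chunk of prompts; inference would happen here.
        self.results = ["image for %s" % p for p in self.input]
        self.next(self.join)

    @step
    def join(self, inputs):
        # Gather the results from all foreach tasks.
        self.results = [r for inp in inputs for r in inp.results]
        self.next(self.end)

    @step
    def end(self):
        pass

if __name__ == "__main__":
    FanoutSketch()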
Running Locally : To run this flow locally, ensure that you have installed the dependencies in requirements.txt and commented out the @batch decorator in the flow file.
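Disabling remote execution looks roughly like this; the step name and decorator arguments below are hypothetical, so match them to the actual flow file:

# @batch(gpu=1)   # commented out so the step runs on the local machine
@step
def generate_images(self):
    ...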
Source File : meta_dynamic_prompts.py
Run Command :
python meta_dynamic_prompts.py run \
--num-images 4 \
--subject "mahatma gandhi" \
--subject "alan turing" \
--subject "albert einstein" \
--subject "steve jobs" \
--styles "Pablo Picasso, banksy, artstation" \
--num-steps 45 \
--seed 6372
Options:
--model-path PATH          Path to the downloaded model on the local machine.
                           [default: ./models]
--force-upload             Force upload the model from the local machine.
                           [default: False]
--s3-prefix TEXT           Prefix of the path where models are stored in S3.
                           [default: models/diffusion-models/]
--model-version TEXT       [default: stable-diffusion-v1-4]
--batch-size INTEGER       Number of images to send to the GPU per batch.
                           [default: 4]
--width INTEGER            Width of the output image. [default: 512]
--height INTEGER           Height of the output image. [default: 512]
--num-steps INTEGER        Number of inference steps to run. [default: 60]
--subject TEXT             The subject based on which images are generated.
                           [default: Mahatma gandhi, dalai lama, alan
                           turing]
--styles TEXT              Comma-separated list of styles. [default: van
                           gogh,salvador dali,warhol,Art Nouveau,Ansel
                           Adams,basquiat]
--num-images INTEGER       Number of images to create per (prompt, style)
                           pair. [default: 10]
--images-per-card INTEGER  Maximum number of images to show per @card.
                           [default: 10]
--ui-url TEXT              URL of the Metaflow UI. If provided, an index
                           card is created for the `join_styles` @step.
--seed INTEGER             Seed to use for inference. [default: 42]
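To make the (prompt, style) accounting concrete: the flow builds one prompt per (subject, style) pair and then generates --num-images images for each. A minimal sketch of that cross product, assuming a simple "subject by style" prompt template (the flow's actual template may differ):

from itertools import product

subjects = ["mahatma gandhi", "alan turing", "albert einstein", "steve jobs"]
styles = [s.strip() for s in "Pablo Picasso, banksy, artstation".split(",")]

# One prompt per (subject, style) pair; --num-images images are
# generated for each of these prompts.
prompts = ["%s by %s" % (subj, style) for subj, style in product(subjects, styles)]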
Running Locally : To run this flow locally, ensure that you have installed the dependencies in requirements.txt and commented out the @batch decorator in the flow file.