This project uses the Hugging Face diffusers library to generate images. It starts from a model pretrained on celebrity faces, google/ddpm-celebahq-256 (the following figure shows two images generated by a model trained on this dataset). It then retrieves a Pokemon image dataset from Kaggle, fine-tunes the model on it, and uses DDIM (Denoising Diffusion Implicit Models) for faster sampling during inference. Experiment tracking and visualization of the fine-tuning process are done with Weights & Biases. Finally, a demo with a simple UI is built with Gradio.
CelebA-HQ dataset:
Pokemon dataset:
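As a quick sanity check, the pretrained face model can be loaded and sampled with a few lines of diffusers code. The snippet below is a minimal sketch, not part of this repository's scripts: it also shows swapping in a DDIM scheduler for faster inference, and the 50-step setting and output filename are arbitrary choices.

from diffusers import DDPMPipeline, DDIMScheduler

# Load the pretrained CelebA-HQ face model
pipeline = DDPMPipeline.from_pretrained("google/ddpm-celebahq-256").to("cuda")

# Swap in DDIM for much faster sampling than the default 1000-step DDPM loop
pipeline.scheduler = DDIMScheduler.from_config(pipeline.scheduler.config)

# Generate one 256x256 face image with 50 DDIM steps
image = pipeline(num_inference_steps=50).images[0]
image.save("celebahq_sample.png")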

conda create --name diffusion python=3.8.16
conda activate diffusion
pip install torch==1.9.1 torchvision==0.10.1 torchaudio==0.9.0
pip install -qq diffusers datasets accelerate wandb open-clip-torch
pip install -q gradio

Make sure to follow the Kaggle documentation to set up the Kaggle API correctly:
pip install -q kaggle
mkdir -p ~/.kaggle
cp kaggle.json ~/.kaggle/
chmod 600 ~/.kaggle/kaggle.json

Download the Pokemon dataset:
kaggle datasets download -d hlrhegemony/pokemon-image-dataset
unzip pokemon-image-dataset.zip -d my_pokemon_folder
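Once unzipped, the images can be prepared for fine-tuning with the datasets library. The following is a minimal sketch, assuming the images sit under my_pokemon_folder and that 256x256 (the pretrained model's resolution) is the target size; the batch size of 8 is an arbitrary choice.

import torch
from datasets import load_dataset
from torchvision import transforms

# Load the unzipped Pokemon images as an image-folder dataset
dataset = load_dataset("imagefolder", data_dir="my_pokemon_folder", split="train")

# Resize and normalize to [-1, 1] at the pretrained model's 256x256 resolution
preprocess = transforms.Compose([
    transforms.Resize((256, 256)),
    transforms.ToTensor(),
    transforms.Normalize([0.5, 0.5, 0.5], [0.5, 0.5, 0.5]),
])

def to_tensors(examples):
    return {"images": [preprocess(img.convert("RGB")) for img in examples["image"]]}

dataset.set_transform(to_tensors)
train_loader = torch.utils.data.DataLoader(dataset, batch_size=8, shuffle=True)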
Finally, don't forget to retrieve and use your Weights & Biases (wandb) API key in the following script:

# Initialize wandb for logging
wandb.login(key='YOUR_KEY')
wandb.init(project=wandb_project, config=locals())
To fine-tune the model, use the following script:

python pokemen_finetune_diffusion.py
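Conceptually, the script follows the standard DDPM fine-tuning recipe: noise a batch of Pokemon images to a random timestep, have the pretrained UNet predict that noise, and minimize the MSE between the two while logging the loss to wandb. The sketch below only illustrates this core loop and is not the exact contents of pokemen_finetune_diffusion.py; it reuses the train_loader from the dataset sketch above, and the learning rate and epoch count are arbitrary.

import torch
import torch.nn.functional as F
import wandb
from diffusers import DDPMPipeline, DDPMScheduler

pipeline = DDPMPipeline.from_pretrained("google/ddpm-celebahq-256")
model = pipeline.unet.to("cuda")
noise_scheduler = DDPMScheduler.from_config(pipeline.scheduler.config)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)

num_epochs = 10  # arbitrary choice for illustration
for epoch in range(num_epochs):
    for batch in train_loader:
        clean_images = batch["images"].to("cuda")
        noise = torch.randn_like(clean_images)
        timesteps = torch.randint(0, noise_scheduler.config.num_train_timesteps,
                                  (clean_images.shape[0],), device="cuda").long()
        # Forward-diffuse the clean images to random timesteps
        noisy_images = noise_scheduler.add_noise(clean_images, noise, timesteps)
        # Predict the added noise and regress it with MSE
        noise_pred = model(noisy_images, timesteps).sample
        loss = F.mse_loss(noise_pred, noise)
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()
        wandb.log({"loss": loss.item()})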
The generated images at the beginning of fine-tuning (you can still see human faces):

The generated images during the fine-tuning process (human faces get blended with Pokemon style):

The generated images at the end of the fine-tuning process:

An example image generated by the fine-tuned model via Gradio app:

gradio_demo.mov
To run the Gradio demo, use the following script:
python gradio_demo.py
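The Gradio app essentially wraps the fine-tuned pipeline in a simple generate-an-image interface. The sketch below only illustrates the idea and is not the contents of gradio_demo.py; the checkpoint path "pokemon_model", the DDIM step slider, and its default of 40 steps are assumptions.

import gradio as gr
from diffusers import DDPMPipeline, DDIMScheduler

# Load the fine-tuned checkpoint (path is an assumption)
pipeline = DDPMPipeline.from_pretrained("pokemon_model").to("cuda")
pipeline.scheduler = DDIMScheduler.from_config(pipeline.scheduler.config)

def generate(steps):
    # Fewer DDIM steps trade quality for speed
    return pipeline(num_inference_steps=int(steps)).images[0]

demo = gr.Interface(
    fn=generate,
    inputs=gr.Slider(10, 100, value=40, step=1, label="DDIM steps"),
    outputs=gr.Image(type="pil"),
    title="Pokemon Diffusion Demo",
)
demo.launch()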