This project contains a custom model built with DreamBooth and LoRA for textual inversion on a custom training dataset. The model is based on fast Stable Diffusion v1.5 and uses textual inversion to generate new images from injected text prompts. It was trained on 15 512×512 JPG images with 1100 U-Net training steps, on Google virtual machines through the Colab Jupyter environment.
Example Image:
This repository contains the Stable Diffusion Textual Inversion Image Generation Model, an AI model that applies textual inversion to generate images. Based on the fast Stable Diffusion approach, the model was built with DreamBooth and LoRA on a custom training dataset.
The core principle of the model is to generate new images from injected text prompts using the Stable Diffusion algorithm. The model was trained on 15 512×512 JPG images with 1100 U-Net training steps on Google virtual machines via the Colab Jupyter environment.
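If the trained weights are available, generating an image looks roughly like the sketch below. This is a minimal example assuming the Hugging Face diffusers library; the embedding path, LoRA path, and the `<my-concept>` token are placeholders rather than files shipped with this repository.

```python
# Minimal inference sketch (assumes the diffusers library; paths and the
# <my-concept> token are placeholders, not files provided by this repo).
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

# Load the learned textual-inversion embedding and LoRA weights (example paths).
pipe.load_textual_inversion("./learned_embeds.safetensors", token="<my-concept>")
pipe.load_lora_weights("./lora_weights")

image = pipe("a photo of <my-concept> in a forest").images[0]
image.save("example.png")
```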
First, clone this repository:
```bash
git clone https://github.com/username/Stable-Diffusion-Textual-Inversion-Image-Generation-Model-Using-Fast-Diffusion.git
```
You will need Python 3.6+ and the packages listed in requirements.txt. Install them with pip:
```bash
pip install -r requirements.txt
```
The model is built on the Stable Diffusion framework, where the textual inversion approach generates images from text inputs. It uses a U-Net for training and LoRA for textual inversion. The process follows the principle of diffusion models, in which the final image is the result of applying a series of transformations (diffusion steps) to a noise vector; the model is taught to reverse this process.
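To make the idea concrete, the sketch below shows a single forward-diffusion (noising) step, assuming the diffusers library. It illustrates the principle the model learns to reverse; it is not code from this repository.

```python
# Illustrative forward-diffusion step (a sketch using diffusers' DDPMScheduler;
# not code from this repository).
import torch
from diffusers import DDPMScheduler

scheduler = DDPMScheduler(num_train_timesteps=1000)
clean_latents = torch.randn(1, 4, 64, 64)   # stand-in for a VAE-encoded 512x512 image
noise = torch.randn_like(clean_latents)
timesteps = torch.randint(0, 1000, (1,))

# The forward process gradually corrupts the latents; the U-Net is trained to
# predict this noise so the process can be reversed at generation time.
noisy_latents = scheduler.add_noise(clean_latents, noise, timesteps)
```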
The Stable Diffusion model used here is v1.5, trained with 1100 U-Net training steps. The input data consists of 15 512×512 JPG images, and LoRA is used for textual inversion.
The model architecture consists of three main components:
- A U-Net that operates on a latent-space representation of the image and learns to predict and remove the noise in it.
- A diffusion (noise) process that progressively adds noise to the latent representation during training; reversing it step by step is what produces new images.
- A text encoder that maps the text inputs to a latent representation used to condition the U-Net.

The model is trained with mixed precision for efficiency. Training applies textual inversion to the images in the dataset, so the model learns to generate images from text inputs.
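For reference, these components can be loaded from the public Stable Diffusion v1.5 checkpoint with diffusers and transformers, as in the sketch below (an assumption; this repository may package the weights differently).

```python
# Sketch of the main components described above, loaded from the public
# Stable Diffusion v1.5 checkpoint (an assumption about packaging).
from diffusers import AutoencoderKL, UNet2DConditionModel, DDPMScheduler
from transformers import CLIPTextModel, CLIPTokenizer

model_id = "runwayml/stable-diffusion-v1-5"

tokenizer = CLIPTokenizer.from_pretrained(model_id, subfolder="tokenizer")
text_encoder = CLIPTextModel.from_pretrained(model_id, subfolder="text_encoder")  # text -> conditioning embeddings
vae = AutoencoderKL.from_pretrained(model_id, subfolder="vae")                     # image <-> latent space
unet = UNet2DConditionModel.from_pretrained(model_id, subfolder="unet")            # latent-space denoiser
noise_scheduler = DDPMScheduler.from_pretrained(model_id, subfolder="scheduler")   # adds/removes noise
```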
Before proceeding with model training, prepare your dataset: the model expects 15 512×512 JPG images. Make sure to provide the correct path to the dataset.
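Something like the following helper can bring source images to the expected size. The directory names are placeholders, not paths required by this repository; any equivalent tool works.

```python
# Hypothetical preprocessing helper: resize source images to 512x512 JPGs.
# Directory names are examples only.
from pathlib import Path
from PIL import Image

src_dir, out_dir = Path("raw_images"), Path("dataset")
out_dir.mkdir(exist_ok=True)

for i, path in enumerate(sorted(src_dir.glob("*"))):
    img = Image.open(path).convert("RGB").resize((512, 512))
    img.save(out_dir / f"{i:02d}.jpg", quality=95)
```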
For training, follow these steps (a condensed training-step sketch follows the list):
- Load the dataset
- Define your model parameters
- Train the model using the pre-defined U-Net architecture
- After training, the model can be used to generate new images with textual inversion
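The sketch below condenses what a single U-Net training step looks like using the components shown earlier. It follows the standard latent-diffusion noise-prediction objective; it is an approximation of the procedure, not a reproduction of this project's exact Colab notebook, and `latents`/`input_ids` are assumed to come from the 15-image dataset and its captions.

```python
# Condensed sketch of one U-Net training step (this project ran 1100 such steps).
# Components are loaded as in the architecture sketch above.
import torch
import torch.nn.functional as F

def training_step(unet, text_encoder, noise_scheduler, latents, input_ids, optimizer):
    noise = torch.randn_like(latents)
    timesteps = torch.randint(
        0, noise_scheduler.config.num_train_timesteps, (latents.shape[0],),
        device=latents.device,
    )
    noisy_latents = noise_scheduler.add_noise(latents, noise, timesteps)
    encoder_hidden_states = text_encoder(input_ids)[0]

    # The U-Net predicts the added noise; minimizing this loss teaches it
    # to reverse the diffusion process at generation time.
    noise_pred = unet(noisy_latents, timesteps, encoder_hidden_states).sample
    loss = F.mse_loss(noise_pred, noise)

    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
    return loss.item()
```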