Diffusion Probabilistic Model with DDPMSampler

This GitHub repository contains Python code implementing several probabilistic generative models and embedding techniques. The models target image enhancement, image generation, and probabilistic modeling, and work with both image data and text embeddings. The key components and models are summarized below.

Variational Autoencoder (VAE) Architecture

The VAE architecture is designed for image enhancement, generative tasks, and probabilistic modeling, with applications such as denoising, inpainting, and image generation. It has two parts:

VAE Encoder: A module that includes convolutional layers, residual blocks, and attention blocks for encoding input images.
VAE Decoder: A module for decoding and generating enhanced images from latent representations.
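
To make the "latent representation" concrete, here is a minimal sketch of the reparameterization step that VAEs conventionally use between encoder and decoder. It assumes the encoder produces a per-element mean and log-variance; the function name `sample_latent` and the plain-list inputs are illustrative only and are not taken from encoder.py or decoder.py.

```python
import math
import random


def sample_latent(mean, log_variance, rng=None):
    """Draw z = mean + std * eps element-wise, with eps ~ N(0, 1).

    Hypothetical helper: assumes the encoder outputs a mean and a
    log-variance for every latent element.
    """
    rng = rng or random.Random()
    latent = []
    for mu, log_var in zip(mean, log_variance):
        std = math.exp(0.5 * log_var)  # log-variance -> standard deviation
        latent.append(mu + std * rng.gauss(0.0, 1.0))
    return latent


# Three latent elements, sampled with a fixed seed for repeatability.
z = sample_latent([0.0, 1.0, -2.0], [0.0, -1.0, 0.5], rng=random.Random(0))
print(len(z))
```

Sampling this way keeps the randomness in `eps`, so gradients can flow through the mean and log-variance during training. A real implementation would operate on tensors rather than Python lists.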

Diffusion Probabilistic Model

The diffusion probabilistic model, implemented in the Diffusion class, is a generative model for image data. It uses latent variables, context information, and time embeddings to generate images. Key features include:

Diffusion Model: A module that initializes the diffusion model, which can be used for tasks such as image denoising, enhancement, and generation.
DDPMSampler: A class that handles sampling from the diffusion model, letting users control noise strength and generate images at specific timesteps.
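
The ideas of "noise strength" and "specific timesteps" can be illustrated with a small sketch of the standard DDPM forward process, x_t = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * eps. Everything here is an assumption for illustration: the class `ToyDDPMSchedule`, its method names, the linear beta schedule with 1000 steps, and the way strength maps to a timestep are not taken from DDPM.py, whose actual interface may differ.

```python
import math
import random


class ToyDDPMSchedule:
    """Hypothetical stand-in, not the repository's DDPMSampler."""

    def __init__(self, num_steps=1000, beta_start=1e-4, beta_end=0.02):
        self.num_steps = num_steps
        # Linear beta schedule (assumed; other schedules are common too).
        step = (beta_end - beta_start) / (num_steps - 1)
        betas = [beta_start + i * step for i in range(num_steps)]
        # alpha_bar_t is the running product of (1 - beta) up to step t.
        self.alpha_bars = []
        running = 1.0
        for beta in betas:
            running *= 1.0 - beta
            self.alpha_bars.append(running)

    def timestep_for_strength(self, strength):
        """Map a strength in [0, 1] to a timestep (assumed convention)."""
        if not 0.0 <= strength <= 1.0:
            raise ValueError("strength must be in [0, 1]")
        return max(0, int(round(strength * self.num_steps)) - 1)

    def add_noise(self, x0, t, rng):
        """Jump straight from clean values x0 to the noisy sample at step t."""
        signal = math.sqrt(self.alpha_bars[t])
        noise = math.sqrt(1.0 - self.alpha_bars[t])
        return [signal * v + noise * rng.gauss(0.0, 1.0) for v in x0]


schedule = ToyDDPMSchedule()
t = schedule.timestep_for_strength(0.5)
noisy = schedule.add_noise([0.2, -0.4, 0.9], t, random.Random(0))
print(t, len(noisy))
```

A higher strength selects a later timestep, where less of the original signal survives. Generation then runs the reverse direction, with the model predicting the noise to remove at each step.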

CLIP Embedding

The CLIP (Contrastive Language-Image Pre-training) embedding model produces text and image embeddings for natural language understanding, image-text matching, and cross-modal applications. It includes the following components:

CLIPEmbedding: A module for embedding tokens (text or image) using a combination of token embeddings and learnable position embeddings.
CLIPLayer: A layer that performs self-attention and feedforward operations, allowing for the learning of complex relationships between tokens.
CLIP Model: A complete CLIP model that combines the embedding and multiple CLIP layers for generating embeddings from tokens.
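
The embedding step described above, a token embedding added to a position embedding, can be sketched as follows. This is a simplified illustration using plain Python lists; the class `ToyCLIPEmbedding`, its constructor arguments, and the random initialization are hypothetical and do not reflect the actual code in clip.py, where both tables would be trainable parameters.

```python
import random


class ToyCLIPEmbedding:
    """Hypothetical stand-in for the repository's CLIPEmbedding."""

    def __init__(self, vocab_size, embed_dim, max_tokens, seed=0):
        rng = random.Random(seed)

        def make_table(rows):
            # Small random values stand in for learned weights.
            return [[rng.gauss(0.0, 0.02) for _ in range(embed_dim)]
                    for _ in range(rows)]

        self.token_table = make_table(vocab_size)     # one row per token id
        self.position_table = make_table(max_tokens)  # one row per position

    def __call__(self, token_ids):
        if len(token_ids) > len(self.position_table):
            raise ValueError("sequence is longer than max_tokens")
        # Each output vector is token embedding + position embedding.
        return [
            [tok + pos for tok, pos in
             zip(self.token_table[token_id], self.position_table[index])]
            for index, token_id in enumerate(token_ids)
        ]


embedding = ToyCLIPEmbedding(vocab_size=10, embed_dim=4, max_tokens=8)
vectors = embedding([3, 3, 7])
print(len(vectors), len(vectors[0]))
```

Because the position row differs at each index, the same token id yields a different vector at each position, which is what lets the attention layers that follow distinguish word order.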

Together, these components provide a set of tools for working with probabilistic generative models and embeddings, aimed at researchers and developers working on image and text tasks. Each component comes with its own API and usage examples, which eases integration into different projects.

For a more thorough understanding of DDPM or the model, see the papers here and here.

karan-nanda/Stable-Diffusion-Model
