cv
[NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT
Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"
a1111 implementation of https://github.com/ChenyangSi/FreeU
FreeU: Free Lunch in Diffusion U-Net (CVPR 2024 Oral)
"FreeU: Free Lunch in Diffusion U-Net" for Hugging Face Diffusers
Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
Official implementation of AnimateDiff.
[IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
[CVPR 2024] LAMP: Learn a Motion Pattern for Few-Shot Based Video Generation
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
[CVPR 2024] 4K4D: Real-Time 4D View Synthesis at 4K Resolution
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
A custom script for AUTOMATIC1111/stable-diffusion-webui to implement a tiny template language for random prompt generation
SD.Next: All-in-one WebUI for AI generative image and video creation, captioning and processing
OneDiff: An out-of-the-box acceleration library for diffusion models.
VMamba: Visual State Space Models; code is based on Mamba
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
PyTorch code and models for V-JEPA self-supervised learning from video.
[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.
MiniSora: A community that aims to explore the implementation path and future development direction of Sora.
[CSUR] A Survey on Video Diffusion Models
