Generative Models
Audio Dataset for training CLAP and other models
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
PyTorch implementations of Generative Adversarial Networks.
Generative Models by Stability AI
Taming Transformers for High-Resolution Image Synthesis
Turn your rough sketch into a refined image using AI
Official Pytorch Implementation of the paper: Wavelet Diffusion Models are fast and scalable Image Generators (CVPR'23)
A latent text-to-image diffusion model
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
SpeechGPT Series: Speech Large Language Models
[ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)
A prompting enhancement library for transformers-type text embedding systems
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
A framework for the evaluation of autoregressive code generation language models.
Official inference repo for FLUX.1 models
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
Open-Sora: Democratizing Efficient Video Production for All
Hierarchical Reasoning Model Official Release
Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference
Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.

