image generation
A free and open-source inpainting & image-upscaling tool powered by WebGPU and WASM, implemented entirely in the browser.
Official implementation of the paper "AnyDoor: Zero-shot Object-level Image Customization"
Supercharged experience for multiple models such as ChatGPT, DALL-E and Stable Diffusion.
📷 EasyPhoto | Your Smart AI Photo Generator.
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
Taming Transformers for High-Resolution Image Synthesis
High-Resolution Image Synthesis with Latent Diffusion Models
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image genera…
Concept Sliders for Precise Control of Diffusion Models
This is the official code for the MobileSAM project, which makes SAM lightweight for mobile applications and beyond!
The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harness to the next level — enabling multi-agent collaboration, e…
A GPT-4/Gemini Voice/Video Exploration Tool
Production-ready platform for agentic workflow development.
🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, MacOS and Windows.
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…
A real-time sketch-to-image demo using LCM and the Gradio library.
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
Unofficial Implementation of Animate Anyone
Official implementation of "Splatter Image: Ultra-Fast Single-View 3D Reconstruction" (CVPR 2024)
[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
DeepFaceLab is the leading software for creating deepfakes.
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images with a text prompt, combined with DreamBooth, to produce stunning videos.
Fast and Simple Face Swap Extension Node for ComfyUI
[AAAI 2025] Official PyTorch implementation of "TinySAM: Pushing the Envelope for Efficient Segment Anything Model"
[ACM MM 2024] Official code for "HandRefiner: Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting"




