Stars
OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of generating complex and detailed SVGs, from simple icons to in…
Lumina-mGPT 2.0: Stand-alone Autoregressive Image Modeling
Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset
Run Orpheus 3B Locally With LM Studio
Staging repo for development of native port of TypeScript
This custom_node for ComfyUI adds one-click "Virtual VRAM" for any GGUF UNet and CLIP loader, managing the offload of layers to DRAM or VRAM to maximize the latent space of your card. Also includes…
A simple and beautiful text diff viewer component made with Diff and React.
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
A novel approach to hunyuan image-to-video sampling
musubi-tuner modified to tune image2video/video infilling
A pipeline parallel training script for diffusion models.
Official implementation of OneDiffusion paper (CVPR 2025)
Text and image to video generation: Kandinsky 4.0 (2024)
HunyuanVideo: A Systematic Framework For Large Video Generation Model
OneTrainer is a one-stop solution for all your stable diffusion training needs.
An Extension for Automatic1111 Webui that trivializes outpainting