video
The suite of modeling video with Mamba
Mora: More like Sora for Generalist Video Generation
A PyTorch implementation of the paper "ZigMa: A DiT-Style Mamba-based Diffusion Model" (ECCV 2024)
[ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" [TMLR 2024]
[CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
[TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Lumina-T2X is a unified framework for Text to Any Modality Generation
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
[ICLR 2025] Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation
Enjoy the magic of Diffusion models!
🚀 The best real-time interactive AI avatar(digital human) with on-premise deployment and <1.5 s latency.
[ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various r…
Clapper.app, a video synthesizer and sequencer designed for the age of AI cinema
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
VideoSys: An easy and efficient system for video generation
[AAAI 2025] Follow-Your-Canvas: This repo is the official implementation of "Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation"
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click.
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
[ICLR 2026] A Training-free Iterative Framework for Long Story Visualization
Code repository for T2V-Turbo and T2V-Turbo-v2
