Skip to content

KlingAIResearch/VideoCanvas

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

5 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning

Minghong Cai1 †, Qiulin Wang2 βœ‰, Zongli Ye1, Wenze Liu1, Quande Liu2, Weicai Ye2, Xintao Wang2, Pengfei Wan2, Kun Gai2, Xiangyu Yue1 βœ‰
1MMLab, The Chinese University of Hong Kong 2Kling Team, Kuaishou Technology
†: Intern at Kuaishou Technology, βœ‰: Corresponding Authors

πŸ“‹ News

  • [2025.10.9] Release Arxiv paper.

πŸ“– Introduction

VideoCanvas has two key contributions:

  • 🎯 Unified Tasks: VideoCanvas introduces a unified paradigm for arbitrary spatio-temporal video generation, seamlessly integrating diverse capabilities including image/patch-to-video conditioning at any timestamp, inpainting/outpainting, camera control, scene transitions, and video extension.
  • πŸ› οΈ Simple Solution: Our technical innovation leverages In-Context Conditioning with zero-padding for spatial control and Temporal RoPE Interpolation for temporal alignment, achieving frame-precise video generation without fine-tuning VAEs or adding parameters.
teaser.mp4

πŸ“– VideoCanvasBench

We will release this benchmark, including intra-scene and inter-scene evaluation data.

βš™οΈ Code (Coming soon)

Citation

 @article{cai2025videocanvas,
    title={VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning},
    author={Minghong Cai, Qiulin Wang, Zongli Ye, Wenze Liu, Quande Liu, Weicai Ye, Xintao Wang, Pengfei Wan, Kun Gai, Xiangyu Yue},
    journal={arXiv preprint arXiv:2510.08555},
    year={2025}
}

About

Official Code of "VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors