TL;DR: We propose MoVerse, a method that builds a navigable 3D world from a single narrow-field-of-view image and renders interactive video in real time on a single RTX 4090. MoVerse separates world construction from observation rendering through a three-stage pipeline, producing coherent and photorealistic free-roaming video walkthroughs.
Features: MoVerse generates high-quality real-time navigable worlds from a single input image, supporting:
- Panoramic Generation: Expands a single NFOV image into a gravity-aligned, horizontally periodic 360° ERP panorama via topology-aware latent diffusion
- 3D Gaussian Scaffold: Lifts the panorama into a panoramic 3D Gaussian scaffold using feed-forward residual prediction in angular–inverse-depth space
- Autoregressive Video Refinement: Translates scaffold renderings along user-specified camera trajectories into photorealistic video at 8 FPS on a single RTX 4090
- Free-Roaming Walkthrough: Supports user-controlled camera trajectories with temporally coherent real-time rendering
- Diverse Scenes: Works across indoor and outdoor scenes, from ancient ruins and cyberpunk streets to anime landscapes and realistic interiors
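To make "gravity-aligned, horizontally periodic 360° ERP panorama" concrete, here is a minimal sketch of the standard equirectangular mapping from a pixel to a viewing direction. It is not taken from the MoVerse code, which is not yet released. The function names `erp_pixel_to_direction` and `wrap_column` and the axis convention (y up, longitude 0 along +z) are assumptions made for illustration.

```python
import math


def erp_pixel_to_direction(u, v, width, height):
    """Map the center of ERP pixel (u, v) to a unit direction (x, y, z).

    Assumed convention (hypothetical, not from the paper): y points up
    (gravity-aligned), longitude 0 looks along +z, longitude grows toward +x.
    """
    lon = ((u + 0.5) / width) * 2.0 * math.pi - math.pi    # longitude in [-pi, pi)
    lat = math.pi / 2.0 - ((v + 0.5) / height) * math.pi   # latitude, top row near +pi/2
    x = math.cos(lat) * math.sin(lon)
    y = math.sin(lat)
    z = math.cos(lat) * math.cos(lon)
    return (x, y, z)


def wrap_column(u, width):
    """Horizontal periodicity: column indices wrap around the 360° seam."""
    return u % width
```

Because the panorama is periodic in the horizontal direction, column `width` refers to the same viewing direction as column `0`, so `wrap_column` can be applied to any column index before sampling.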
The code and pretrained models are currently under corporate compliance and security review. We anticipate the review will take about a month, and we will release all materials promptly upon approval.
- 🌐 Project Page: https://orange-3dv-team.github.io/MoVerse
- 📄 Paper: arXiv:2606.13376
- 💻 Code: GitHub
- 🤗 Model: Coming Soon
Given a single narrow-field-of-view image, MoVerse separates world construction from observation rendering:
INPUT (NFOV image)
→ STAGE I: Panoramic Generation (single image → 360° ERP)
→ STAGE II: 3D Gaussian Scaffold (panorama → splattable 3D asset)
→ STAGE III: Autoregressive Video Refinement (scaffold → real-time roaming video)
OUTPUT (navigable world)
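As a rough illustration of how Stage II might place geometry, the sketch below lifts one panorama sample to a 3D point from its angular coordinates and an inverse depth, with an optional residual added in that same space. This is only an interpretation of "residual prediction in angular, inverse-depth space"; the function `lift_to_point`, its residual layout, and the axis convention are hypothetical and are not the released MoVerse implementation.

```python
import math


def lift_to_point(lon, lat, inv_depth, residual=(0.0, 0.0, 0.0)):
    """Lift (longitude, latitude, inverse depth) to a 3D point.

    `residual` is an assumed (d_lon, d_lat, d_inv_depth) correction, standing
    in for whatever a feed-forward network would predict per sample.
    Assumed convention: y is up, longitude 0 looks along +z.
    """
    d_lon, d_lat, d_inv = residual
    lon += d_lon
    lat += d_lat
    inv_depth += d_inv
    if inv_depth <= 0.0:
        raise ValueError("inverse depth must be positive")
    r = 1.0 / inv_depth  # radial distance from the panorama center
    return (
        r * math.cos(lat) * math.sin(lon),
        r * math.sin(lat),
        r * math.cos(lat) * math.cos(lon),
    )


# A sample straight ahead at inverse depth 0.5 lies 2 units along +z.
point = lift_to_point(0.0, 0.0, 0.5)
```

Working in inverse depth keeps far-away content numerically well behaved: a small change in inverse depth near zero moves a distant point a long way, while the same change on nearby content moves it only slightly.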
Visit our Project Page to view complete demo videos, interactive 360° panorama viewers, and 3D Gaussian scaffold visualizations.
@article{moverse2026,
title = {MoVerse: Real-Time Video World Modeling with Panoramic Gaussian Scaffold},
author = {Yang Zhou and Ziheng Wang and Yuqin Lu and Haofeng Liu and Jun Liang and Shengfeng He and Jing Li},
journal = {arXiv preprint arXiv:2606.13376},
year = {2026}
}

If you're interested in this project, please give us a Star ⭐ to receive timely open-source notifications!
