Skip to content

LogosRoboticsGroup/UniWorldVLA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

1 Commit
Β 
Β 
Β 
Β 

Repository files navigation

Uni-World VLA: Interleaved World Modeling and Planning for Autonomous Driving

Qiqi Liu1,2,3*, Huan Xu3*, Jingyu Li1,2,3, Bin Sun3†, Zhihui Hao3†, Dangen She3, Xiatian Zhu4, Li Zhang1,2‑

1Fudan University; 2Shanghai Innovation Institute; 3Li Auto Inc.; 4University of Surrey
* equal contribution; † project leader; ‑ corresponding author

arXiv


πŸ“° News


πŸ”­ Project Overview

Overview


πŸ’‘ Key Features

  • πŸ”„ Interleaved world modeling and planning: alternates future frame prediction and ego action/trajectory generation step-by-step, forming a closed-loop interaction that keeps planning conditioned on imagined observations.
  • πŸ€– Unified autoregressive VLA formulation: generates visual tokens and action queries in a single sequence, tightly coupling prediction and control under temporal causality.
  • πŸ“Ή Depth integration for geometric cues: augments historical frames with monocular depth maps and fuses geometry features via cross-attention to improve long-horizon scene prediction.

πŸ“Š Results

Table 1. Closed-loop planning results on NAVSIM.

Table1

Table 2. World modeling / prediction results on NAVSIM.

Table2

Visualization.

Visualization


🧾 TODO

  • Release arXiv paper
  • Release code
  • Release model weights

πŸ“– Citation

@article{liu2026uniworld,
  title   = {Uni-World VLA: Interleaved World Modeling and Planning for Autonomous Driving},
  author  = {Liu, Qiqi and Xu, Huan and Li, Jingyu and Sun, Bin and Hao, Zhihui and She, Dangen and Zhu, Xiatian and Zhang, Li},
  journal = {arXiv preprint arXiv:2603.27287},
  year    = {2026},
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors