BWM is a physically consistent, action-conditioned video world model built upon Wan2.2-TI2V-5B, serving as a low-cost yet high-fidelity simulator for robotic manipulation.
- [2026-05] π Top results on WorldArena Leaderboard! BLM ranks 1st among open-source models on Track 1 and Track 2 Data Engine, while BWM-fast ranks 2nd overall on Track 1.
- [2026-05] π Inference code released! Generate action-conditioned robot manipulation videos with BWM. See π οΈ Usage.
- [2026-05] π Model definition released! The BWM architecture and core model components are now available.
- BLM: π₯ 1st Place among open-source models on Track 1 and Track 2 Data Engine.
- BWM-fast: π₯ 2nd Place on the overall Track 1 leaderboard.
![]() Track 1 open-source leaderboard |
![]() Track 2 Data Engine open-source leaderboard |
![]() Track 1 overall leaderboard |
Leaderboard: https://huggingface.co/spaces/WorldArena/WorldArena
- β TODO
- ποΈ Framework
- π¬ Qualitative Results
- π οΈ Usage
- ποΈ Training
- π Acknowledgements
- π§ Contact
- π Citing
- Release inference code
- Release model definition
- Release model weights
- Release training code
- Release technical report
Coming soon !
The following simulation scenes are generated autoregressively by BWM from initial frames and action sequences in the WorldArena test set, achieving high-fidelity visual realism while maintaining long-horizon physical consistency.
![]() |
![]() |
- Task: arrange blocks by size, stack bowls
- Challenge: Multi-object spatial ordering, stacking stability, and contact-rich placement
- Ours:
- β Preserves object identity and target layout
- β Maintains stable stacking contacts
- β Predicts adaptive gripper control
![]() |
![]() |
- Task: open microwave, open laptop
- Challenge: Articulated hinge motion, constrained rotation, and persistent object state
- Ours:
- β Captures hinge-constrained opening dynamics
- β Maintains coherent object geometry during rotation
- β Preserves opened states over long-horizon rollouts
![]() |
![]() |
![]() |
![]() |
- Task: turn switch, hang mug, click bell, stamp seal
- Challenge: Small contact regions, constrained placement, and precise state-changing interactions
- Ours:
- β Captures fine-grained affordance dynamics
- β Aligns contact with object affordances
- β Preserves state-changing interactions
![]() |
![]() |
- Task: hand over block, hand over mic
- Challenge: Dual-arm synchronization, inter-arm occlusion, and coordinated grasp timing
- Ours:
- β Models synchronized dual-arm motion
- β Preserves object continuity
- β Avoids close-contact collisions
![]() |
![]() |
- Task: put object in cabinet, put bottles in dustbin
- Challenge: Long-horizon transport, partial occlusion, and constrained final placement
- Ours:
- β Maintains long-horizon scene coherence
- β Handles occlusion without object drift
- β Produces stable constrained placement
To test generalization beyond benchmark initial states, we use GPT-Image-2-created initial scenes with original robot action sequences and let BWM autoregressively roll out the future under object appearance shifts.
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
- Task: shake bottle, put object in cabinet
- Challenge: Novel initial scenes and object appearance shifts
- Ours:
- β Generalizes to GPT-Image-2-created initial scenes
- β Preserves action-conditioned dynamics
- β Maintains coherent robot-object interaction
# Create conda environment
conda create -n BWM python=3.10.20
conda activate BWM
# Install PyTorch with CUDA support
pip install torch==2.8.0 torchvision==0.23.0 torchaudio==2.8.0 --index-url https://download.pytorch.org/whl/cu128
# Install DiffSynth-Studio
pip install diffsynth==2.0.11
# Install dependencies
pip install -r requirements.txtDownload the Wan2.2-TI2V-5B base model from ModelScope:
modelscope download --model Wan-AI/Wan2.2-TI2V-5B --local_dir models/Wan2.2-TI2V-5BDownload the BWM checkpoint from Hugging Face:
hf download BLM-Lab/Boundless-World-Model step-12000.safetensors --local-dir ckpt/BLMThe demo metadata, videos, actions, and normalization statistics are already included under demo/.
Set local paths before running inference:
cp scripts/local.example.sh scripts/local.shUpdate MODEL_PATHS and CKPT_PATH in scripts/local.sh, then run:
bash scripts/infer_example.shComing soon !
This project builds upon the following open-source projects and benchmarks. We thank these teams for their contributions:
- Wan2.2: https://github.com/Wan-Video/Wan2.2
- DiffSynth-Studio: https://github.com/modelscope/DiffSynth-Studio
- WorldArena: https://github.com/tsinghua-fib-lab/WorldArena/
- ABot-PhysWorld: https://github.com/amap-cvlab/ABot-PhysWorld
We also acknowledge the following engineering contributions:
- Wentao Tan: basic architecture design Β· Email Β· GitHub
- Zengrong Lin: core code implementation Β· Email Β· GitHub
- Yang Sun: code refactoring and software maintainability Β· Email Β· GitHub
We further thank all project contributors for their valuable discussions, support for the paper experiments, and participation in the WorldArena challenge.
- Supervision: Heng Tao Shen
- Principal Investigator: Lei Zhu
- Student Project Leadership: Wentao Tan, Tianshi Wang
- WorldArena Challenge:
- Strategy Design: Wentao Tan, Bowen Wang
- Inference-Time Scaling: Tianshi Wang, Chenming Li
- Data Pipeline: Bowen Wang, Enci Xie, Wentao Tan, Chenming Li, Yang Sun, Yipeng Chen, Xuebin Fang, Zequn Wang
- Metric Analysis: Wentao Tan, Enci Xie, Chenming Li, Tianshi Wang
- Closed-Loop Rollout: Zequn Wang, Zhe Li, Heng Zhi, Zengrong Lin
- Model Architecture:
- Innovation: Wentao Tan, Zengrong Lin, Enci Xie, Baixu Ji
- Model Training: Zengrong Lin, Yang Sun, Zhe Li
- Post Training: Yang Sun, Zengrong Lin, Wentao Tan
- Baselines: Zequn Wang, Heng Zhi, Yipeng Chen, Chenyu Liu, Wenjie Yang, Hao Xue, Chen Xu
- VLAs Support:
- Real-World: Heng Zhi
- Simulation: Heng Zhi, Baixu Ji
- Infrastructure:
- Distributed Evaluation: Wenhao Liu
- Real-World Setup: Zhe Li
- Discussion Support: Fengling Li, Pengfei Zhang, Lanyun Zhu, Ying Cheng, Jingkuan Song, Xing Xu, Yunfan Ren, Qi Zhang
Contributors are listed in alphabetical order by English name.
Baixu Ji, Bowen Wang, Chen Xu, Chenming Li, Chenyu Liu, Enci Xie, Fengling Li, Hao Xue, Heng Tao Shen, Heng Zhi, Jingkuan Song, Lanyun Zhu, Lei Zhu, Pengfei Zhang, Qi Zhang, Tianshi Wang, Wenhao Liu, Wenjie Yang, Wentao Tan, Xing Xu, Xuebin Fang, Yang Sun, Ying Cheng, Yipeng Chen, Yunfan Ren, Zengrong Lin, Zequn Wang, Zhe Li
If you find BWM is useful in your research or applications, please consider giving us a star π.




















