GitHub - maitrix-org/Pandora: Pandora: Towards General World Model with Natural Language Actions and Video States

Pandora: Towards General World Model with Natural Language Actions and Video States

We introduce Pandora, a step towards a General World Model (GWM) that:

Simulates world states by generating videos across any domains
Allows any-time control with actions expressed in natural language

Please refer to world-model.ai for results.

[Website] [Paper] [Model] [Gallery]

News

[2024/05/23] Release the model and inference code.
[2024/05/23] Launch the website and release the paper.

Setup

conda create -n pandora python=3.11.0 nvidia/label/cuda-12.1.0::cuda-toolkit -y
conda activate pandora
pip install torch torchvision torchaudio
bash build_envs.sh

If your GPU doesn't support CUDA 12.1, you can also install with CUDA 11.8:

conda create -n pandora python=3.11.0 nvidia/label/cuda-11.8.0::cuda-toolkit -y 
conda activate pandora
pip install torch torchvision torchaudio
bash build_envs.sh

Inference

Gradio Demo

Download the model checkpoint from Hugging Face. (We currently hide the model weights due to data license issue. We will re-open the weights soon after we figure this out.)
Run the commands on your terminal

CUDA_VISIBLE_DEVICES={cuda_id} python gradio_app.py  --ckpt_path {path_to_ckpt}

Then you can interact with the model through gradio interface.

Citation

@article{xiang2024pandora,
  title={Pandora: Towards General World Model with Natural Language Actions and Video States},
  author={Jiannan Xiang and Guangyi Liu and Yi Gu and Qiyue Gao and Yuting Ning and Yuheng Zha and Zeyu Feng and Tianhua Tao and Shibo Hao and Yemin Shi and Zhengzhong Liu and Eric P. Xing and Zhiting Hu},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
ChatUniVi		ChatUniVi
DynamiCrafter		DynamiCrafter
assets		assets
examples		examples
.gitignore		.gitignore
README.md		README.md
build_envs.sh		build_envs.sh
configuration.py		configuration.py
demo_utils.py		demo_utils.py
gradio_app.py		gradio_app.py
model.py		model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pandora: Towards General World Model with Natural Language Actions and Video States

News

Setup

Inference

Gradio Demo

Citation

About

Releases

Packages

Contributors 5

Languages

maitrix-org/Pandora

Folders and files

Latest commit

History

Repository files navigation

Pandora: Towards General World Model with Natural Language Actions and Video States

News

Setup

Inference

Gradio Demo

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Languages

Packages