This is an official PyTorch implementation of the paper "Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts".
Note: We have refactored the code, so the results differ slightly from those reported in the paper.
git clone git@github.com:Leo-xh/AORPO.git
cd AORPO
conda create -n aorpo python=3.7
conda activate aorpo
pip install -r requirements.txt
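To verify that the environment was set up correctly, you can check that PyTorch (which this implementation relies on and should be pulled in by requirements.txt) imports properly; this is just a sanity check, not part of the official setup:
python -c "import torch; print(torch.__version__)"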
We provide a shell script for training; the command is:
./train.sh [tag] [env_name] [alg] {gpu id}
We recommend training with GPUs, since AORPO is hard to train without them.
For example, to train AORPO in Cooperative Navigation:
./train.sh test spread AORPO 0
Please feel free to try other parameters.
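For instance, a hypothetical second run with a different tag on GPU 1 (assuming the tag is simply a run identifier and the GPU id selects the device; the available environment and algorithm names depend on the repository's configuration):
./train.sh run2 spread AORPO 1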
Since training AORPO requires a considerable amount of time, we provide trained models (without the large dynamics models) in ./trained_models/. The models can be evaluated using the following command:
python eval.py [env_id] [model_path] {--render}
For example, to evaluate AORPO in Cooperative Navigation:
python eval.py simple_spread ./trained_models/simple_spread.pt --render
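If you just want to sanity-check a downloaded checkpoint before running the evaluation script, the following minimal sketch loads it with torch.load and prints its top-level structure; this assumes the .pt file was saved with torch.save, and the exact contents depend on the repository's saving code:
import torch
# Load the checkpoint on CPU and inspect its top-level structure.
ckpt = torch.load("./trained_models/simple_spread.pt", map_location="cpu")
print(type(ckpt))
if isinstance(ckpt, dict):
    print(list(ckpt.keys()))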
If you use this code, please cite the paper:
@article{2021,
  title     = {Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts},
  author    = {Zhang, Weinan and Wang, Xihuai and Shen, Jian and Zhou, Ming},
  journal   = {Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence},
  publisher = {International Joint Conferences on Artificial Intelligence Organization},
  year      = {2021},
  month     = {Aug},
  doi       = {10.24963/ijcai.2021/466},
  url       = {http://dx.doi.org/10.24963/ijcai.2021/466}
}