PKU-RL/DPO

DPO

Note

The implementation of DPO can be found in this codebase.

Installation

How to run

python3 on-policy-main/train_smac.py --map_name 2s3z --use_eval \
    --penalty_method True --dtar_kl 0.02 \
    --experiment_name dtar_0.02_V_penalty_2M --num_env_steps 2000000 \
    --group_name dpo --seed 1 --multi_rollout True --n_rollout_threads 1
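The `--penalty_method True --dtar_kl 0.02` flags suggest a KL-penalty surrogate with a target KL of 0.02. As a rough illustration only (not code from this repository), the sketch below shows a PPO-penalty-style adaptive coefficient update; the names `beta`, `update_beta`, and the 1.5x/2x thresholds are assumptions:

```python
# Sketch of an adaptive KL-penalty coefficient, PPO-penalty style.
# Illustrates what --penalty_method / --dtar_kl plausibly control;
# this is NOT taken from the DPO codebase.

def update_beta(beta: float, observed_kl: float, dtar_kl: float) -> float:
    """Grow the penalty weight when KL overshoots the target, shrink it when under."""
    if observed_kl > 1.5 * dtar_kl:
        return beta * 2.0
    if observed_kl < dtar_kl / 1.5:
        return beta / 2.0
    return beta

def penalized_objective(surrogate: float, observed_kl: float, beta: float) -> float:
    """Policy objective: surrogate advantage term minus the weighted KL penalty."""
    return surrogate - beta * observed_kl

if __name__ == "__main__":
    beta, dtar_kl = 1.0, 0.02
    # Two overshoots double beta twice; one undershoot halves it once.
    for kl in (0.05, 0.05, 0.001):
        beta = update_beta(beta, kl, dtar_kl)
    print(beta)
```

With `--dtar_kl 0.02`, the penalty weight would tighten whenever the measured policy KL drifts well above 0.02 and relax when it falls well below, keeping updates near the target divergence.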

Results

Here, we provide results in three different SMAC scenarios (2s3z, 8m, 3s5z) using default hyperparameters.

Citation

If you use this code, please cite our paper.

Kefan Su and Zongqing Lu. A Fully Decentralized Surrogate for Multi-Agent Policy Optimization. TMLR, 2024

@article{DPO,
  title={A Fully Decentralized Surrogate for Multi-Agent Policy Optimization},
  author={Su, Kefan and Lu, Zongqing},
  journal={Transactions on Machine Learning Research},
  year={2024}
}
