ratio game

Coding implementation for paper: Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning (ICML 2023)

We study von Neumann's ratio game, a very simple stochastic game. We implement and compare two algorithms:

(1) Our algorithm with sequential policy updates

(2) Independent policy gradient algorithm, e.g. this paper.

Results are shown below.

(a)

Policies are initialized close to the stationary point, stepsize is 0.001.

(b)

Policies are initialized close to the stationary point, stepsize is 0.02.

(c)

Policies are initialized close to the stationary point, stepsize is 0.05.

(d)

Both policies are uniformly initialized, stepsize is 0.001.

(e)

Both policies are uniformly initialized, stepsize is 0.01.

(f)

Both policies are uniformly initialized, stepsize is 0.005.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
archive		archive
stationary		stationary
.gitignore		.gitignore
README.md		README.md
discussion.png		discussion.png
main.py		main.py
model.py		model.py
ratio game results.pdf		ratio game results.pdf
stepsize=0.00.png		stepsize=0.00.png
stepsize=0.01.png		stepsize=0.01.png
stepsize=0.05.png		stepsize=0.05.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ratio game

(a)

(b)

(c)

(d)

(e)

(f)

About

Releases

Packages

Languages

zhaoyl18/ratio_game

Folders and files

Latest commit

History

Repository files navigation

ratio game

(a)

(b)

(c)

(d)

(e)

(f)

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages