Skip to content

zhaoyl18/ratio_game

Repository files navigation

ratio game

Coding implementation for paper: Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning (ICML 2023)

We study von Neumann's ratio game, a very simple stochastic game. We implement and compare two algorithms:

(1) Our algorithm with sequential policy updates

(2) Independent policy gradient algorithm, e.g. this paper.

Results are shown below.

(a)

Policies are initialized close to the stationary point, stepsize is 0.001. image

(b)

Policies are initialized close to the stationary point, stepsize is 0.02. image

(c)

Policies are initialized close to the stationary point, stepsize is 0.05. image

(d)

Both policies are uniformly initialized, stepsize is 0.001. image

(e)

Both policies are uniformly initialized, stepsize is 0.01. image

(f)

Both policies are uniformly initialized, stepsize is 0.005. image

About

policy gradient methods for von Neumann's ratio game

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages