MADDPG implementation in RLlib #5348

wsjeon · 2019-08-01T21:51:49Z

What do these changes do?

I refactored the OpenAI MADDPG implementation to fit RLlib and checked its performance.

Related issue number

Linter

I've run scripts/format.sh to lint the changes in this PR.

…ch as input for SyncReplayOptimizer. 2. Add `synchronize_sampling` argument to `SyncReplayOptimizer.__init__`, which is necessary for centralized critic training.
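As a rough illustration of why synchronized sampling matters for centralized critic training: the critic consumes the observations and actions of all agents at the same time step, so each agent's replay buffer has to be read at the same indices. The sketch below is not the RLlib code from this PR. `ToyReplayBuffer` and `sample_joint_batch` are hypothetical names, and it assumes every agent's buffer is filled in lockstep, one transition per agent per step.

```python
import random


class ToyReplayBuffer:
    """Hypothetical per-agent ring buffer (not RLlib's ReplayBuffer)."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.storage = []
        self.next_idx = 0

    def add(self, transition):
        # Append until full, then overwrite the oldest entry.
        if len(self.storage) < self.capacity:
            self.storage.append(transition)
        else:
            self.storage[self.next_idx] = transition
        self.next_idx = (self.next_idx + 1) % self.capacity

    def __len__(self):
        return len(self.storage)

    def sample_with_idxes(self, idxes):
        return [self.storage[i] for i in idxes]


def sample_joint_batch(buffers, batch_size, rng, synchronize_sampling=True):
    """Sample one batch per agent from a dict of agent_id -> buffer.

    With synchronize_sampling=True, one set of indices is drawn and reused
    for every agent, so row k of each agent's batch comes from the same
    time step. With False, each agent draws its own indices and the rows
    are no longer aligned across agents.
    """
    shared_idxes = None
    batch = {}
    for agent_id, buf in buffers.items():
        if synchronize_sampling:
            if shared_idxes is None:
                shared_idxes = [rng.randrange(len(buf)) for _ in range(batch_size)]
            idxes = shared_idxes
        else:
            idxes = [rng.randrange(len(buf)) for _ in range(batch_size)]
        batch[agent_id] = buf.sample_with_idxes(idxes)
    return batch


# Usage: two agents whose buffers are filled in lockstep.
rng = random.Random(0)
buffers = {"agent_0": ToyReplayBuffer(100), "agent_1": ToyReplayBuffer(100)}
for t in range(50):
    for agent_id, buf in buffers.items():
        buf.add({"t": t, "agent": agent_id})

batch = sample_joint_batch(buffers, batch_size=8, rng=rng)
steps_0 = [tr["t"] for tr in batch["agent_0"]]
steps_1 = [tr["t"] for tr in batch["agent_1"]]
```

With synchronized sampling, `steps_0` and `steps_1` are identical, so the rows can be concatenated into joint transitions for the critic. Without it, each row would mix different time steps across agents.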

ericl · 2019-08-01T22:34:33Z

Thanks, this looks pretty good. Would it be possible to include some runnable example script so that we can run them in Jenkins, and so that users can try it out of the box?

wsjeon · 2019-08-01T22:59:33Z

Oh, I added a runnable script to the README of my GitHub repository: https://github.com/wsjeon/maddpg-rllib. Additionally, I'm creating a Singularity image now :) Thanks. Best, Wonseok

AmplabJenkins · 2019-08-02T01:43:35Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/15910/
Test FAILed.

ericl

Thanks! I pushed some changes to add the twostep_game test, remove the need for obs_space_dict / act_space_dict, and move it to contrib/MADDPG.

AmplabJenkins · 2019-08-06T04:38:17Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/16007/
Test FAILed.

AmplabJenkins · 2019-08-06T09:06:46Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/16016/
Test FAILed.

AmplabJenkins · 2019-08-06T09:43:41Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/16018/
Test FAILed.

AmplabJenkins · 2019-08-06T14:07:59Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/16034/
Test FAILed.

AmplabJenkins · 2019-08-06T15:10:54Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/16041/
Test FAILed.

AmplabJenkins · 2019-08-06T23:31:31Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/16057/
Test PASSed.

wsjeon added 11 commits July 24, 2019 15:08

Fit codes in Ray/RLlib agent format.

ca4e63b

Update README.md and experiment results for comparison.

551b28c

1. Remove MultiAgentSyncReplayOptimizer, and make before_learn_on_bat…

885941c

…ch as input for SyncReplayOptimizer. 2. Add `synchronize_sampling` argument to `SyncReplayOptimizer.__init__`, which is necessary for centralized critic training.

1. Remove MultiAgentSyncReplayOptimizer, and make before_learn_on_bat…

f64d00a

…ch as input for SyncReplayOptimizer. 2. Add `synchronize_sampling` argument to `SyncReplayOptimizer.__init__`, which is necessary for centralized critic training.

md test.

d4ffc98

Modify the width of images.

4ae7212

Modify the width of images.

ba8029e

Modify the width of images.

1105b18

Move main code inside MADDPG directory.

bc1e027

Move execution code to wsjeon/maddpg-rllib repo.

c96146d

Move execution code to wsjeon/maddpg-rllib repo.

108a209

ericl self-assigned this Aug 1, 2019

ericl added 4 commits August 5, 2019 16:40

lint

d2b76b4

Merge remote-tracking branch 'upstream/master' into develop

791b9ca

move to contrib

e0bb73d

docs

94155a9

ericl approved these changes Aug 6, 2019

fix tf import

84b6d04

ericl force-pushed the develop branch from a823615 to 84b6d04 Compare August 6, 2019 05:05

ericl added 3 commits August 5, 2019 23:27

Merge remote-tracking branch 'upstream/master' into develop

bec1bd1

docs

c5c22ff

link

fde82cd

Update test_dependency.py

2411318

ericl merged commit 281829e into ray-project:master Aug 6, 2019

edoakes pushed a commit to edoakes/ray that referenced this pull request Aug 9, 2019

MADDPG implementation in RLlib (ray-project#5348)

64b669e
