MPE Continuous Action support #419

Rohan138 · 2021-07-08T05:45:28Z

I've added support for continuous actions in MPE as discussed in #249 through the continuous_actions=False argument in the environment config.

I've tested my changes on all environments using RLLib MADDPG, and run ./release_test.sh. The latter needed some updates as well.

All environments are working except simple_world_comm_v2, which seems to have a supersuit-related bug.

jkterry1 · 2021-07-08T05:47:27Z

Hey I haven't been tracking your conversation on discord, but you understand that our MPE environments are rather substantially fixed upband can't be directly compared to the ones in the original repo right?

jkterry1 · 2021-07-08T05:48:05Z

And does Ben have the needed information to look into the "supersuit related bug"?

Rohan138 · 2021-07-08T06:11:40Z

Yes, I understand-however, I think having the option of using continuous action spaces makes it much easier to use PettingZoo MPE with most papers using MPE, just because they've been using it as well. I've still kept the default actions to discrete, so the original behavior isn't affected.
For example, I used the RLLib MADDPG written by you, and got similar results to the original MPE + MADDPG, although not identical.

About simple_world_comm_v2 - I'm not sure what the bug is. If you run the script above with --env-name=simple_world_comm_v2, it crashes with different errors with SuperSuit 2.6.6 and 3.0.1, but it's about incorrect observation shapes for one of the agents with observation shape Discrete(34).

benblack769 · 2021-07-08T18:37:15Z

So I enabled CI tests for your PR, and it looks like simple_world_comm_v2 is failing due to the observation space issue. You can run the respective test with

pytest ./test/pytest_runner.py::test_module[mpe/simple_world_comm_v2-pettingzoo.mpe.simple_world_comm_v2]

Let me know if you have any questions.

benblack769 · 2021-07-08T18:37:40Z

Note that supersuit is not in those tests, so it is not a supersuit issue.

benblack769 · 2021-07-08T19:13:52Z

Ok, so I am reviewing the PR, and it generally looks good, really good job figuring out the tests and documentation.

One issue is that the continuous action space for environments with both communication and movement (like simple_world_comm_v2) is not what it should be (this might be what is causing the test to fail).

In particular, if you have 5 movement options and 4 communication options, then in a continuous space, the action should be of size 9 (the actions are concatenated), and in a discrete space, it should be of size 20 (the actions are a cross product of the two subspaces). The continuous space should break the action down into the movement and communication components differently in the continuous and discrete cases here: https://github.com/PettingZoo-Team/PettingZoo/blob/master/pettingzoo/mpe/_mpe_utils/simple_env.py#L94

The documentation should reflect this.

Thanks again for working on this!

benblack769 · 2021-07-10T21:09:18Z

Ok, so the action space and action handling looks good now. I don't like the way that the rendering misrepresents the communication (it renders the argmax of the actions), but that is inherited from the original MPE, and I don't know if its worth fixing for now.

So I am happy with this PR. @jkterry1 Do you want to look over the documentation? I think its fine, but I have low standards, as you know.

jkterry1 · 2021-07-10T21:14:19Z

"I don't like the way that the rendering misrepresents the communication (it renders the argmax of the actions), but that is inherited from the original MPE, and I don't know if its worth fixing for now."

@Rohan138 would you be willing to work on this?

jkterry1 · 2021-07-10T21:15:22Z

@benblack769 the docs look fine..? what body am I supposed to be looking for under the rug here

benblack769 · 2021-07-11T00:18:50Z

@jkterry1 I don't see any dead bodies either, I'm just checking that you don't.

Rohan138 · 2021-07-11T04:52:40Z

I added something-just printing out the entire comm, rounded to 2dp - that's the best thing I could think of, although I agree that the argmax is unsatisfying. Perhaps a new PR?
I also added the mpe_maddpg script to tutorials.
Feel free to reject either of these two commits.
I hereby attest that there are no dead bodies in this PR.

benblack769 · 2021-07-12T02:32:52Z

Can you check the "Allow edits from maintainers" box? There are a couple minor things I want to change.
bisq-network/style#4

Rohan138 · 2021-07-12T02:53:02Z

It's already checked, are you unable to edit?

jkterry1 · 2021-07-12T04:16:10Z

Ben that feature really doesn't work reliably on GitHub. It's a recurring problem with my PRs to external repos too that I've never seen a solution for.

On Sun, Jul 11, 2021 at 7:53 PM Rohan138 ***@***.***> wrote: It's already checked, are you unable to edit? — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#419 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AEUF33D2QI2NK4G4WR3UPWTTXJKJTANCNFSM5AAAFO7Q> .

-- Thank you for your time, Justin Terry

Rohan138 · 2021-07-14T17:04:05Z

Hi, any updates on this?

jkterry1 · 2021-07-14T17:16:47Z

I'll try to meet with Ben today and deal with this

benblack769 · 2021-07-15T03:55:04Z

Sorry for ignoring this. I just fiddled around a bit with the rendering, no real complains.

benblack769 · 2021-07-15T03:55:24Z

So I am happy with this being merged

benblack769 · 2021-07-16T01:16:39Z

@jkterry1

Rohan138 and others added 7 commits July 7, 2021 20:55

Added continuous action support to MPE

d3da168

Update setup.py

6ef8030

Cleanup and docs for MPE continuous actions

031a3c2

Merge branch 'master' of https://github.com/Rohan138/PettingZoo

6bf1c23

Fix pip bug

8eaac5d

Passing flake8 tests

13c9d41

Ran release_test.sh

5a402bc

Fixed docs for simple_world_comm

a4faaa6

Test continuous_actions=True in all_parameter_combs.py

0862a3b

Rohan138 added 5 commits July 8, 2021 15:38

Fixed docs for MPE

f9cc123

Fixed continuous action spaces

650d9f3

Fixed action spaces

25a67e1

Fixed simple_world_comm_v2 bug

9b3d8ce

Removed test_output.txt

b3c2924

Rohan138 added 2 commits July 11, 2021 00:37

Added better rendering for continuous MPE

442e495

Added mpe_maddpg.py to tutorials

2df3d2d

Fixed example

71876fc

Rohan138 and others added 2 commits July 14, 2021 13:06

Merge branch 'PettingZoo-Team:master' into master

b05844d

Delete mpe_maddpg.py

732ed82

benblack769 added 2 commits July 14, 2021 21:53

fixed rendering

9f12a41

Merge branch 'master' of https://github.com/Rohan138/PettingZoo

f35548d

jkterry1 merged commit 383c152 into Farama-Foundation:master Jul 16, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MPE Continuous Action support #419

MPE Continuous Action support #419

Rohan138 commented Jul 8, 2021 •

edited

jkterry1 commented Jul 8, 2021

jkterry1 commented Jul 8, 2021

Rohan138 commented Jul 8, 2021 •

edited

benblack769 commented Jul 8, 2021 •

edited

benblack769 commented Jul 8, 2021

benblack769 commented Jul 8, 2021

benblack769 commented Jul 10, 2021

jkterry1 commented Jul 10, 2021

jkterry1 commented Jul 10, 2021

benblack769 commented Jul 11, 2021

Rohan138 commented Jul 11, 2021 •

edited

benblack769 commented Jul 12, 2021

Rohan138 commented Jul 12, 2021

jkterry1 commented Jul 12, 2021 via email

Rohan138 commented Jul 14, 2021

jkterry1 commented Jul 14, 2021

benblack769 commented Jul 15, 2021

benblack769 commented Jul 15, 2021

benblack769 commented Jul 16, 2021

MPE Continuous Action support #419

MPE Continuous Action support #419

Conversation

Rohan138 commented Jul 8, 2021 • edited

jkterry1 commented Jul 8, 2021

jkterry1 commented Jul 8, 2021

Rohan138 commented Jul 8, 2021 • edited

benblack769 commented Jul 8, 2021 • edited

benblack769 commented Jul 8, 2021

benblack769 commented Jul 8, 2021

benblack769 commented Jul 10, 2021

jkterry1 commented Jul 10, 2021

jkterry1 commented Jul 10, 2021

benblack769 commented Jul 11, 2021

Rohan138 commented Jul 11, 2021 • edited

benblack769 commented Jul 12, 2021

Rohan138 commented Jul 12, 2021

jkterry1 commented Jul 12, 2021 via email

Rohan138 commented Jul 14, 2021

jkterry1 commented Jul 14, 2021

benblack769 commented Jul 15, 2021

benblack769 commented Jul 15, 2021

benblack769 commented Jul 16, 2021

Rohan138 commented Jul 8, 2021 •

edited

Rohan138 commented Jul 8, 2021 •

edited

benblack769 commented Jul 8, 2021 •

edited

Rohan138 commented Jul 11, 2021 •

edited