Multiagent simplerl #5066

andrewcoh · 2021-03-09T19:31:18Z

Proposed change(s)

Describe the changes made in this PR.

Useful links (Github issues, JIRA tickets, ML-Agents forum threads etc.)

Types of change(s)

Checklist

Added tests that prove my fix is effective or that my feature works
Updated the changelog (if applicable)
Updated the documentation (if applicable)
Updated the migration guide (if applicable)

Other comments

vincentpierre · 2021-03-09T19:55:54Z

ml-agents/mlagents/trainers/tests/simple_test_envs.py

+        pass
+
+    def set_actions(self, behavior_name, action):
+        # im so sorry


vincentpierre · 2021-03-09T19:56:52Z

ml-agents/mlagents/trainers/tests/simple_test_envs.py

@@ -314,6 +314,185 @@ def _make_batched_step(
        return (decision_step, terminal_step)


+class MultiAgentEnvironment(BaseEnv):


I don't understand anything this class does. Please add comments.

Yeah, it is a pretty horrible thing

vincentpierre · 2021-03-09T19:59:20Z

ml-agents/mlagents/trainers/tests/torch/test_simple_rl.py


 # tests in this file won't be tested on GPU machine
 pytestmark = pytest.mark.check_environment_trains


+@pytest.mark.parametrize("action_sizes", [(0, 1), (1, 0)])


Can this be tested for combinations of rank 1, 2 and 3 observations and with an LSTM config? (To make sure it does not crash, at least.)

Added some tests for LSTM, variable length obs, and visual
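As an illustration of the coverage requested above, here is a minimal sketch of how such a test matrix could be enumerated. Only the `action_sizes` values come from the diff; `OBS_SHAPES`, `build_test_matrix`, the concrete shapes, and the mapping of rank 2 to variable-length observations are assumptions made for this example, not the tests actually added in the PR.

```python
from itertools import product

# Hypothetical observation shapes, one per rank; the sizes are illustrative only.
OBS_SHAPES = {
    1: (8,),         # rank 1: vector observation
    2: (5, 8),       # rank 2: assumed here to stand for a variable-length observation
    3: (36, 36, 3),  # rank 3: visual observation
}


def build_test_matrix(action_sizes=((0, 1), (1, 0)), use_lstm=(False, True)):
    """Return every (action_sizes, obs_rank, use_lstm) combination."""
    return list(product(action_sizes, sorted(OBS_SHAPES), use_lstm))


if __name__ == "__main__":
    for case in build_test_matrix():
        print(case)
```

A list like this could be handed to `pytest.mark.parametrize("action_sizes,obs_rank,use_lstm", build_test_matrix())` so that each combination runs as its own test case, which makes a crash in one configuration easy to spot.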

vincentpierre · 2021-03-10T20:23:06Z

ml-agents/mlagents/trainers/tests/simple_test_envs.py

+                self.dones[name_and_num] = False
+                self.envs[name_and_num].reset()
+                # HACK
+                self.behavior_spec = self.envs[name_and_num].behavior_spec


This does not need to be in the loop
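A minimal sketch of the suggestion, assuming every sub-environment exposes the same `behavior_spec`: the assignment is loop-invariant, so it can be done once after the loop. `StubEnv` and `MultiAgentSketch` are hypothetical stand-ins written for this example; they are not the classes in `simple_test_envs.py`.

```python
class StubEnv:
    """Hypothetical stand-in for one single-agent test environment."""

    def __init__(self, behavior_spec):
        self.behavior_spec = behavior_spec
        self.reset_count = 0

    def reset(self):
        self.reset_count += 1


class MultiAgentSketch:
    """Hypothetical wrapper; only the reset loop is modelled."""

    def __init__(self, names, behavior_spec):
        self.envs = {name: StubEnv(behavior_spec) for name in names}
        self.dones = {name: True for name in names}
        self.behavior_spec = None

    def reset(self):
        for name_and_num in self.envs:
            self.dones[name_and_num] = False
            self.envs[name_and_num].reset()
        # Loop-invariant: assumes all sub-environments share one spec,
        # so reading it once from any of them is enough.
        first_env = next(iter(self.envs.values()))
        self.behavior_spec = first_env.behavior_spec


if __name__ == "__main__":
    env = MultiAgentSketch(["agent0", "agent1"], behavior_spec="shared-spec")
    env.reset()
    print(env.behavior_spec, env.dones)
```

If the sub-environments could ever carry different specs, a single shared attribute would not be enough and the wrapper would need to keep one spec per environment instead.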

andrewcoh and others added 7 commits March 8, 2021 16:28

simple rl multiagent env

59825a1

runs but does not train

2accf19

assemble terminal steps

a6ffbd8

seems to train

3e26bc3

fix final reward

5126254

Merge branch 'develop-coma2-trainer' into develop-multiagent-simplerl

fea6d53

Merge changes

7447d88

vincentpierre reviewed Mar 9, 2021


andrewcoh and others added 7 commits March 9, 2021 16:02

fix multiple discrete actions

2dd982d

Lots of small fixes for multiagent env

278ecf2

Fix just_died

38dc560

Add simple RL tests

82e5e99

Merge branch 'develop-coma2-trainer' into develop-multiagent-simplerl

3c88d65

Add LSTM simple_rl for COMA

51aba67

adding comments to multiagent rl

bfd1428

vincentpierre approved these changes Mar 10, 2021


Address comments

72f0370

ervteng merged commit a7d2a65 into develop-coma2-trainer Mar 10, 2021

delete-merged-branch bot deleted the develop-multiagent-simplerl branch March 10, 2021 20:57

github-actions bot locked as resolved and limited conversation to collaborators Mar 11, 2022




