[Feature] Introduce grouping in VMAS #1658

matteobettini · 2023-10-27T11:08:27Z

This PR allows VMAS to use the MARL api with grouping.

Users will be able to provide a group_map to VMAS.
By default this will try to group agents named name_n into a group named "name".

In case at least one agent does not follow this convention and not group map is specified, VMAS will try to group all agents under the "agent" group

matteobettini

@vmoens this is ready for review when u can

matteobettini · 2023-10-31T18:35:13Z

torchrl/envs/libs/vmas.py

@@ -50,9 +63,75 @@ def _get_envs():
    ]


+@set_gym_backend("gym")


Detached vmas from gym spec conversion to have custom conversion rules

matteobettini · 2023-10-31T18:37:01Z

torchrl/envs/libs/vmas.py

+        agent_indices = {}
+        action_list = []
+        n_agents = 0
+        for group, agent_names in self.group_map.items():
+            group_action = tensordict.get((group, "action"))
+            group_action_list = list(self.read_action(group_action, group=group))
+            agent_indices.update(
+                {
+                    self.agent_names_to_indices_map[agent_name]: i + n_agents
+                    for i, agent_name in enumerate(agent_names)
+                }
+            )
+            n_agents += len(agent_names)
+            action_list += group_action_list
+        action = [action_list[agent_indices[i]] for i in range(self.n_agents)]


this will convert actions from group dicts to the list (ordered as it should)

This bit of logic is the focus of the new test

vmoens

@vmoens this is ready for review when u can

I guess you mean "you", "u" is my father :p

LGTM, I left some comments and suggestions

test/test_libs.py

torchrl/envs/libs/vmas.py

vmoens · 2023-11-01T13:25:15Z

torchrl/envs/libs/vmas.py

+                group_reward_spec,
+                group_info_spec,
+            ) = self._make_unbatched_group_specs(group)
+            self.unbatched_action_spec[group] = group_action_spec


is self.unbatched_action_spec public on purpose?
Can we review the public attributes of VMAS? I feel there are a lot of them and most are undocumented. This will hamper development in the future. I would make anything that is not user facing private to give us more dev freedom.

Sure I will make private all I don't need to be public.
This will be bc-breaking though as the methods will not be available anymore.

Regarding the unbatched_spec, these are needed when creating modules. Giving the specs with the batch_size to the modules will lead to errors #1018. These attirbutes are used in BenchMARL for example

This will be bc-breaking though as the methods will not be available anymore.

you can always make a property that raises a warning

torchrl/envs/libs/vmas.py

vmoens · 2023-11-01T13:28:40Z

torchrl/envs/libs/vmas.py

@@ -419,14 +590,10 @@ def read_reward(self, rewards):
        rewards = _selective_unsqueeze(rewards, batch_size=self.batch_size)
        return rewards

-    def read_action(self, action):
+    def read_action(self, action, group: str):


this is a public method and we're adding a new arg without default value, hence it's bc-breaking. Any chance we can make this non-bc breaking?

I can, or I can make it private, as requested above, which will be more bc breaking though

Co-authored-by: Vincent Moens <vincentmoens@gmail.com>

matteobettini · 2023-11-01T14:59:23Z

Ok I made all methods private.

I would like to keep the attributes public so that they can be accessed by libraries like BenchMARL.
I'll document them

vmoens

sorry for the spammy reviews

vmoens · 2023-11-01T16:25:05Z

torchrl/envs/libs/vmas.py

        rewards = _selective_unsqueeze(rewards, batch_size=self.batch_size)
        return rewards

-    def read_action(self, action):
+    def _read_action(self, action, group: str):


I think this solution is worse, it used to be public and now it's private. Is there a solution that does not break previous behaviour?

yes but it will leave the method public.

Just so to understand, do you want these methods public or private? I am happy to leave them public and making thewm bc comptible, but I thoght that private is what you wanted

The least bc-breaking the better.
I wasn't pointing at these methods in particular, more at all the attributes created when creating specs. It seems like there are loads of them.
read_action is public in other wrappers so I don't think it's a problem if it's public here too.

matteobettini · 2023-11-01T18:17:57Z

ok now there shold be no bc changes and the public attributes should be all doxumented

fix vmas tests

a41b26f

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 27, 2023

matteobettini added 7 commits October 27, 2023 12:10

amend

e3264bf

custom vmas specs

96cb40f

update

517dc54

update

76bfa13

added grouping for specs

f81e30f

added grouping for step and reset

c69acfd

use map

a342e5e

matteobettini changed the title ~~[VMAS] Fix tests~~ [VMAS] Introduce grouping in VMAS Oct 27, 2023

matteobettini added 5 commits October 27, 2023 15:23

amend

6e3667c

amend

ac3cb18

empty

c69825e

Merge branch 'main' into fix_vmas

f499444

amend

543ed6a

matteobettini marked this pull request as ready for review October 31, 2023 18:33

matteobettini commented Oct 31, 2023

View reviewed changes

vmoens approved these changes Nov 1, 2023

View reviewed changes

matteobettini and others added 3 commits November 1, 2023 14:39

Apply suggestions from code review

9346e45

Co-authored-by: Vincent Moens <vincentmoens@gmail.com>

address comments

b244d93

make methods private

c238684

docs

af723b8

vmoens changed the title ~~[VMAS] Introduce grouping in VMAS~~ [Feature] Introduce grouping in VMAS Nov 1, 2023

vmoens added the enhancement New feature or request label Nov 1, 2023

vmoens reviewed Nov 1, 2023

View reviewed changes

matteobettini added 2 commits November 1, 2023 16:28

add default

b71310f

remake public

d93d567

vmoens merged commit 04fbaa1 into pytorch:main Nov 2, 2023

matteobettini deleted the fix_vmas branch December 4, 2023 11:13

[Feature] Introduce grouping in VMAS #1658

[Feature] Introduce grouping in VMAS #1658

Uh oh!

Conversation

matteobettini commented Oct 27, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

matteobettini left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

vmoens left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

matteobettini commented Nov 1, 2023

Uh oh!

vmoens left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

matteobettini commented Nov 1, 2023

Uh oh!

Uh oh!

matteobettini commented Oct 27, 2023 •

edited

Loading