
Conversation

@riiswa
Contributor

@riiswa riiswa commented Jan 3, 2023

Description

I rewrote the code of PR #783 to support nvec with several axes (sorry for the double PR, I pushed it late ^^). Related to #781.

Example:

>>> ts = MultiDiscreteTensorSpec([[4, 2], [6, 9]])
>>> ts.rand()
tensor([[0, 1],
        [3, 6]])
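
From the discussion below, batched sampling then prepends the batch dims to the nvec dims (an illustrative example inferred from this thread, not taken from the PR description):

>>> ts.rand(torch.Size([5])).shape
torch.Size([5, 2, 2])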

The code is a bit complicated, but that's necessary to generalize to n dimensions. Don't hesitate to tell me if you have any questions or if I should add some comments.

Motivation and Context

Improve compatibility with Gym spaces that use MultiDiscrete with an nvec spanning several axes.

  • I have raised an issue to propose this change (required for new features and bug fixes)

Types of changes

What types of changes does your code introduce? Remove all that do not apply:

  • New feature (non-breaking change which adds core functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)
  • Documentation (update in the documentation)

Checklist

Go over all the following points, and put an x in all the boxes that apply.
If you are unsure about any of these, don't hesitate to ask. We are here to help!

  • I have read the CONTRIBUTION guide (required)
  • My change requires a change to the documentation.
  • I have updated the tests accordingly (required for a bug fix or a new feature).
  • I have updated the documentation accordingly.

@facebook-github-bot facebook-github-bot added the CLA Signed label Jan 3, 2023
@riiswa riiswa force-pushed the feature/multdiscretetensorspec branch from 4362e67 to af2eb52 Compare January 3, 2023 19:42
@riiswa
Contributor Author

riiswa commented Jan 4, 2023

Should I add the expected behavior of to_categorical and to_one_hot to this PR? If so, I will keep it in draft form until then.

@riiswa riiswa force-pushed the feature/multdiscretetensorspec branch from 81b6e3e to 8280a72 Compare January 4, 2023 14:48
@vmoens vmoens added the enhancement label Jan 5, 2023
Collaborator

@vmoens vmoens left a comment

I left a bunch of comments; not sure they all make sense.

        return x.permute(*torch.arange(x.ndim - 1, -1, -1)).reshape([*shape, *_size])

    def _project(self, val: torch.Tensor) -> torch.Tensor:
        if val.dtype not in (torch.int, torch.long):
Collaborator

Can we use this class with a non-integer dtype?
If no, we could just check that val.dtype matches the dtype of the object.
If yes, we could check self.dtype instead and always call round.

Contributor Author

Something like that?

if not self.dtype.is_floating_point:
    val = torch.round(val)
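
For what it's worth, a minimal standalone sketch of that round-then-cast idea (the project helper and the clamp-to-bounds step are my assumptions for illustration, not the actual TensorSpec code):

import torch

def project(val: torch.Tensor, nvec: torch.Tensor, dtype=torch.long) -> torch.Tensor:
    # round only when the target dtype is integral, as discussed above
    if not dtype.is_floating_point:
        val = torch.round(val)
    val = val.to(dtype)
    # clamp each component into [0, n - 1] (assumed projection rule)
    return val.clamp(torch.zeros_like(nvec), nvec - 1)

For example, project(torch.tensor([3.7, -1.2]), torch.tensor([4, 2])) returns tensor([3, 0]).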

@vmoens vmoens changed the title [Feature] MultiDiscreteTensorSpec nvec with several axes. [Feature] MultiDiscreteTensorSpec nvec with several axes Jan 5, 2023
@riiswa
Contributor Author

riiswa commented Jan 5, 2023

I've rewritten the code in line with your comments. Thank you!

@riiswa riiswa marked this pull request as draft January 5, 2023 21:12
@riiswa riiswa force-pushed the feature/multdiscretetensorspec branch from b152de3 to f575f51 Compare January 5, 2023 21:16
@riiswa riiswa marked this pull request as ready for review January 5, 2023 21:23
@riiswa riiswa force-pushed the feature/multdiscretetensorspec branch from 8ec4325 to 62e324d Compare January 6, 2023 08:13
Collaborator

@vmoens vmoens left a comment

I'm still not sure I grasp everything; can you comment a bit further?

return [val] if self._size < 2 else val.split(1, -1)

x = self._rand(self.space, shape)
if self.nvec.ndim > 1:
    x = x.transpose(len(shape), -1)
Collaborator

What does this do? Can you explain?

Say spec.shape = [3, 4] and shape = [1, 2]:
you want x to have shape [1, 2, 3, 4]. From what I understand, the output of _rand already has that shape.
Why do you swap the dim with size 4 with the one with size 2?

Contributor Author

No, in this case x has shape torch.Size([1, 2, 4, 3]) without the transpose, so I have to swap the dims of size 4 and 3 (not the one with size 2: len(shape) == 2 indexes the dim of size 4, since indexing starts at 0).

This is the log of the _rand algorithm (before and after stacking):

[torch.Size([1, 2]), torch.Size([1, 2]), torch.Size([1, 2]), torch.Size([1, 2])]
torch.Size([1, 2, 4])
[torch.Size([1, 2]), torch.Size([1, 2]), torch.Size([1, 2]), torch.Size([1, 2])]
torch.Size([1, 2, 4])
[torch.Size([1, 2]), torch.Size([1, 2]), torch.Size([1, 2]), torch.Size([1, 2])]
torch.Size([1, 2, 4])
[torch.Size([1, 2, 4]), torch.Size([1, 2, 4]), torch.Size([1, 2, 4])]
torch.Size([1, 2, 4, 3])

That is because at the end I should stack along dim -2.
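
Concretely, the transpose fixes the last two axes (an illustrative snippet, not from the PR):

>>> x = torch.zeros(1, 2, 4, 3)   # the shape _rand produces here
>>> x.transpose(2, -1).shape      # len(shape) == 2
torch.Size([1, 2, 3, 4])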

What do you think about this solution:

def _rand(self, space: Box, shape: torch.Size, i: int):
    ....
        x.append(self._rand(_s, shape, i - 1))
    ....
    return torch.stack(x, -i)

Instead of stacking along the last dimension, I stack along a dimension determined by the depth in the recursive box list; the tests pass with this solution.
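
To make the stack-at-dim--i idea concrete, here is a minimal standalone sketch (rand_nested is a hypothetical helper mimicking the recursion, not the PR code):

import torch

def rand_nested(nvec, shape, depth):
    # nvec is a nested list mirroring the recursive box structure
    if isinstance(nvec[0], int):
        # leaf level: one discrete sample with batch shape `shape` per entry
        samples = [torch.randint(n, shape) for n in nvec]
    else:
        samples = [rand_nested(sub, shape, depth - 1) for sub in nvec]
    # stacking at -depth puts this level's axis in its final position,
    # so no transpose is needed afterwards
    return torch.stack(samples, -depth)

>>> rand_nested([[5] * 4] * 3, torch.Size([1, 2]), depth=2).shape
torch.Size([1, 2, 3, 4])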

@vmoens

x = self._rand(self.space, shape)
if self.nvec.ndim > 1:
    x = x.transpose(len(shape), -1)
return x.squeeze(-1)
Collaborator

If the shape is Size([1]), shouldn't we keep the last dim?

Contributor Author

It's an oversight: when ts.shape == self.shape == torch.Size([1]), the previous computation adds an empty dimension, so I added this case:

if self.shape == torch.Size([1]):
    x = x.squeeze(-1)
return x
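
If I understand the fix, the expected behavior then becomes (illustrative, assuming this constructor gives ts.shape == torch.Size([1])):

>>> ts = MultiDiscreteTensorSpec([4])
>>> ts.rand().shape
torch.Size([1])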

@riiswa
Contributor Author

riiswa commented Jan 6, 2023

The CI looks broken :/

Collaborator

@vmoens vmoens left a comment

LGTM! Thanks!

@vmoens
Collaborator

vmoens commented Jan 6, 2023

The CI looks broken :/

It's weird; can you try pushing an empty commit?
git commit --allow-empty -m empty

@riiswa
Contributor Author

riiswa commented Jan 6, 2023

Done, but it's still broken. Maybe a key was removed from (or expired on) GitHub.

@vmoens
Collaborator

vmoens commented Jan 6, 2023

I managed to get it to run!

@vmoens vmoens merged commit 6daedd6 into pytorch:main Jan 7, 2023