[Doc] More depth in VMAS docs #1802
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/1802
Note: Links to docs will display an error until the docs builds have been completed.
✅ You can merge normally! (6 unrelated failures) As of commit 847cc13 with merge base b632be9:
FLAKY - The following jobs failed but were likely due to flakiness present on trunk.
BROKEN TRUNK - The following job failed but was present on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Thanks so much for this! I left some comments
continuous_actions (bool, optional): Whether to use continuous actions. Defaults to ``True``. If ``False``, actions
will be discrete. The number of actions and their size will depend on the scenario chosen.
See the VMAS repository for more info.
max_steps (int, optional): Horizon of the task. Defaults to ``None`` (infinite horizon). Each VMAS scenario can
Just to make sure: even when rewards can be consumed, there is no predefined max_steps, right? So none of the envs are truncated?
Yes, max_steps is additional to the scenario's "done" function (when one is implemented).
torchrl/envs/libs/vmas.py
Outdated
will be discrete. The number of actions and their size will depend on the scenario chosen.
See the VMAS repository for more info.
max_steps (int, optional): Horizon of the task. Defaults to ``None`` (infinite horizon). Each VMAS scenario can
implement a ``done`` function that will define when the scenario is terminated. If ``max_steps`` is specified,
What does it mean that an env can define a done function? Are some envs never done? Can I choose whether they have a done function?
Yes, each scenario optionally implements a "done" function. Some scenarios do not have one; those scenarios will only terminate if you use max_steps.
You can choose whether to implement the done function when you write a scenario yourself; otherwise it is a fixed property of the scenario.
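A minimal sketch of the two cases (the scenario name "balance", num_envs value, and step counts are only illustrative, and the exact `VmasEnv` signature may differ across torchrl versions):

```python
from torchrl.envs.libs.vmas import VmasEnv

# Scenario-driven termination only: with no max_steps, the episode ends only
# when the scenario's own "done" condition fires (if the scenario has one),
# so some scenarios never terminate on their own.
env_open_ended = VmasEnv("balance", num_envs=32)

# max_steps adds an extra termination condition on top of the scenario's
# "done" function: the episode ends after 100 steps at the latest.
env_bounded = VmasEnv("balance", num_envs=32, max_steps=100)

# rollout (random policy here) stops once the "done" entry is set, so it
# collects at most 100 steps in this case.
td = env_bounded.rollout(max_steps=1_000)
```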
Got it. What is the done function in practice? How does it relate to the "done" key?
I'm asking this to clarify whether it's required to mention a function that we do not see appearing anywhere, or whether we can just say that the env can be non-terminating unless max_steps is provided.
I am also in favour of not mentioning it; I only did because I did not know how much depth was required here.
We can definitely just say what you proposed, that seems better to me too.
We can say something like: some scenarios are terminating and some are not; in all cases, max_steps provides an additional termination condition.
I don't think users need to know the inner machinery but they should know precisely how changing a kwarg will impact what they get from the environment.
Changed
Co-authored-by: Vincent Moens <vincentmoens@gmail.com>
torchrl/envs/libs/vmas.py
Outdated
max_steps (int, optional): Horizon of the task. Defaults to ``None`` (infinite horizon). Each VMAS scenario can
implement a ``done`` function that will define when the scenario is terminated. If ``max_steps`` is specified,
the scenario will also be terminated after this horizon has been reached. If instead of terminating the scenario
you wish to truncate it, please use a :class:`~torchrl.envs.transforms.StepCounter` transform.
I still think the current formulation is confusing, because even if we don't set the "truncated" key this is still technically a truncation. What about:
Unlike gym's `TimeLimit` transform or torchrl's :class:`~torchrl.envs.transforms.StepCounter`, this argument will not set a `"truncated"` entry in the tensordict.
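A hedged sketch of the two options (the scenario name "balance", num_envs, and step counts are illustrative, and constructor arguments may vary between torchrl versions):

```python
from torchrl.envs import StepCounter, TransformedEnv
from torchrl.envs.libs.vmas import VmasEnv

# Option 1: max_steps in the constructor. The episode ends with "done" set
# after 100 steps, but no "truncated" entry is written to the tensordict.
env_terminating = VmasEnv("balance", num_envs=32, max_steps=100)

# Option 2: leave max_steps unset and append a StepCounter transform, which
# marks the time-limit stop as a truncation by setting the "truncated" entry
# (and the corresponding "done" entry).
env_truncating = TransformedEnv(
    VmasEnv("balance", num_envs=32),
    StepCounter(max_steps=100),
)
```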
LGTM