[Doc] Tensordictmodule tutorial #267

nicolas-dufour · 2022-07-11T16:46:36Z

Description

Here is a PR for the tensordictmodule tutorial

Motivation and Context

Tutorial to explain how to use TensorDictModule

Types of changes

What types of changes does your code introduce? Put an x in all the boxes that apply:

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds core functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (update in the documentation)
Example (update in the folder of examples)

Implemented Tasks

Added initial notebook

Checklist

Go over all the following points, and put an x in all the boxes that apply.
If you are unsure about any of these, don't hesitate to ask. We are here to help!

I have read the CONTRIBUTION guide (required)
My change requires a change to the documentation.
I have updated the tests accordingly (required for a bug fix or a new feature).
I have updated the documentation accordingly.

…o tensordict_tutorial

Some corrections in tensordict tuto

…ict_tutorial Retrieving nested tensordict

vmoens

Let's make sure that the hierarchy is well defined (# => ## etc)
Let's make sure the lint is ok everywhere.
Let's use code format for things that are extracted from code: nn.Module and not nn.Module etc.
We need a section about functorch: make_functional_with_buffer, vmap.
Let's keep it all human-readable.

People in RL will be interested in Actor, ProbabilisticTensorModule, ProbabilisticActor, ActorCritic stuff.
You can pretty much copy-paste the docstring, but it's important they are in the tuto.

tutorials/tensordictmodule.ipynb

vmoens

I have not gone through the whole nb, many comments from my previous review have not been addressed. Can you check that they are?
Thanks for this work!

tutorials/tensordictmodule.ipynb

vmoens

I feel the transformer part is slightly too long. Once we have pointed what TensorDictSequence does, do we gain anything specific in showing how the full architecture is coded? I'm afraid people will miss the last part which is more RL-oriented because the transformer is too long.

Maybe we can put the second part of the transformer (encoder and decoder) somewhere else? Or create a separate tuto (ie separate nb) for transformers?

tutorials/tensordictmodule.ipynb

nicolas-dufour · 2022-07-13T17:14:43Z

I put the transformer part since it's a good way to prove that you can do any kind of model with TensorDictModule. Maybe what we can do is inverse the RL part and the transformers. That way, the people interested in RL will see the RL part before the transformers part

…ictmodule_tutorial Merging with main

vmoens

have a look at these comments

tutorials/tensordictmodule.ipynb

vmoens · 2022-07-15T10:17:27Z

tutorials/tensordictmodule.ipynb

+    "tags": []
+   },
+   "source": [
+    "### `ProbabilisticTensorDictModule`"


To add here:

When you print the tensordict, put a few words in the print to say what it is (otherwise the print is not very readable)

Add something about interaction mode: how do you switch from "random" to "mode"? What is the difference? What sampling mode exist?

Add something about get_dist

How do you return the parameters of the dist in the tensordict? And the log_prob? Give examples for each

What distributions are available in torchrl?

This is ultra important for RL people! You must spend time on these classes

Is this not getting a bit far from the tensordictmodule initial tutorial? Won't this need it's own tutorial on its own to dig deeper on RL modules?

vmoens · 2022-07-15T10:18:24Z

tutorials/tensordictmodule.ipynb

+   "id": "dbd48bb2-b93b-4766-b7a7-19d500f17e2d",
+   "metadata": {},
+   "source": [
+    "### `ActorCriticOperator`"


Show the various ActorCritic and ActorValue operators, show what they do, what the differences are.
Also, ValueOperator is missing.

vmoens · 2022-07-17T17:58:33Z

We have added a tutorials/README.md, can you update it with this nb?

vmoens

See my comments

tutorials/README.md

tutorials/tensordictmodule.ipynb

vmoens · 2022-07-21T16:06:36Z

tutorials/tensordictmodule.ipynb

+   "id": "664adff3-1466-47c3-9a80-a0f26171addd",
+   "metadata": {},
+   "source": [
+    "We can see on this minimal example that the overhead introduced by `TensorDictModule` is minimal."


introduced by TensorDictModule is minimal.

to

introduced by TensorDictModule is marginal.

Also, I would remove the init time (where TensorDictModule clearly underperforms), it's irrelevant (of course it takes more time, but it's not an operation we will repeat often)

After this we're good to go

vmoens

LGTM thanks!

nicolas-dufour and others added 12 commits July 7, 2022 18:03

Added TensorDict tutorial

228b158

Fixed english mistakes and small refactoring

1f03199

init

f4039fa

init

1337768

Retrieved bug fixes

f4db1c2

Merge remote-tracking branch 'origin_fb/bugfix_in_keys_exclusion' int…

ab49165

…o tensordict_tutorial

Added suggered changes and cleaned up

04fe61c

Added transformer figure

0399f49

init

97bb800

Merge pull request #1 from vmoens/pr-255

dd726bf

Some corrections in tensordict tuto

Merge branch 'main' of github.com:nicolas-dufour/torchrl into tensord…

9ecff5e

…ict_tutorial Retrieving nested tensordict

TensorDictModule initial commit

4fb2571

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 11, 2022

vmoens added the documentation Improvements or additions to documentation label Jul 12, 2022

vmoens reviewed Jul 12, 2022

View reviewed changes

nicolas-dufour added 3 commits July 13, 2022 10:19

Details fixed

38e8beb

Made suggered modifications

f3a12be

Made suggered modifications

f7d8622

vmoens reviewed Jul 13, 2022

View reviewed changes

Made changes

7010b54

vmoens reviewed Jul 13, 2022

View reviewed changes

nicolas-dufour added 4 commits July 14, 2022 11:38

Suggested changes and do and dont

7836c02

Formating

7dd7efb

Formating

81be639

Merge branch 'main' of github.com:nicolas-dufour/torchrl into tensord…

8300358

…ictmodule_tutorial Merging with main

vmoens reviewed Jul 15, 2022

View reviewed changes

Did some changes

78a40b2

Made suggested changes

49edf5d

nicolas-dufour added 3 commits July 19, 2022 10:59

Merge branch 'main' into tensordictmodule_tutorial

0ee7a42

Added tensordictmodule tutorial to README.MD

84de974

Clean rerun

9999c2b

vmoens reviewed Jul 19, 2022

View reviewed changes

nicolas-dufour added 2 commits July 21, 2022 15:47

Added benchmark

a5acde6

Warning clean-up

52172ba

vmoens reviewed Jul 21, 2022

View reviewed changes

Made suggested changes

45ccd8b

vmoens approved these changes Jul 22, 2022

View reviewed changes

vmoens merged commit f07015d into pytorch:main Jul 22, 2022

nicolas-dufour deleted the tensordictmodule_tutorial branch September 16, 2022 16:36

[Doc] Tensordictmodule tutorial #267

[Doc] Tensordictmodule tutorial #267

Uh oh!

Conversation

nicolas-dufour commented Jul 11, 2022 • edited by vmoens Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Motivation and Context

Types of changes

Implemented Tasks

Checklist

Uh oh!

vmoens left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

vmoens left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

vmoens left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nicolas-dufour commented Jul 13, 2022

Uh oh!

vmoens left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

vmoens Jul 15, 2022

Choose a reason for hiding this comment

Uh oh!

nicolas-dufour Jul 15, 2022

Choose a reason for hiding this comment

Uh oh!

vmoens Jul 15, 2022

Choose a reason for hiding this comment

Uh oh!

vmoens commented Jul 17, 2022

Uh oh!

vmoens left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nicolas-dufour commented Jul 11, 2022 •

edited by vmoens

Loading