[Feature] Use ObservationNorm.init_stats for stats computation in example scripts #715
Conversation
vmoens left a comment
I think that here, if we spawn multiple processes, each env on each process will run its own init_env_steps and hence they will all have a different set of summary stats.
We should compute the stats in the main process like we did before (but using the method you provided), then pass these stats to each env.
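A minimal sketch of that flow (the factory, the env name, and the exact ObservationNorm/init_stats signatures are assumptions, not the actual helpers in this PR):

```python
# Assumed names/signatures: GymEnv, ObservationNorm(in_keys=...), init_stats(num_iter=...).
# Idea: compute the stats once in the main process, then build every worker env
# with the precomputed loc/scale so all processes share the same summary stats.
from torchrl.envs import TransformedEnv
from torchrl.envs.libs.gym import GymEnv
from torchrl.envs.transforms import ObservationNorm

# 1) one "proof" env, used only for the rollout-based initialisation
proof_norm = ObservationNorm(in_keys=["observation"])
proof_env = TransformedEnv(GymEnv("Pendulum-v1"), proof_norm)
proof_norm.init_stats(num_iter=1000)
loc, scale = proof_norm.loc, proof_norm.scale

# 2) worker envs reuse the precomputed stats instead of running init_stats themselves
def make_worker_env():
    return TransformedEnv(
        GymEnv("Pendulum-v1"),
        ObservationNorm(loc=loc, scale=scale, in_keys=["observation"]),
    )
```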
…tionNorm transforms. Add checks in init_stats to ensure proper initialization
vmoens left a comment
LGTM
Do you think things would be simpler with state dict?
We could
(1) create a dummy env, compute stats
(2) get the state dict of this dummy env
(3) load the state dict on every env created subsequently
LMK what you think
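A rough sketch of that state-dict flow (assuming loc/scale end up in the transform's state_dict; make_env and the init_stats arguments are placeholders):

```python
# Placeholder factory standing in for whatever the example scripts build.
def make_env():
    ...  # returns TransformedEnv(..., ObservationNorm(...))

# (1) dummy env, used only to compute the stats
dummy_env = make_env()
dummy_env.transform.init_stats(num_iter=1000)  # assumed signature

# (2) capture loc/scale through the transform's state dict
stats_state_dict = dummy_env.transform.state_dict()

# (3) load that state dict into every env created afterwards
def make_initialized_env():
    env = make_env()
    env.transform.load_state_dict(stats_state_dict)
    return env
```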
torchrl/trainers/helpers/envs.py (Outdated)
raise AttributeError("init_env_steps missing from arguments.")

if (
    type(proof_environment.transform) != Compose
Why not isinstance?
Why equality and not `is not`?
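For illustration only (not the final code in this PR), the check the comment is pointing at could read:

```python
# isinstance also accepts subclasses of Compose/ObservationNorm,
# whereas type(...) != Compose rejects them.
if not isinstance(proof_environment.transform, (Compose, ObservationNorm)):
    raise AttributeError("...")  # placeholder for the actual error message
```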
torchrl/trainers/helpers/envs.py (Outdated)
if (
    type(proof_environment.transform) != Compose
    and type(proof_environment.transform) != ObservationNorm
Same here
torchrl/trainers/helpers/envs.py (Outdated)
)

obs_norm_transforms = []
if type(proof_environment.transform) == Compose:
Same
torchrl/trainers/helpers/envs.py (Outdated)
obs_norm_transforms.append((0, proof_environment.transform))

stats = []
for (idx, transform) in obs_norm_transforms:
Upon reflection, we could simply take the state dict of the transforms and load it, no?
Wouldn't that be simpler than this?
If loc and scale are buffers, it could simplify things a bit.
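A toy, plain-PyTorch illustration of that last point (not the actual torchrl classes): tensors registered as buffers appear in state_dict() and can be copied with load_state_dict().

```python
import torch
from torch import nn

class ToyNorm(nn.Module):
    def __init__(self):
        super().__init__()
        # buffers are part of state_dict() without being trainable parameters
        self.register_buffer("loc", torch.zeros(3))
        self.register_buffer("scale", torch.ones(3))

src, dst = ToyNorm(), ToyNorm()
src.loc += 1.0                          # pretend init_stats filled this in
dst.load_state_dict(src.state_dict())   # dst now carries the same loc/scale
print(dst.loc)                          # tensor([1., 1., 1.])
```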
acf16b8 to 17b1b15
Codecov Report
@@            Coverage Diff             @@
##             main     #715      +/-   ##
==========================================
- Coverage   88.71%   88.66%    -0.05%
==========================================
  Files         120      120
  Lines       20240    20386     +146
==========================================
+ Hits        17955    18075     +120
- Misses       2285     2311      +26
torchrl/trainers/helpers/envs.py (Outdated)

def retrieve_observation_norms_state_dict(proof_environment: TransformedEnv):
    """Traverse the transforms of the environment and retrieve the ObservationNorm state dicts.
Maybe "Traverses" + "retrieves"?
Also, for code objects we can use :obj:`ObservationNorm` for a cleaner rendering in the doc.
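For instance, the docstring could read (wording only, following the two suggestions above):

```python
def retrieve_observation_norms_state_dict(proof_environment: TransformedEnv):
    """Traverses the transforms of the environment and retrieves the
    :obj:`ObservationNorm` state dicts.
    """
```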
torchrl/trainers/helpers/envs.py (Outdated)
    num_iter: int = 1000,
    key: Union[str, Tuple[str, ...]] = None,
):
    """Calling init_stats on all uninitialised ObservationNorms transform of a TransformedEnv.
Maybe
Calls :obj:`ObservationNorm.init_stats` on all uninitialized :obj:`ObservationNorm` instances of a :obj:`TransformedEnv`.
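A hypothetical usage of the helper this docstring describes (import path taken from the file shown above; the env and default arguments are assumptions):

```python
from torchrl.envs import TransformedEnv
from torchrl.envs.libs.gym import GymEnv
from torchrl.envs.transforms import ObservationNorm
from torchrl.trainers.helpers.envs import initialize_observation_norm_transforms

env = TransformedEnv(GymEnv("Pendulum-v1"), ObservationNorm(in_keys=["observation"]))
# runs init_stats on every uninitialized ObservationNorm found in env.transform
initialize_observation_norm_transforms(proof_environment=env, num_iter=1000, key="observation")
```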
torchrl/trainers/helpers/envs.py (Outdated)
):
    """Calling init_stats on all uninitialised ObservationNorms transform of a TransformedEnv.
    If an ObservationNorm already has non-null loc or stats, it will be skipped.
If an :obj:`ObservationNorm` already has a non-null loc or scale, a call to :obj:`initialize_observation_norm_transforms` will be a no-op.
Similarly, if the transformed environment does not contain any :obj:`ObservationNorm`, a call to this function will have no effect.
vmoens left a comment
LGTM!
Description
Refactor examples to use ObservationNorm.init_stats instead of get_stats_random_rollout.
Motivation and Context
Closes #699
Types of changes
What types of changes does your code introduce? Remove all that do not apply:
Checklist
Go over all the following points, and put an x in all the boxes that apply. If you are unsure about any of these, don't hesitate to ask. We are here to help!