[BugFix] Clone memmap tensors on regular tensors and other replay buffer improvements #340

vmoens · 2022-08-05T09:41:24Z

Description

This PR introduces several changes aimed at making replay buffer sampling more efficient:

MemmapTensor.clone() now returns a regular tensor. This is more intuitive than the previous behaviour, where another MemmapTensor was returned. To get another MemmapTensor, one can simply call MemmapTensor(memmap.clone()).
Indexing a tensordict now never returns a SubTensorDict. To get a SubTensorDict, one must build it explicitly by calling the appropriate dedicated method. This ensures that no overhead is introduced by working with this class.
Replay buffer storage is now cast to the appropriate device by the helper function. This makes data sampling faster and avoids creating tensors on CPU that are then cast to GPU (tensors are now created on GPU directly).
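
The clone semantics described in the first item can be illustrated with a small standard-library sketch. It does not use torchrl: `ToyMemmapArray` is a hypothetical stand-in for a disk-backed container, and the only thing it is meant to show is that `clone()` hands back a regular in-memory object that is decoupled from the backing file, while wrapping the clone again gives a new disk-backed object.

```python
import mmap
import struct
import tempfile
from array import array

ITEM = struct.calcsize("d")  # size in bytes of one native double


class ToyMemmapArray:
    """Hypothetical disk-backed array of doubles (NOT torchrl's MemmapTensor)."""

    def __init__(self, values):
        payload = array("d", values).tobytes()
        # Store the values in a temporary file and map that file into memory.
        self._file = tempfile.TemporaryFile()
        self._file.write(payload)
        self._file.flush()
        self._map = mmap.mmap(self._file.fileno(), len(payload))

    def __getitem__(self, index):
        # Read one double straight from the mapping.
        return struct.unpack_from("d", self._map, index * ITEM)[0]

    def __setitem__(self, index, value):
        # Write one double through the mapping, i.e. to the backing file.
        struct.pack_into("d", self._map, index * ITEM, value)

    def clone(self):
        # Mirrors the behaviour described above: the clone is a regular
        # in-memory array, not another disk-backed object.
        out = array("d")
        out.frombytes(self._map[:])
        return out

    def close(self):
        self._map.close()
        self._file.close()


mm = ToyMemmapArray([1.0, 2.0, 3.0])
snapshot = mm.clone()                   # regular in-memory copy
mm[0] = 10.0                            # later writes do not affect the copy
rewrapped = ToyMemmapArray(mm.clone())  # analogous to MemmapTensor(memmap.clone())
```

Here `snapshot` keeps the values it was cloned with, and `rewrapped` is a separate disk-backed object built from the current contents.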

vmoens · 2022-08-07T20:07:19Z

torchrl/trainers/helpers/losses.py

 )
 from torchrl.objectives.costs.common import LossModule
-from torchrl.objectives.costs.redq import REDQLoss
+from torchrl.objectives.costs.deprecated import REDQLoss_deprecated


REDQLoss is memory intensive when used with large models.
Some work is needed to make it less memory intensive.

init

4b2250e

facebook-github-bot added the CLA Signed label Aug 5, 2022

vmoens added 16 commits August 5, 2022 11:50

amend

a8db936

amend

eb133e4

amend

757ad8a

amend

5c96b2d

amend

f49eb17

amend

c6d330f

amend

c913967

amend

56939ce

amend

7b3992d

amend

1a158fa

amend

539f70e

amend

bac68a8

amend

68dea20

amend

74cf214

Merge branch 'main' into clone_memmap

ff535ba

amend

e6ee239

vmoens added the bug label Aug 6, 2022

vmoens changed the title ~~[BugFix] Clone memmap tensors on regular tensors~~ [BugFix] Clone memmap tensors on regular tensors and other replay buffer improvements Aug 6, 2022

vmoens added 10 commits August 6, 2022 13:02

amend

8444488

SubMemmapTensor (wip)

add79ff

amend

a6b4f51

Merge branch 'main' into clone_memmap

8ef1614

Memmap -> Tensor

002c3c8

Memmap -> Tensor

66e7dcc

Tensor -> list

59e453d

amend

51c7a77

amend

09b9924

amend

3d47bd6

vmoens added 23 commits August 7, 2022 09:44

no pin mem

493f604

no pin mem

e59cf49

no pin mem

819b4c5

no pin mem

3c488ce

no pin mem

ecd809d

no pin mem

a96d766

no pin mem

0c24d22

LazyMemmapStorage

06d536e

no backward

b43b2c3

no backward

55c8f11

no backward

4b714b7

empty cache

f34c337

REDQLoss_deprecated

0351f94

REDQLoss_deprecated

7ae81dc

REDQLoss_deprecated

29d2aa6

amend

c49a995

amend

55fd1eb

del

84d0bb1

erase cache

ff9b127

erase cache

952ae50

non_blocking

d734c71

amend

181982b

remove sub-memmap

75b9ad1

vmoens mentioned this pull request Aug 7, 2022

[Feature Request] Document get_sub_tensordict in the TensorDict tutorial #350

Open

vmoens commented Aug 7, 2022


vmoens added 3 commits August 7, 2022 21:08

cleanup

bf1a21c

amend

78e059c

amend

004ae89

vmoens merged commit c61ae7b into main Aug 8, 2022

vmoens deleted the clone_memmap branch August 8, 2022 06:55
