[Algorithm] Online Decision transformer #1149
Conversation
class ModifiedGPT2Model(GPT2Model):
Wrapper class to remove the wpe layer from the transformers GPT2Model. Maybe we can compress this even more?
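For readers without the diff at hand, here is a minimal sketch of the idea this comment describes, assuming the standard transformers GPT2Model; it is not the exact code from this PR. The wrapper zeroes and freezes the wpe weights so positions contribute nothing, since the Decision Transformer adds its own timestep embedding.

```python
import torch.nn as nn
from transformers import GPT2Config, GPT2Model


class ModifiedGPT2Model(GPT2Model):
    """GPT2Model with the learned positional embedding (wpe) disabled."""

    def __init__(self, config: GPT2Config):
        super().__init__(config)
        # wpe is nn.Embedding(config.max_position_embeddings, config.n_embd).
        # Zero and freeze its weights so position embeddings contribute nothing.
        nn.init.zeros_(self.wpe.weight)
        self.wpe.weight.requires_grad = False
```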
- This should run even if transformers isn't installed.
- Do we have dedicated tests?
- Is it integrated in the doc?
- The docstring is a bit cryptic for someone who doesn't know what it is all about.
I wish transformers had more modular code... What is the signature of wpe? In some cases we can simply replace the layer with nn.Identity()...
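To answer the signature question concretely, a quick sketch with standard GPT-2 defaults (1024 positions, 768-dim embeddings): wpe is an nn.Embedding that maps integer position ids of shape [batch, seq] to float embeddings of shape [batch, seq, n_embd], so nn.Identity() changes the output shape instead of passing embeddings through.

```python
import torch
import torch.nn as nn

wpe = nn.Embedding(1024, 768)  # GPT-2 defaults: n_positions=1024, n_embd=768

position_ids = torch.arange(20).unsqueeze(0)  # shape [1, 20], dtype long
print(wpe(position_ids).shape)                # torch.Size([1, 20, 768])

# nn.Identity() returns the integer ids unchanged ([1, 20]), which cannot
# be added to the [1, 20, 768] token embeddings; this matches the shape
# issues reported below.
print(nn.Identity()(position_ids).shape)      # torch.Size([1, 20])
```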
I tried using nn.Identity() but ran into shape issues. However, with all the fixes I made, training now converges even with the wpe layer in place. For comparison, I also ran a test where I replaced the wpe layer with a custom ZeroPosEmbeddingLayer that returns only zeros. In the graph, you can see the runs with wpe and with zero wpe.
Let me know what you think. For now, I left the ZeroPosEmbeddingLayer out since training converges without it, but I can add it back if needed.
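Since that layer was taken out of the PR, here is a sketch of what such a ZeroPosEmbeddingLayer might look like, reconstructed from the description above rather than taken from the actual code:

```python
import torch
import torch.nn as nn


class ZeroPosEmbeddingLayer(nn.Module):
    """Drop-in replacement for wpe that always returns zeros."""

    def __init__(self, num_positions: int, embedding_dim: int):
        super().__init__()
        self.embedding_dim = embedding_dim

    def forward(self, position_ids: torch.Tensor) -> torch.Tensor:
        # Same output shape as nn.Embedding: [*position_ids.shape, embedding_dim].
        return torch.zeros(
            *position_ids.shape, self.embedding_dim, device=position_ids.device
        )
```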
Oh, but at this point let's get rid of that class altogether, no?
Yes, I already removed it all. If you can have a final look, I think it should be ready now.
LGTM! A couple of last edits and we can ship this!! 🚀💪🏻
A million thanks for this feature @BY571!
Co-authored-by: vmoens <vincentmoens@gmail.com>
Co-authored-by: Mateusz Guzek <matguzek@meta.com>
Description
Implements the Online Decision Transformer paper (Zheng et al., 2022).
Motivation and Context
Why is this change required? What problem does it solve?
If it fixes an open issue, please link to the issue here.
You can use the syntax `close #15213` if this solves the issue #15213.
Types of changes
What types of changes does your code introduce? Remove all that do not apply:
Checklist
Go over all the following points, and put an `x` in all the boxes that apply.
If you are unsure about any of these, don't hesitate to ask. We are here to help!