Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

pytorch / rl Public

Notifications You must be signed in to change notification settings
Fork 308
Star 2.3k

Code
Issues 157
Pull requests 70
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

[Feature] AbsorbingStateTransform #2290

Draft

BY571 wants to merge 1 commit into pytorch:main

base: main

Choose a base branch

Loading

Loading

from BY571:absorbing_state

Draft

[Feature] AbsorbingStateTransform #2290

BY571 wants to merge 1 commit into pytorch:main from BY571:absorbing_state

Conversation 13 Commits 1 Checks 55 Files changed

Conversation

Copy link

Contributor

BY571 commented Jul 12, 2024

Description

Adds AbsorbingStateTransform as used in the DAC paper.

Motivation and Context

Why is this change required? What problem does it solve?
If it fixes an open issue, please link to the issue here.
You can use the syntax close #15213 if this solves the issue #15213

I have raised an issue to propose this change (required for new features and bug fixes)

Types of changes

What types of changes does your code introduce? Remove all that do not apply:

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds core functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (update in the documentation)
Example (update in the folder of examples)

Checklist

Go over all the following points, and put an x in all the boxes that apply.
If you are unsure about any of these, don't hesitate to ask. We are here to help!

I have read the CONTRIBUTION guide (required)
My change requires a change to the documentation.
I have updated the tests accordingly (required for a bug fix or a new feature).
I have updated the documentation accordingly.

Sorry, something went wrong.

All reactions


          init

1d43d8b

Copy link

pytorch-bot bot commented Jul 12, 2024 •

edited

Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2290

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 5 New Failures, 1 Unrelated Failure

As of commit 1d43d8b with merge base 8e43ac8 ():

NEW FAILURES - The following jobs have failed:

Continuous Benchmark (PR) / CPU Pytest benchmark (gh)
Workflow failed! Resource not accessible by integration
Continuous Benchmark (PR) / GPU Pytest benchmark (gh)
Workflow failed! Resource not accessible by integration
Habitat Tests on Linux / tests (3.9, 12.1) / linux-job (gh)
RuntimeError: Command docker exec -t b2d2055310791b7d668c6c783b9255861b3b75ca347af5ebdd2edfc5ba71c728 /exec failed with exit code 139
Libs Tests on Linux / unittests-gym (3.9, 12.1) / linux-job (gh)
AttributeError: module 'torch' has no attribute 'compiler'
Unit-tests on Linux / tests-olddeps (3.8, 11.6) / linux-job (gh)
AttributeError: module 'torch' has no attribute 'compiler'

BROKEN TRUNK - The following job failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

Unit-tests on Windows / unittests-cpu / windows-job (gh) (trunk failure)
test/test_transforms.py::TestActionDiscretizer::test_trans_parallel_env_check[False]

This comment was automatically generated by Dr. CI and updates every 15 minutes.

All reactions

Sorry, something went wrong.

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label

Copy link

Contributor Author

BY571 commented Jul 12, 2024

Will update the docstring with examples.
@vmoens might need your help on the tests.

All reactions

Sorry, something went wrong.

vmoens approved these changes

View reviewed changes

Copy link

Contributor

vmoens left a comment

There was a problem hiding this comment.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for this!
Not entirely sure about the implementation, I think it'll break in many (edge) cases.
Can you open an issue asking for the feature to talk about the proper way of handling this?

Sorry, something went wrong.

All reactions

torchrl/envs/transforms/transforms.py

+                      >>> from torchrl.envs import GymEnv
+                      >>> t = AbsorbingStateTransform(max_episode_length=1000)
+                      >>> base_env = GymEnv("HalfCheetah-v4")
+                      >>> env = TransformedEnv(base_env, t)

Copy link

Contributor

vmoens Jul 12, 2024

There was a problem hiding this comment.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That's not very informative about the functionality ;)

Sorry, something went wrong.

All reactions

torchrl/envs/transforms/transforms.py

+                      terminate_key: Optional[NestedKey] = "terminated",
+                  ):
+                      if in_keys is None:
+                          in_keys = "observation"  # default

Copy link

Contributor

vmoens Jul 12, 2024

There was a problem hiding this comment.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

["observation"] no?

Sorry, something went wrong.

All reactions

torchrl/envs/transforms/transforms.py

+                          batch_size = observation.size(0)
+                          if self._done:
+                              # Create absorbing states for the batched observations
+                              absorbing_state = torch.eye(observation.size(1) + 1)[-1]

Copy link

Contributor

vmoens Jul 12, 2024

There was a problem hiding this comment.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is rather wasteful, we creating a big tensor and indexing it, plus this is a view on a storage hence the original storage isn't cleared when you index.

Besides it lacks dtype and device.

You can create an incomplete eye with m and n, see the doc here

Sorry, something went wrong.

All reactions

torchrl/envs/transforms/transforms.py

+                              # Create absorbing states for the batched observations
+                              absorbing_state = torch.eye(observation.size(1) + 1)[-1]
+                              return absorbing_state.expand(batch_size, -1)
+                          zeros = torch.zeros(batch_size, 1)

Copy link

Contributor

vmoens Jul 12, 2024

There was a problem hiding this comment.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Missing device and dtype

You could use observation.new_zeros

Sorry, something went wrong.

All reactions

torchrl/envs/transforms/transforms.py

@@ @@ -8557,3 +8557,157 @@ def _inv_call(self, tensordict): @@
                       if self.sampling == self.SamplingStrategy.RANDOM:
                           action = action + self.jitters * torch.rand_like(self.jitters)
                       return tensordict.set(self.in_keys_inv[0], action)
+              class AbsorbingStateTransform(ObservationTransform):

Copy link

Contributor

vmoens Jul 22, 2024

There was a problem hiding this comment.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We need tests for this class
It should be registered in the __init__.py and put in the doc.

Sorry, something went wrong.

All reactions

torchrl/envs/transforms/transforms.py

+                  def forward(self, tensordict: TensorDictBase) -> TensorDictBase:
+                      raise RuntimeError(FORWARD_NOT_IMPLEMENTED.format(type(self)))
+                  def _apply_transform(self, observation: torch.Tensor) -> torch.Tensor:

Copy link

Contributor

vmoens Jul 22, 2024

There was a problem hiding this comment.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

there is a version of this that works for all batch sizes. This one will only work with uni of bidimensional batch sizes.

Sorry, something went wrong.

All reactions

torchrl/envs/transforms/transforms.py

+                      elif observation.dim() == 2:
+                          # Batched observations
+                          batch_size = observation.size(0)
+                          if self._done:

Copy link

Contributor

vmoens Jul 22, 2024

There was a problem hiding this comment.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do we need an in-place value? How does that work if one sub-env is done and the other not?
Maybe we could read the done state and change it on the fly, without using local attribute

Sorry, something went wrong.

All reactions

torchrl/envs/transforms/transforms.py

+                              )
+                              return tensordict
+                      done = tensordict.get(self.done_key)
+                      self._done = done.any()

Copy link

Contributor

vmoens Jul 22, 2024

There was a problem hiding this comment.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this means that if any sub-env is done all are done?

Sorry, something went wrong.

All reactions

torchrl/envs/transforms/transforms.py

+                          # Single observation
+                          if self._done:
+                              # Return absorbing state which is [0, ..., 0, 1]
+                              return torch.eye(observation.size(0) + 1)[-1]

Copy link

Contributor

vmoens Jul 22, 2024

There was a problem hiding this comment.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what if the observation is more than 1d?

Sorry, something went wrong.

All reactions

vmoens requested changes

View reviewed changes

Copy link

Contributor

vmoens left a comment

There was a problem hiding this comment.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry wrongfully approved

Sorry, something went wrong.

All reactions

vmoens added enhancement

New feature or request

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. and removed CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. labels

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Reviewers

vmoens vmoens requested changes

Assignees

No one assigned

Labels

This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.

New feature or request

Projects

None yet

Milestone

No milestone

Development

Successfully merging this pull request may close these issues.

3 participants

Add this suggestion to a batch that can be applied as a single commit. This suggestion is invalid because no changes were made to the code. Suggestions cannot be applied while the pull request is closed. Suggestions cannot be applied while viewing a subset of changes. Only one suggestion per line can be applied in a batch. Add this suggestion to a batch that can be applied as a single commit. Applying suggestions on deleted lines is not supported. You must change the existing code in this line in order to create a valid suggestion. Outdated suggestions cannot be applied. This suggestion has been applied or marked resolved. Suggestions cannot be applied from pending reviews. Suggestions cannot be applied on multi-line comments. Suggestions cannot be applied while the pull request is queued to merge. Suggestion cannot be applied right now. Please check back later.

Footer

© 2024 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.