Merge Diffuser Agent #301

LiuTaowen-Tony · 2024-01-21T17:22:51Z

Description

Implemented the Decision Diffuser. Since this will introduce quite a lot of breaking changes to the train-eval loop, I am opening an initial pull request to discuss the API changes.
Decision Diffuser.pdf

The main contributions are:

implemented DecisionDiffuserActor, which integrates seamlessly with the other components
implemented the training and testing loops in DecisionDiffuserAlgorithm
implemented a conditional offline trajectory dataset for DecisionDiffuserAlgorithm training
implemented an environment for the constraint composition demo
adapted the model code from the original repo (still needs refactoring and mypy fixes)

Motivation and Context

Initial task for Internship.

Types of changes

What types of changes does your code introduce? Put an x in all the boxes that apply:

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds core functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (update in the documentation)

Checklist

Go over all the following points, and put an x in all the boxes that apply.
If you are unsure about any of these, don't hesitate to ask. We are here to help!

I have read the CONTRIBUTION guide. (required)
My change requires a change to the documentation.
I have updated the tests accordingly. (required for a bug fix or a new feature)
I have updated the documentation accordingly.
I have reformatted the code using make format. (required)
I have checked the code using make lint. (required)
I have ensured make test pass. (required)

LiuTaowen-Tony · 2024-02-01T17:56:23Z

@Gaiejj

Hey Jiayi, could you please help me review the changes I made? We can then discuss the design decisions for dataset and evaluation integration.

Gaiejj · 2024-02-05T16:31:58Z

I will conduct a code review soon. Please check the items that failed in CI, namely Tests and Lint; fixing them would improve the code quality. You can run a self-check locally with make lint and make test.

LiuTaowen-Tony · 2024-02-05T16:33:15Z

Sure.


Gaiejj

I found a number of minor typos; correcting them would improve this PR.

Gaiejj · 2024-03-04T04:53:07Z

examples/collect_offline_data.py

@@ -26,7 +26,7 @@
 env_name = 'SafetyAntVelocity-v1'
 size = 1_000_000
 agents = [
-    ('PATH_TO_AGENT', 'epoch-500.pt', 1_000_000),
+    ('train/PPOLag-{SafetyAntVelocity-v1}/seed-000-2024-01-07-21-14-30', 'epoch-500.pt', 1_000_000),


Should we change it back to a more general form like 'PATH_TO_AGENT', and use 'train/PPOLag-{SafetyAntVelocity-v1}/seed-000-2024-01-07-21-14-30' as an example in the docs?

Gaiejj · 2024-03-04T04:53:36Z

examples/train_eval_diffuser.py

@@ -0,0 +1,110 @@
+# Copyright 2022-2024 OmniSafe Team. All Rights Reserved.


This should be 2024 OmniSafe Team. The same goes for the other files.

Gaiejj · 2024-03-04T04:53:58Z

examples/train_eval_diffuser.py

+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+


Missing # ==============================================================================
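For reference, a sketch of the header these two comments seem to ask for, together with a small check that a file starts with it. Only the copyright line, the last three license lines, and the separator appear in this thread; the middle lines are assumed to be the standard Apache 2.0 notice, and `EXPECTED_HEADER` and `has_expected_header` are hypothetical names, not part of OmniSafe.

```python
# Assumed header for new files. The middle lines are the standard Apache 2.0
# notice; only the first line, the last three license lines, and the separator
# are confirmed by this review thread.
EXPECTED_HEADER = """\
# Copyright 2024 OmniSafe Team. All Rights Reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# ==============================================================================
"""


def has_expected_header(source: str) -> bool:
    """Return True if the file contents begin with the expected header."""
    return source.startswith(EXPECTED_HEADER)
```

A check like this could be run over the newly added files before pushing, for example `has_expected_header(open(path).read())`.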

Gaiejj · 2024-03-04T04:54:28Z

examples/train_eval_diffuser.py

+actor: DecisionDiffuserActor = agent._actor
+
+
+def cls_free_cond(actor: DecisionDiffuserActor) -> None:


The docstring should be Google style.
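A minimal sketch of what a Google-style docstring for this function might look like. The wording is an assumption: "cls free" is read here as "classifier-free", the parameter description is a guess, and the body is left out because only the docstring layout is being illustrated. The annotation uses `object` so the snippet runs without importing OmniSafe.

```python
def cls_free_cond(actor: object) -> None:
    """Condition on both the state and the classifier-free condition.

    Args:
        actor (DecisionDiffuserActor): The actor whose sampling is conditioned.
    """
    # Body intentionally omitted: this sketch only shows the docstring layout.
```

The summary line comes first, followed by a blank line and an `Args:` section with one indented entry per parameter.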

Gaiejj · 2024-03-04T04:54:50Z

examples/train_eval_diffuser.py

+    """
+    condition on both state and cls free condition
+
+


Gaiejj · 2024-03-04T04:55:37Z

omnisafe/algorithms/offline/decision_diffuser.py

+    """Decision Diffuser algorithm.
+
+    References:
+        - Something.


What is "Something"? Please replace the placeholder with the actual reference.

Gaiejj · 2024-03-04T04:56:44Z

omnisafe/algorithms/offline/decision_diffuser.py

+        +-------------------------+----------------------------------------------------+
+        | Things to log           | Description                                        |
+        +=========================+====================================================+
+        | Loss/Loss           | Loss of Diffusion and InvAR network               |


Misaligned table.
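One way to keep such rows aligned is to pad each cell to its column width instead of typing the spaces by hand. The sketch below uses hypothetical helpers (`grid_border`, `grid_row`), and the column widths of 25 and 52 are read off the header row in the snippet above, so treat them as assumptions.

```python
def grid_border(widths: tuple[int, ...], fill: str = '-') -> str:
    """Return a grid-table border line such as +-----+-----+."""
    return '+' + '+'.join(fill * width for width in widths) + '+'


def grid_row(cells: tuple[str, ...], widths: tuple[int, ...]) -> str:
    """Return a grid-table row with every cell padded to its column width."""
    # One leading space, then the text left-aligned in the remaining width.
    padded = [f' {cell:<{width - 1}}' for cell, width in zip(cells, widths)]
    return '|' + '|'.join(padded) + '|'


WIDTHS = (25, 52)  # assumed from the header row of the docstring table
border = grid_border(WIDTHS)
row = grid_row(('Loss/Loss', 'Loss of Diffusion and InvAR network'), WIDTHS)
```

With these helpers, `row` has the same length as `border`, so the cell separators line up with the `+` characters in the border.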

LiuTaowen-Tony · 2024-03-04T17:42:17Z

Any clue about the test errors? I suspect we have the wrong wandb version or something similar.

codecov · 2024-03-12T09:57:58Z

Codecov Report

Attention: Patch coverage is 29.06250%, with 454 lines in your changes missing coverage. Please review.

Project coverage is 91.20%. Comparing base (51a2692) to head (17ff03b).

❗ Current head 17ff03b differs from pull request most recent head d9ecd78. Consider uploading reports for the commit d9ecd78 to get more accurate results

Files	Patch %	Lines
omnisafe/models/diffuser/diffusion.py	16.57%	141 Missing ⚠️
omnisafe/models/diffuser/temporal_unet.py	21.09%	101 Missing ⚠️
omnisafe/common/offline/sequence_dataset.py	21.84%	68 Missing ⚠️
omnisafe/algorithms/offline/decision_diffuser.py	25.88%	63 Missing ⚠️
omnisafe/models/diffuser/helpers.py	42.86%	36 Missing ⚠️
omnisafe/envs/legacy_env.py	48.84%	22 Missing ⚠️
omnisafe/models/actor/decision_diffuser_actor.py	48.39%	16 Missing ⚠️
omnisafe/utils/ema.py	53.85%	6 Missing ⚠️
omnisafe/models/actor/actor_builder.py	66.67%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #301      +/-   ##
==========================================
- Coverage   96.89%   91.20%   -5.69%     
==========================================
  Files         138      148      +10     
  Lines        7000     7635     +635     
==========================================
+ Hits         6782     6963     +181     
- Misses        218      672     +454
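The project-level percentages in the diff above follow directly from the hit and line counts. A quick check, with a helper name (`coverage_percent`) chosen here purely for illustration:

```python
def coverage_percent(hits: int, lines: int) -> float:
    """Return coverage as a percentage rounded to two decimal places."""
    return round(100 * hits / lines, 2)


base = coverage_percent(6782, 7000)  # main before this PR
head = coverage_percent(6963, 7635)  # head commit of this PR
delta = round(head - base, 2)        # change in project coverage
```

This gives 96.89 for the base, 91.2 for the head, and a change of -5.69 percentage points, matching the report.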



LiuTaowen-Tony and others added 4 commits January 12, 2024 12:47

feat: add diffuser

f706b07

style: fix style for new added code

021a627

style: make mypy happy

b3460d7

Merge branch 'PKU-Alignment:main' into diffuser

354f87d

LiuTaowen-Tony marked this pull request as draft January 21, 2024 17:23

LiuTaowen-Tony mentioned this pull request Jan 21, 2024

Update DecisionDiffuser Algorithm to omnisafe #300

Closed

LiuTaowen-Tony added 8 commits February 17, 2024 06:04

fixing docstring and type annotations

de1d4f3

Merge branch 'diffuser' of https://github.com/LiuTaowen-Tony/omnisafe into diffuser

ef1bd48

improve docstring for temporal unets

1ede594

improve docstring for temporal unets

de6924b

improve pass lint

86ab63f

add dependency einops

a223470

add dependency gym for custom env

7eba840

fix typo

e862a9a

Gaiejj reviewed Mar 4, 2024


fix: fix torch desperatation

6367dae

Gaiejj and others added 4 commits March 12, 2024 18:00

docs: fix typo

17ff03b

fix style

c940633

Merge branch 'diffuser' of https://github.com/LiuTaowen-Tony/omnisafe into diffuser

46ce5a6

fix style

d9ecd78
