refactor: change wrapper setting #73

Gaiejj · 2023-01-11T04:00:04Z

Description

refactor: change wrapper setting

Motivation and Context

Why is this change required? What problem does it solve?
If it fixes an open issue, please link to the issue here.
You can use the syntax close #15213 if this solves the issue #15213

I have raised an issue to propose this change (required for new features and bug fixes)

Types of changes

What types of changes does your code introduce? Put an x in all the boxes that apply:

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds core functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (update in the documentation)

Checklist

Go over all the following points, and put an x in all the boxes that apply.
If you are unsure about any of these, don't hesitate to ask. We are here to help!

I have read the CONTRIBUTION guide. (required)
My change requires a change to the documentation.
I have updated the tests accordingly. (required for a bug fix or a new feature)
I have updated the documentation accordingly.
I have reformatted the code using make format. (required)
I have checked the code using make lint. (required)
I have ensured make test pass. (required)

zmsn-2077 · 2023-01-11T06:05:51Z

omnisafe/algorithms/off_policy/ddpg.py

@@ -13,15 +13,15 @@
 # limitations under the License.
 # ==============================================================================
 """Implementation of the DDPG algorithm."""
-


This blank line is required.
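For reference, a minimal sketch of how this could be checked mechanically. The helper `has_blank_line_after_docstring` is hypothetical and not part of OmniSafe or of any linter; it only illustrates the layout being asked for (module docstring, then a blank line, then the imports).

```python
import ast


def has_blank_line_after_docstring(source: str) -> bool:
    """Return True if the line right after the module docstring is blank.

    Hypothetical helper for illustration only.
    """
    module = ast.parse(source)
    # Nothing to check if there is no docstring or nothing follows it.
    if ast.get_docstring(module, clean=False) is None or len(module.body) < 2:
        return True
    lines = source.splitlines()
    # end_lineno is 1-indexed, so it is also the 0-based index of the next line.
    next_index = module.body[0].end_lineno
    if next_index >= len(lines):
        return False
    return lines[next_index].strip() == ''


with_blank = '"""Implementation of the DDPG algorithm."""\n\nfrom typing import Dict\n'
without_blank = '"""Implementation of the DDPG algorithm."""\nfrom typing import Dict\n'

print(has_blank_line_after_docstring(with_blank))
print(has_blank_line_after_docstring(without_blank))
```

The same check would apply to the other off-policy files flagged in this review.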


zmsn-2077 · 2023-01-11T06:20:47Z

omnisafe/algorithms/off_policy/sac.py

@@ -13,6 +13,7 @@
 # limitations under the License.
 # ==============================================================================
 """Implementation of the SAC algorithm."""


A blank line is required here.

zmsn-2077 · 2023-01-11T06:21:47Z

omnisafe/algorithms/off_policy/sac_lag.py

@@ -13,25 +13,27 @@
 # limitations under the License.
 # ==============================================================================
 """Implementation of the Lagrange version of the SAC algorithm."""


A blank line is required here.

zmsn-2077 · 2023-01-11T06:24:01Z

omnisafe/algorithms/off_policy/sddpg.py

@@ -13,6 +13,7 @@
 # limitations under the License.
 # ==============================================================================
 """Implementation of the SDDPG algorithm."""
+from typing import Dict, NamedTuple, Tuple


A blank line is required here, between the docstring and the import.

zmsn-2077 · 2023-01-11T06:24:43Z

omnisafe/algorithms/off_policy/td3.py

@@ -13,6 +13,7 @@
 # limitations under the License.
 # ==============================================================================
 """Implementation of the TD3 algorithm."""


A blank line is required here.

zmsn-2077 · 2023-01-11T06:25:17Z

omnisafe/algorithms/off_policy/td3_lag.py

@@ -13,23 +13,27 @@
 # limitations under the License.
 # ==============================================================================
 """Implementation of the Lagrange version of the TD3 algorithm."""


A blank line is required here.

zmsn-2077 · 2023-01-11T06:27:17Z

omnisafe/algorithms/on_policy/early_terminated/ppo_early_terminated.py

@@ -28,6 +30,11 @@ class PPOEarlyTerminated(PPO):
        URL: https://arxiv.org/abs/2107.04200


Why does this line differ from the References comment used in the off-policy algorithms?

zmsn-2077 · 2023-01-11T06:32:36Z

omnisafe/algorithms/on_policy/first_order/focops.py

-        """Update."""
-        raw_data, data = self.buf.pre_process_data()
-        # First update Lagrange multiplier parameter
+    # pylint: disable=too-many-locals


Should we use a pylint disable here?
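If the pragma stays, one option is to narrow its scope. As far as I understand pylint's message control, a `# pylint: disable=...` comment on its own line applies to the rest of the enclosing block, while the same comment at the end of a `def` line only affects that function. A minimal sketch, using a hypothetical `Updater` class and method names that are not taken from this PR:

```python
from typing import Dict, List


class Updater:
    """Hypothetical stand-in for an algorithm class; not OmniSafe code."""

    def update(  # pylint: disable=too-many-locals
        self, costs: List[float], cost_limit: float
    ) -> Dict[str, float]:
        # The trailing pragma on the def line is limited to this method.
        mean_cost = sum(costs) / len(costs)
        violation = mean_cost - cost_limit
        return {'mean_cost': mean_cost, 'violation': violation}

    def log(self) -> str:
        # This method is still checked for too-many-locals.
        return 'checked'


stats = Updater().update([1.0, 3.0], cost_limit=1.5)
print(stats)
```

The alternative is to split the update into smaller helpers so that the check passes without any pragma.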

zmsn-2077 · 2023-01-11T06:34:31Z

omnisafe/algorithms/on_policy/naive_lagrange/ppo_lag.py

 from omnisafe.common.lagrange import Lagrange


 @registry.register
-class PPOLag(PolicyGradient, Lagrange):
+class PPOLag(PPO, Lagrange):
    """The Lagrange version of the PPO algorithm.



Do I need to cite the Safety Gym paper here?
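A side note on the base-class change in this hunk: switching from `PolicyGradient` to `PPO` changes which parent methods `PPOLag` resolves first. The sketch below uses empty stub classes with the same names purely for illustration; the real classes and their methods are not reproduced here, and `PPO` subclassing `PolicyGradient` is an assumption.

```python
class PolicyGradient:
    """Stub; assumed base of PPO for this illustration."""

    def loss_name(self) -> str:
        return 'vanilla policy gradient'


class PPO(PolicyGradient):
    """Stub that overrides the hypothetical loss_name method."""

    def loss_name(self) -> str:
        return 'clipped surrogate'


class Lagrange:
    """Stub mixin with a hypothetical method."""

    def multiplier_name(self) -> str:
        return 'lagrangian_multiplier'


class PPOLag(PPO, Lagrange):
    """Mirrors the new class header in the diff."""


# Python resolves attributes left to right along the MRO.
mro_names = [cls.__name__ for cls in PPOLag.__mro__]
print(mro_names)
print(PPOLag().loss_name())
```

With this ordering, anything `PPO` overrides takes precedence over `PolicyGradient`, and `Lagrange` is consulted only after both.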

zmsn-2077 · 2023-01-11T06:35:58Z

omnisafe/algorithms/on_policy/second_order/cpo.py

-        URL: https://arxiv.org/abs/1705.10528
+        - Title: Constrained Policy Optimization
+        - Authors: Joshua Achiam, David Held, Aviv Tamar, Pieter Abbeel.
+        - URL: https://arxiv.org/abs/1705.10528


zmsn-2077 · 2023-01-11T06:36:39Z