
0.5.0 #76

Merged — 96 commits from the 0.5.0 branch into main on Aug 16, 2023
Conversation


@StoneT2000 (Member) commented Apr 1, 2023

Draft PR for upgrading ManiSkill2 to Gymnasium, as well as making various improvements as discussed with @Jiayuan-Gu

Changes:

  • Upgrade the API to Gymnasium
  • Improve reward functions by scaling to [0, 1] and verifying they work (@xuanlinli17)
  • Remove goal visuals
  • Update environment state representations
  • Update Stable Baselines notebook tutorials to Gymnasium versions
  • Update Stable Baselines scripts to Gymnasium versions
  • Add CleanRL-style baselines
  • Add semi-automated pytests on all environments (@StoneT2000)
  • Move downloadable files to Google Storage and avoid using Google Drive

Breaking Changes

  • env.render now accepts no arguments. The old render modes are separated out into their own functions; env.render calls the appropriate one based on the env.render_mode attribute (usually set upon env creation).
  • env.step returns observation, reward, terminated, truncated, info. See https://gymnasium.farama.org/content/migration-guide/#environment-step for details. For ManiSkill2, the old done signal is now called terminated, and truncated is False. All environments default to 200 max episode steps, so truncated=True after 200 steps.
  • env.reset returns a tuple observation, info. For ManiSkill2, info is always an empty dictionary. Moreover, env.reset accepts two new keyword arguments: seed: int and options: dict | None. Note that options is usually used to configure various random settings/numbers of an environment. Previously, ManiSkill2 used custom keyword arguments such as reconfigure. These keyword arguments are still usable but must be passed through an options dict, e.g. env.reset(options=dict(reconfigure=True)).
  • env.seed has now been removed in favor of using env.reset(seed=val) per the Gymnasium API.
  • The ManiSkill VectorEnv is now also modified to adhere to the Gymnasium Vector Env API. Note this means that vec_env.observation_space and vec_env.action_space are batched under the new API, and the individual environment spaces are defined as vec_env.single_observation_space and vec_env.single_action_space.
  • All reward functions are now scaled to the range [0, 1], which generally makes any value-learning approach more stable and avoids gradient explosions. On any environment, a reward of 1 indicates success, which is also indicated by the boolean stored in info["success"]. The scaled dense reward is the new default reward function and is called normalized_dense. To use the old (<0.5.0) ManiSkill2 dense rewards, set reward_mode to dense.
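Existing training loops written against the old 4-tuple step API can be adapted with a small shim. step_compat below is a hypothetical helper for illustration, not part of ManiSkill2:

```python
def step_compat(step_result):
    """Collapse a Gymnasium-style 5-tuple (obs, reward, terminated,
    truncated, info) into the old 4-tuple (obs, reward, done, info)."""
    obs, reward, terminated, truncated, info = step_result
    return obs, reward, terminated or truncated, info

# Usage in an old-style loop (env is any Gymnasium-API environment):
#   obs, reward, done, info = step_compat(env.step(action))
```

Note that collapsing terminated and truncated into a single done loses the distinction between task success/failure and a time-limit cutoff, which matters for bootstrapping in value-based RL.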

New Additions

Code

  • Environment code comes with separate render functions corresponding to the old render modes. There is now env.render_human for creating an interactive GUI and viewer, env.render_rgb_array for generating RGB images of the current env from a third-person perspective, and env.render_cameras which renders all the cameras (including rgb, depth, and segmentation if available) and compacts them into one RGB image that is returned. Note that human and rgb_array are used only for visualization purposes; they may include artifacts such as goal indicators, see PickCube-v0 or PandaAvoidObstacles-v0 for examples. The cameras mode reflects the actual visual observations returned by calls to env.reset and env.step.
  • The ManiSkill2 VecEnv creator function make_vec_env now accepts a max_episode_steps argument which overrides the default max_episode_steps specified when registering the environment. The default max_episode_steps is 200 for all environments, but note it may be more efficient for RL training and evaluation to use a smaller value as shown in the RL tutorials.
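The render dispatch described above can be sketched in plain Python. The class and return values are illustrative only, not the actual ManiSkill2 implementation:

```python
class RenderDispatchSketch:
    """Illustrative sketch: mimics how env.render() can dispatch to a
    mode-specific render function based on the render_mode attribute."""

    def __init__(self, render_mode="human"):
        self.render_mode = render_mode  # usually fixed at env creation

    def render_human(self):
        return "interactive GUI"

    def render_rgb_array(self):
        return "third-person RGB image"

    def render_cameras(self):
        return "compacted camera observations"

    def render(self):
        # Look up the render function whose name matches the configured mode.
        return getattr(self, f"render_{self.render_mode}")()
```

With this pattern, env.render() takes no arguments, and switching visualization styles means constructing the environment with a different render_mode rather than passing a mode per call.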

Tutorials

  • All tutorials have been updated to reflect the new Gymnasium API and the new Stable Baselines 3, and should be more stable on Google Colab

Not Code

  • A new CONTRIBUTING.md document has been added, with details on how to develop ManiSkill2 locally and test it

Bug Fixes

Miscellaneous Changes

  • Dockerfile now accepts a python version as an argument
  • README and documentation updated to reflect new gym API
  • The mani_skill2.examples.demo_vec_env module now accepts a --vecenv-type argument, which can be either ms2 or gym and defaults to ms2, letting users benchmark the speed difference themselves. The module was also cleaned up to print more nicely.
  • Example scripts with main functions now accept an args argument, allowing those scripts to be used from within Python rather than only from the CLI. Used for testing purposes.
  • Silenced extraneous output in some example scripts
  • Trajectory replay accepts a new --count argument that lets you specify how many trajectories to replay. There is no data shuffling, so the replayed trajectories are always the same and in the same order. The default is None, meaning all trajectories are replayed.
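The --count selection logic can be mirrored by a small helper; select_trajectories is a hypothetical function illustrating the behavior described above, not the actual replay implementation:

```python
def select_trajectories(trajectory_ids, count=None):
    """Pick which trajectories to replay. No shuffling: order is
    deterministic, and count=None means replay everything."""
    ids = list(trajectory_ids)
    if count is None:
        return ids
    return ids[:count]
```

Because selection is a simple prefix slice, repeated runs with the same --count always replay the same trajectories in the same order.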

@Jiayuan-Gu Jiayuan-Gu self-requested a review August 14, 2023 08:13
@Jiayuan-Gu Jiayuan-Gu self-assigned this Aug 14, 2023
@Jiayuan-Gu (Contributor) left a comment:

Please also format the directory with black (but look over which files get formatted, since pyproject.toml might need updating to filter which files should be formatted).

requirements.txt (outdated, review thread resolved)
mani_skill2/__init__.py (outdated, review thread resolved)
Review comment on a diff hunk in the observation wrapper (old line, then replacement lines):

    depth2 = rgbd[..., 7:8] / (2**10)
    depth1 = rgbd[..., 3:4]
    depth2 = rgbd[..., 7:8]
    if not scale_rgb_only:
A contributor commented:
You can remove this if-statement if scale_rgb_only is always True, or add a comment explaining why depth is normalized by dividing by (2 ** 10).
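As an illustration of the normalization the comment asks about (the divisor 2 ** 10 comes from the snippet above; the interpretation of the raw range is an assumption for illustration):

```python
DEPTH_SCALE = 2 ** 10  # divisor taken from the diff above

def normalize_depth(raw_depth):
    # Maps raw depth values in [0, DEPTH_SCALE) into [0.0, 1.0),
    # assuming DEPTH_SCALE is the maximum representable depth value.
    return raw_depth / DEPTH_SCALE
```

The reviewer's point stands either way: without a comment like the one above, the magic constant 2 ** 10 is opaque to future readers.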

mani_skill2/utils/wrappers/observation.py (outdated, review thread resolved)
mani_skill2/utils/registration.py (outdated, review thread resolved)
mani_skill2/vector/wrappers/sb3.py (review thread resolved)
tests/manual_test_venv.py (outdated, review thread resolved)
@Jiayuan-Gu Jiayuan-Gu marked this pull request as ready for review August 14, 2023 09:00
@Jiayuan-Gu (Contributor) left a comment:

LGTM

@StoneT2000 StoneT2000 merged commit 0a5e5b8 into main Aug 16, 2023
@StoneT2000 StoneT2000 deleted the 0.5.0 branch August 23, 2023 17:16