Integration of the skrl RL library by Toni-SM · Pull Request #6 · isaac-sim/IsaacLab

Toni-SM · 2023-01-22T15:11:36Z

Description

Integration of the skrl reinforcement learning library

Adds a wrapper function to wrap an IsaacEnv for skrl compatibility
Adds a trainer class that also logs episode information
Adds train.py and play.py in workflows to run skrl
Adds configuration files for the currently included environments, with parameters matching those of rl_games as closely as possible

Dependencies:

skrl >= 0.10.0: https://github.com/Toni-SM/skrl

Type of change

New feature (non-breaking change which adds functionality)
This change requires a documentation update

Checklist

I have run the pre-commit checks with pre-commit run --all-files (see here for instructions to set it up)
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
I have updated the changelog and the corresponding version in the extension's config/extension.toml file


Mayankm96 · 2023-01-22T21:29:27Z

Thanks a lot for the pull request! I'll take a look at it tomorrow.

Can you please also update the changelog and the version in the config/extension.toml file of omni.isaac.orbit_envs?

https://isaac-orbit.github.io/orbit/source/refs/contributing.html#maintaining-a-changelog

Toni-SM · 2023-01-22T22:02:14Z

Done, CHANGELOG and extension.toml files updated!!!

Mayankm96 · 2023-01-25T01:08:43Z

Hi!

I finally got time today to test this out. I have yet to try training on the classic environments, but it would be nice to run some evaluations with the same PPO configuration across the frameworks.

So far, I have mostly tried training on the reach environment. I have the following comments.

Issues

Currently, while training the reach environment, I am seeing a divergence in the loss curve of the value function. Is it possible that the parameters are not tuned? I suppose there might be tiny differences in the algorithmic implementations, but I don't think the value function should be diverging. One possible explanation could be that the number of learning iterations is too large and that is causing some kind of collapse.
The learned policy for reach does not seem very stable (I tried best_agent.pt). It is quite shaky, which I don't see when I train with rsl_rl. Again, the parameters may not be the same, and that could be the problem.

skrl.mp4
Currently, play.py uses the runner class to run the agent-env interaction.
- When I hit the STOP icon on the GUI, it just freezes and doesn't exit the script quickly. Is there a possible issue in the trainer that doesn't let it exit?
- When I hit the X icon on the GUI, I observe similar behavior.
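One way to keep a playback script responsive is to step the environment manually and poll an application-level running flag on every iteration, instead of handing control to a trainer for a fixed number of steps. The sketch below shows only that pattern. `FakeApp`, `FakeEnv`, and `play` are hypothetical stand-ins for illustration and are not part of Orbit or skrl.

```python
class FakeApp:
    """Hypothetical stand-in for a simulation app that the user can stop."""

    def __init__(self, frames_until_stop):
        self._frames_left = frames_until_stop

    def is_running(self):
        # Each poll consumes one frame; returns False once the app is "stopped".
        self._frames_left -= 1
        return self._frames_left >= 0


class FakeEnv:
    """Hypothetical stand-in environment with a scalar observation."""

    def __init__(self):
        self.closed = False

    def reset(self):
        return 0.0

    def step(self, action):
        # Return the next observation only; rewards are irrelevant here.
        return action + 1.0

    def close(self):
        self.closed = True


def play(app, env, policy):
    """Step the environment until the app stops, then always close it."""
    obs = env.reset()
    steps = 0
    try:
        # The running flag is checked before every step, so a STOP or
        # window-close request takes effect within one iteration.
        while app.is_running():
            obs = env.step(policy(obs))
            steps += 1
    finally:
        env.close()
    return steps
```

Because the flag is checked on each iteration and the environment is closed in a `finally` block, the loop exits promptly and releases resources however it ends.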

Nice to haves

Is it possible to also log the episode information in the runner? Often in robotics, we care about how each reward term is evolving (for weight tuning). The cumulative reward is often hard to interpret when there are many reward terms.
Can play.py just load the last or best checkpoint by default? We do something similar for rsl_rl (code snippet).
An updated example of how to use a learned policy outside the runner loop. This should usually be simple, since it is just a torch model that needs to be loaded, but it would be great to have an example showing it. It comes in quite handy when deploying a policy in a complete robot system (in this case, an extension).
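As an illustration of the "load the last or best checkpoint by default" request, here is a minimal sketch that resolves a checkpoint path from a directory. It assumes checkpoints are `.pt` files in a single folder and that the best one is named `best_agent.pt`, as mentioned above. The function names and the directory layout are hypothetical and are not taken from the PR.

```python
from pathlib import Path


def latest_checkpoint(checkpoint_dir, pattern="*.pt"):
    """Return the most recently modified checkpoint in a directory, or None."""
    candidates = [p for p in Path(checkpoint_dir).glob(pattern) if p.is_file()]
    if not candidates:
        return None
    return max(candidates, key=lambda p: p.stat().st_mtime)


def resolve_checkpoint(checkpoint_dir, explicit=None, prefer_best=False):
    """Pick a checkpoint: explicit path first, then best, then latest."""
    if explicit is not None:
        # A path given by the user always wins.
        return Path(explicit)
    best = Path(checkpoint_dir) / "best_agent.pt"  # assumed file name
    if prefer_best and best.is_file():
        return best
    return latest_checkpoint(checkpoint_dir)
```

A script could call `resolve_checkpoint(log_dir, explicit=args.checkpoint)` so that omitting the command-line argument falls back to the newest file.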

Additional comments

We will also need to add the license of skrl inside the docs/licenses/dependencies directory for book-keeping purposes.

Toni-SM · 2023-01-25T17:26:43Z

Hi @Mayankm96

Comments:

Loss curve divergence of the value function should not be a problem; rl_games also diverges. I checked the rsl_rl PPO implementation, and there is a small difference in the way the KL divergence and the learning rate scheduler are computed. Some training parameters are also different.
I mapped (as far as possible) the same parameters as the .yaml files for rl_games. The difference in policy stability comes mostly from the number of timesteps used compared to rsl_rl:
- skrl: 8000
- rl_games: 16 * 500 = 8000
- rsl_rl: 64 * 250 = 16000
rsl_rl works just as "badly" as the others at 8000 timesteps.
Training for 16000 timesteps generates better curves.
Solved
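The timestep comparison above can be reproduced with a few lines of arithmetic. This sketch assumes that each product is "steps collected per iteration times number of iterations"; that reading of the two factors is an assumption, and `total_timesteps` is a hypothetical helper.

```python
def total_timesteps(steps_per_iteration, iterations):
    """Total environment timesteps for a run (assumed interpretation)."""
    return steps_per_iteration * iterations


# Numbers quoted in the comment above.
budgets = {
    "skrl": 8000,
    "rl_games": total_timesteps(16, 500),
    "rsl_rl": total_timesteps(64, 250),
}

for name, steps in budgets.items():
    ratio = steps / budgets["skrl"]
    print(f"{name}: {steps} timesteps ({ratio:.1f}x skrl)")
```

With these values, the rsl_rl run sees twice as many timesteps as the skrl and rl_games runs, which is the imbalance the comment points to.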

Nice to haves

Log the episode information:
Done
Load the last or best checkpoint by default:
Done
An updated example of how to use a learned policy outside the runner loop:
It sounds great. I am going to create an example for future pull requests.

Additional comments

skrl license in docs:
Done

Mayankm96

Thanks a lot for providing more information. I just added some minor feedback on the code to understand things a bit better. Once that is cleared up, we can merge this awesome PR :)

Toni-SM · 2023-01-25T22:19:02Z

Done...

Summary:

Create a customized trainer in the file where the skrl wrapper is defined, in order to:
- log episode information during training
- simplify train.py and play.py
Remove logging in play.py
Move the import statements to the beginning of the config.py file
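To illustrate what logging episode information in a trainer can involve, here is a minimal sketch of an accumulator that averages per-term values between log writes. It assumes the environment reports those values in a dictionary under `infos["episode"]`; that key, the class name `EpisodeInfoLogger`, and its methods are hypothetical and do not reproduce the trainer added in this PR.

```python
from collections import defaultdict


class EpisodeInfoLogger:
    """Accumulate per-term episode values and return their means on flush."""

    def __init__(self):
        self._sums = defaultdict(float)
        self._counts = defaultdict(int)

    def record(self, infos):
        # Assumed layout: infos["episode"] maps term names to scalar values.
        for key, value in infos.get("episode", {}).items():
            self._sums[key] += float(value)
            self._counts[key] += 1

    def flush(self):
        """Return the mean of each term since the last flush, then reset."""
        means = {key: self._sums[key] / self._counts[key] for key in self._sums}
        self._sums.clear()
        self._counts.clear()
        return means
```

A training loop could call `record(infos)` after every environment step and pass the result of `flush()` to whatever scalar writer it uses at each logging interval, so that individual reward terms can be tracked alongside the cumulative reward.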

Mayankm96

The PR looks good to me now. Thanks a lot for working on skrl integration. It is great to have more libraries supported in the workflows.

Would love to try out other agents as well in skrl at some point and provide examples :)

Toni-SM · 2023-01-26T08:16:29Z

Great!
Let's work on integrating new agents in future pull requests!

By the way... The online documentation does not reflect any changes 🤔

Mayankm96 · 2023-01-26T08:39:39Z

It should be up now! :)

By the way, I did some minor refactoring in commit f7e4183:

renamed SkrlLogTrainer to SkrlSequentialLogTrainer
re-added the eval mode to the class for completeness
fixed some docstrings
set the agent to "eval" mode in the play.py script

Toni-SM · 2023-01-26T08:48:36Z

Nice... It looks much better now


Toni-SM added 15 commits January 19, 2023 17:04

Add environment wrapper for the skrl RL library

51a8694

Add training workflow for the skrl RL library

f3464af

Add training configuration .yaml files for the skrl RL library

49400fa

Add evaluation workflow for the skrl RL library

afb6444

Run pre-commit for all files

86852ec

Add skrl to RL extra dependencies

ed14985

Merge branch 'NVIDIA-Omniverse:main' into skrl-integration

55f2402

Improve skrl wrapper dosctring

f72ff95

Merge branch 'NVIDIA-Omniverse:main' into skrl-integration

2f522e9

Add skrl RL library integration to docs

7b7d256

Merge branch 'skrl-integration' of github.com:Toni-SM/Orbit into skrl-integration

3142f22

Add skrl RL library reference to Python code

27c554c

Update the skrl RL library example steps in docs

916e599

Allow creating shared models in skrl RL library workflow

110aa07

Update skrl RL library task hyperparameters

311870a

Mayankm96 added the enhancement New feature or request label Jan 22, 2023

Increase MINOR version and update CHANGELOG

7b28ef8

Toni-SM added 3 commits January 25, 2023 09:57

Merge branch 'NVIDIA-Omniverse:main' into skrl-integration

3fceaad

Add skrl LICENSE to docs dependencies

5a2bb3d

Run evaluation steps without trainer

7a39851

Mayankm96 self-requested a review January 25, 2023 10:36

Toni-SM added 6 commits January 25, 2023 11:47

Configure training/evaluation mode

4f497a2

Log custom environment data to Tensorboard

55cb5e1

Load last checkpoint during evaluation if not specified

2ae2e96

Merge branch 'NVIDIA-Omniverse:main' into skrl-integration

3ecba46

Update training hyperparameters

e50fc53

Update CHANGELOG date

ffbdb1e

Mayankm96 requested changes Jan 25, 2023

View reviewed changes

Toni-SM added 4 commits January 25, 2023 22:11

Move import statements to the beginning of the script

75d8c25

Simplify the training and evaluation scripts for the skrl RL library

bc7d140

Add customized trainer for logging episode information

64f0c97

Delete unused parameters from configuration files

c46dab9

Mayankm96 approved these changes Jan 26, 2023

View reviewed changes

Mayankm96 changed the title ~~Skrl integration~~ Integration of the skrl RL library Jan 26, 2023

Mayankm96 merged commit e9862d4 into isaac-sim:main Jan 26, 2023

Toni-SM deleted the skrl-integration branch August 18, 2023 08:06

isaaclab-review-bot Bot mentioned this pull request Apr 10, 2026

Adds surface deformable support and migrate deformable API to isaaclab_physx #5049

Merged

7 tasks

pv-nvidia mentioned this pull request Apr 23, 2026

feat: add fabric_read/fabric_write context managers to FabricFrameView #5382

Open


Mayankm96 merged 29 commits into isaac-sim:main from Toni-SM:skrl-integration
