Skip to content

Is the implementation of the ipo algorithm in the project based on a single constraint? #258

Answered by Gaiejj
moodybluesf asked this question in Q&A
Discussion options

You must be logged in to vote

Thanks for your support to OmniSafe. Currently OmniSafe's IPO algorithm only supports a single constraint. If you need to use multiple constraints, you can try following steps:

  • Set the output size of the cost_critic hidden layer to the number of constraint types you need. (or use multiple single output cost_critic).
  • Modify omnisafe/adapter/on_policy_adapter, omnisafe/env/wrapper to receive multiple costs.
  • Add the corresponding storage key value to the buffer.
  • Compute the advantage function for multiple constraints.
  • Adapt IPO.py file to multiple cost version. You can define multiple penalty in _compute_adv_surrogate to match the multiple cost advantage function.

We only provide the above…

Replies: 3 comments 5 replies

Comment options

You must be logged in to vote
3 replies
@moodybluesf
Comment options

@7tosmoke
Comment options

@Gaiejj
Comment options

Answer selected by moodybluesf
Comment options

You must be logged in to vote
1 reply
@moodybluesf
Comment options

Comment options

You must be logged in to vote
1 reply
@moodybluesf
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants