[RLLib] MAML extension for all models except RNNs #11337
Very cool! Thanks for fixing some of these limitations.
Please change the import order.
Is this ready to merge?
Yes, I think so.
Looks like maml and mbmpo tests are failing.
Looks like higher should be an optional import... I'll fix it.
This should work now.
@michaelzhiluo I think what you need to do is move the import into the class itself instead of pulling it in at top level. It seems like higher is importing torch automatically.
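The suggestion above (importing inside the class rather than at top level) can be sketched as follows. This is a minimal illustration, not the code from this PR: the class name `LazyDependencyUser` and the `module_name` parameter are hypothetical, and the module name is a parameter only so the pattern can be exercised without the optional package installed.

```python
import importlib


class LazyDependencyUser:
    """Hypothetical class that needs an optional package only when instantiated."""

    def __init__(self, module_name):
        # The import happens here, at construction time. Importing the file
        # that defines this class therefore never pulls in the optional
        # package (or anything that package imports in turn).
        try:
            self.lib = importlib.import_module(module_name)
        except ImportError as exc:
            raise ImportError(
                f"The optional dependency '{module_name}' is required to use "
                f"{type(self).__name__}; install it to enable this feature."
            ) from exc


# A module that is available is imported on first use.
user = LazyDependencyUser("json")
print(user.lib.__name__)

# A missing module only fails when the class is actually constructed.
try:
    LazyDependencyUser("surely_not_installed_module_xyz")
except ImportError as exc:
    print("deferred failure:", exc)
```

With this layout, a dependency check that only imports the defining module should not see the optional package at all, because nothing is imported until an instance is created.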
@ericl Makes sense; the PyTorch MAML tests will fail, so I'll go ahead and remove that test for now.
Hey, let's get this merged. Only test_dependency_torch is causing problems now. Everything else looks good.
…to maml_higher
Co-authored-by: Sven Mika <sven@anyscale.io>
@ericl I think it should be good to go. There are no errors for imports!
# tags = ["agents_dir"],
# size = "medium",
# srcs = ["agents/mbmpo/tests/test_mbmpo.py"]
#)
Shouldn't this test be put back now?
It turns out that MBMPO depends on PyTorch MAML, which depends on higher.
Question on the test removal
@ericl Added an explanation of why the MBMPO tests are removed.
Why are these changes needed?
We now use higher, an API from Facebook for higher-order gradients. This lets the meta-update be extended to many different types of models.
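To illustrate why a meta-update needs higher-order gradients, here is a small worked example in plain Python. It is a sketch under stated assumptions, not the code in this PR and not the higher API: the model is a single scalar parameter, the inner and outer losses are hypothetical quadratics, and all function names are made up for illustration. The meta-gradient differentiates through the inner gradient step, so it picks up a factor that involves the second derivative of the inner loss.

```python
def inner_update(theta, a, alpha):
    """One gradient step on the (assumed) inner loss (theta - a)**2."""
    inner_grad = 2.0 * (theta - a)
    return theta - alpha * inner_grad


def outer_loss(theta, a, b, alpha):
    """(Assumed) outer loss (theta' - b)**2, evaluated at the adapted parameter."""
    adapted = inner_update(theta, a, alpha)
    return (adapted - b) ** 2


def meta_gradient(theta, a, b, alpha):
    """Exact derivative of outer_loss with respect to the initial theta."""
    adapted = inner_update(theta, a, alpha)
    inner_hessian = 2.0  # second derivative of (theta - a)**2
    # Chain rule: d(adapted)/d(theta) = 1 - alpha * inner_hessian
    return 2.0 * (adapted - b) * (1.0 - alpha * inner_hessian)


def first_order_gradient(theta, a, b, alpha):
    """Approximation that ignores the second-derivative term."""
    adapted = inner_update(theta, a, alpha)
    return 2.0 * (adapted - b)


def finite_difference(theta, a, b, alpha, eps=1e-6):
    """Numerical check of the meta-gradient via a central difference."""
    up = outer_loss(theta + eps, a, b, alpha)
    down = outer_loss(theta - eps, a, b, alpha)
    return (up - down) / (2.0 * eps)


theta, a, b, alpha = 1.0, 0.0, 2.0, 0.1
print("exact meta-gradient:", meta_gradient(theta, a, b, alpha))
print("finite difference:  ", finite_difference(theta, a, b, alpha))
print("first-order approx: ", first_order_gradient(theta, a, b, alpha))
```

The exact meta-gradient agrees with the finite-difference check, while the first-order version differs by the factor (1 - alpha * inner_hessian). For a neural network that factor becomes a Hessian-vector product, which is what a library for differentiable inner-loop updates computes automatically.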