Main difference from DrQ? #1
Comments
RAD and DrQ are concurrent work (published 2 days apart). The main difference: in addition to data augmentation, DrQ modifies the underlying SAC algorithm by weighting the Q-functions (both Q and target Q). RAD does not modify the underlying algorithm at all; it achieves the same results with data augmentation alone and can plug and play with any RL algorithm (we also show that it works with PPO, with SOTA test-time generalization on ProcGen). RAD also extensively ablates a variety of data augmentations and provides insight into why random crop works well.
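To make the contrast concrete, here is a minimal NumPy sketch, not code from either repository. `random_crop` shows the kind of augmentation that is applied to a sampled batch before it goes into an otherwise unchanged update. `averaged_target` illustrates one way of combining target-Q estimates over several augmented views, assuming a plain average; the exact weighting DrQ uses is not spelled out in this thread. The function names, the 100-to-84 pixel crop size, and `k=2` are all assumptions chosen for illustration.

```python
import numpy as np

def random_crop(batch, out_size, rng):
    """Crop each image in a (B, C, H, W) batch at an independent random offset."""
    b, c, h, w = batch.shape
    tops = rng.integers(0, h - out_size + 1, size=b)
    lefts = rng.integers(0, w - out_size + 1, size=b)
    out = np.empty((b, c, out_size, out_size), dtype=batch.dtype)
    for i, (t, l) in enumerate(zip(tops, lefts)):
        out[i] = batch[i, :, t:t + out_size, l:l + out_size]
    return out

def averaged_target(q_target_fn, next_obs, out_size, rng, k=2):
    """Average a target-Q estimate over k augmented views (assumed plain mean).

    q_target_fn is a hypothetical callable mapping a (B, C, H, W) batch to
    a (B,) array of values; it stands in for a real target critic.
    """
    views = [q_target_fn(random_crop(next_obs, out_size, rng)) for _ in range(k)]
    return np.mean(views, axis=0)

rng = np.random.default_rng(0)
obs = rng.random((4, 3, 100, 100)).astype(np.float32)

# Augmentation only: crop the batch, then feed it to an unmodified update.
aug_obs = random_crop(obs, 84, rng)

# Augmentation plus a modified target: average over several cropped views.
toy_q = lambda x: x.mean(axis=(1, 2, 3))  # toy stand-in for a target critic
target = averaged_target(toy_q, obs, 84, rng, k=2)
```

In the first usage the learning algorithm never sees that augmentation happened, which is why the same step can sit in front of SAC or PPO. In the second, the augmentation is part of how the target itself is computed.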
@denisyarats When comparing DrQ with RAD, do you convert the 'step' in RAD eval.log to 'environment step'?
Thanks for sharing the code!
I wonder what's the main algorithmic difference between DrQ-SAC and RAD-SAC? You only mentioned DrQ in passing in the paper, but didn't elaborate. Thanks!