
Sparse Reward Environments #5

Closed
bhairavmehta95 opened this issue Mar 29, 2018 · 3 comments

Comments

@bhairavmehta95

Did you happen to see SAC's performance on sparse-reward environments?

I know the DIAYN paper trained on sparse rewards, but I was wondering if vanilla SAC (in your expts) had any luck solving things like Continuous MountainCar.

@haarnoja
Owner

We haven't tried sparse-reward environments with vanilla SAC. My intuition is that it will not work any better than other RL algorithms with Gaussian/Boltzmann exploration, because the exploration noise lacks temporal correlation.
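
For readers unfamiliar with the term, here is a rough sketch of what "temporal correlation in the exploration noise" can mean. It is not taken from the SAC codebase. It compares independent Gaussian noise (each step drawn with no memory of the previous one) against a discretized Ornstein-Uhlenbeck process, which is one common way to get correlated noise and is the kind DDPG used. The function names and the `theta`/`sigma` values are assumptions chosen for illustration only.

```python
import random


def iid_gaussian_noise(n_steps, sigma=1.0, seed=0):
    """Independent Gaussian noise: each sample ignores the previous one."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, sigma) for _ in range(n_steps)]


def ou_noise(n_steps, theta=0.05, sigma=1.0, seed=0):
    """Discretized Ornstein-Uhlenbeck noise (illustrative parameters):
    x_{t+1} = (1 - theta) * x_t + sigma * eps_t, with eps_t ~ N(0, 1)."""
    rng = random.Random(seed)
    x, out = 0.0, []
    for _ in range(n_steps):
        x = (1.0 - theta) * x + sigma * rng.gauss(0.0, 1.0)
        out.append(x)
    return out


def lag1_autocorrelation(xs):
    """Sample correlation between each noise value and the next one."""
    mean = sum(xs) / len(xs)
    num = sum((a - mean) * (b - mean) for a, b in zip(xs, xs[1:]))
    den = sum((a - mean) ** 2 for a in xs)
    return num / den


def longest_same_sign_run(xs):
    """Longest stretch of consecutive steps that push in the same direction."""
    best = run = 1
    for a, b in zip(xs, xs[1:]):
        run = run + 1 if (a > 0) == (b > 0) else 1
        best = max(best, run)
    return best


if __name__ == "__main__":
    iid = iid_gaussian_noise(5000)
    ou = ou_noise(5000)
    print("lag-1 autocorrelation  iid:", round(lag1_autocorrelation(iid), 3))
    print("lag-1 autocorrelation  OU :", round(lag1_autocorrelation(ou), 3))
    print("longest same-sign run  iid:", longest_same_sign_run(iid))
    print("longest same-sign run  OU :", longest_same_sign_run(ou))
```

Independent noise changes sign roughly every other step, so the perturbations largely cancel out over time. Correlated noise tends to push in one direction for many consecutive steps, which is the kind of sustained action a task like Continuous MountainCar needs before any reward is seen.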

@bhairavmehta95
Author

Gotcha; that's what we seem to be seeing, but just wanted to make sure!

@ethanabrooks

Could you clarify what you mean by temporal correlation in the exploration noise? Thanks.

hartikainen added a commit that referenced this issue Feb 24, 2019
3 participants