Sparse Reward Environments #5

bhairavmehta95 · 2018-03-29T22:02:43Z

Did you happen to see SAC's performance on sparse-reward environments?

I know the DIAYN paper trained on sparse rewards, but I was wondering if vanilla SAC (in your experiments) had any luck solving things like Continuous MountainCar.

haarnoja · 2018-03-30T23:23:45Z

We haven't tried sparse-reward environments with vanilla SAC. My intuition is that it will not work any better than other RL algorithms with Gaussian/Boltzmann exploration, because of the lack of temporal correlation in the exploration noise.
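
To make "temporal correlation in the exploration noise" concrete, here is a minimal sketch comparing noise drawn independently at every step with noise from an Ornstein-Uhlenbeck-style process. The OU-style process is only one common way to get correlated noise and is an assumption chosen for illustration, not something proposed in this thread or part of SAC. The function names and the `sigma` and `theta` values are hypothetical.

```python
import numpy as np


def iid_gaussian_noise(n_steps, sigma, rng):
    """Noise sampled independently at every step (no memory of earlier steps)."""
    return rng.normal(0.0, sigma, size=n_steps)


def ou_style_noise(n_steps, sigma, theta, rng):
    """Discrete OU-style noise: each sample is a decayed copy of the previous
    one plus a fresh Gaussian perturbation, so nearby steps are correlated."""
    noise = np.zeros(n_steps)
    for t in range(1, n_steps):
        noise[t] = (1.0 - theta) * noise[t - 1] + sigma * rng.normal()
    return noise


def lag1_autocorrelation(x):
    """Sample correlation between the sequence and itself shifted by one step."""
    centered = x - x.mean()
    return float(np.dot(centered[:-1], centered[1:]) / np.dot(centered, centered))


rng = np.random.default_rng(0)
n_steps = 10_000

# Hypothetical settings chosen only for illustration.
white = iid_gaussian_noise(n_steps, sigma=1.0, rng=rng)
correlated = ou_style_noise(n_steps, sigma=1.0, theta=0.05, rng=rng)

white_corr = lag1_autocorrelation(white)
ou_corr = lag1_autocorrelation(correlated)

print(f"lag-1 autocorrelation, i.i.d. Gaussian: {white_corr:.3f}")
print(f"lag-1 autocorrelation, OU-style:        {ou_corr:.3f}")
```

The i.i.d. samples should show a lag-1 autocorrelation near zero, while the OU-style samples should sit close to `1 - theta`. Uncorrelated noise tends to cancel itself out from one step to the next, whereas correlated noise keeps pushing in the same direction for a while, which is presumably what a task like Continuous MountainCar needs before the first reward is ever seen.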

bhairavmehta95 · 2018-03-31T15:25:20Z

Gotcha; that's what we seem to be seeing, but just wanted to make sure!

ethanabrooks · 2018-05-06T22:08:44Z

Could you clarify what you mean by temporal correlation in the exploration noise? Thanks.

bhairavmehta95 closed this as completed Mar 31, 2018

hartikainen added a commit that referenced this issue Feb 24, 2019

Merge pull request #5 from haarnoja/random-goal-swimmer

83f1508

Random goal swimmer
