Can't get results for LunarLander #4

TrentBrick · 2020-07-13T23:38:16Z

Hi thanks for sharing your code and implementation.

However, when running your notebook with the LunarLander-v2, even with 1,000 epochs it doesn't seem to be learning anything:

Can you share the hyperparameters necessary to reproduce the LunarLander?

Thank you.

BY571 · 2020-07-14T19:35:28Z

hello @TrentBrick !
Sure, here the hyperparameter:
"horizon_scale" : 0.01,
"return_scale" : 0.025,
"replay_size" : 500,
"n_warm_up_episodes" : 10,
"n_updates_per_iter" : 100,
"n_episodes_per_iter" : 20,
"last_few" : 75,
"batch_size" : 768,
"layer_size" : 128,
"learning_rate" : 1e-3

TrentBrick · 2020-07-17T00:03:58Z

Thanks for getting back to me and sharing these parameters. I just tried them but they also dont seem to be working for me.

BY571 · 2020-07-17T08:36:50Z

oh, I just noticed that there is an older network architecture. try to add two linear layers, it should work then.
Thanks for noticing, ill update the code!

TrentBrick · 2020-08-05T14:07:27Z

Just heads up I've moved on to using other repos for this. Spent too long trying to run yours. You should double check that everything now works though for future people that come across this repo.

tangzk · 2022-11-22T06:21:53Z

oh, I just noticed that there is an older network architecture. try to add two linear layers, it should work then. Thanks for noticing, ill update the code!

@BY571 Have you updated the latest code? I got results similar to TrentBrick in LunarLander-v2 env.

TrentBrick · 2022-12-15T19:08:50Z

@tangzk if you want to use my repo for this... https://github.com/TrentBrick/RewardConditionedUDRL

TrentBrick closed this as completed Aug 5, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can't get results for LunarLander #4

Can't get results for LunarLander #4

TrentBrick commented Jul 13, 2020

BY571 commented Jul 14, 2020

TrentBrick commented Jul 17, 2020

BY571 commented Jul 17, 2020

TrentBrick commented Aug 5, 2020

tangzk commented Nov 22, 2022

TrentBrick commented Dec 15, 2022

Can't get results for LunarLander #4

Can't get results for LunarLander #4

Comments

TrentBrick commented Jul 13, 2020

BY571 commented Jul 14, 2020

TrentBrick commented Jul 17, 2020

BY571 commented Jul 17, 2020

TrentBrick commented Aug 5, 2020

tangzk commented Nov 22, 2022

TrentBrick commented Dec 15, 2022