[Hyper param tuning - early stop / green penalty] #68

xeviknal · 2021-04-18T18:33:24Z

Adding hyperparameter tuning with extra wrappers:

early-stop - end the episode when the average reward over the latest N steps is negative (similar to curriculum learning)
green-penalty - reduce the reward when the car is on the grass (reward of -0.15 instead of -0.1).
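
For readers who want a concrete picture of the two wrappers, here is a minimal sketch. It is not the code from this PR. It assumes the classic 4-tuple `step` API (`obs, reward, done, info`) and wraps any object exposing `reset()` and `step()`, so it runs without Gym installed. The class names, the `window` and `extra_penalty` parameters, the `early_stop` info key, and the `is_on_grass` callback are all hypothetical; how the real wrapper detects grass from the observation is not shown in this description.

```python
from collections import deque


class EarlyStopWrapper:
    """End the episode once the mean reward of the last `window` steps is negative."""

    def __init__(self, env, window=50):
        self.env = env
        self.window = window  # hypothetical default; the PR tunes N
        self.recent = deque(maxlen=window)

    def reset(self):
        # Forget rewards from the previous episode.
        self.recent.clear()
        return self.env.reset()

    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        self.recent.append(reward)
        # Only judge once the window is full, so a slow start is not cut short.
        if len(self.recent) == self.window and sum(self.recent) / self.window < 0:
            done = True
            info = dict(info, early_stop=True)  # hypothetical flag for logging
        return obs, reward, done, info


class GreenPenaltyWrapper:
    """Subtract an extra penalty from the reward while the car is on the grass."""

    def __init__(self, env, is_on_grass, extra_penalty=0.05):
        self.env = env
        # Hypothetical callback: takes an observation, returns True on grass.
        self.is_on_grass = is_on_grass
        # Assumes a base reward of -0.1, so -0.1 - 0.05 gives the -0.15 above.
        self.extra_penalty = extra_penalty

    def reset(self):
        return self.env.reset()

    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        if self.is_on_grass(obs):
            reward -= self.extra_penalty
        return obs, reward, done, info
```

The wrappers compose, e.g. `EarlyStopWrapper(GreenPenaltyWrapper(env, is_on_grass), window=50)`. In that order the early-stop window sees the penalized rewards, so time spent on the grass also ends the episode sooner.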

See the tuning trials in the following comments.

xeviknal · 2021-04-18T18:46:18Z

First experiment:

xeviknal · 2021-04-18T18:52:48Z

Second experiment:

xeviknal · 2021-04-18T19:33:43Z

Third experiment:

xeviknal added 8 commits April 15, 2021 00:06

Adding early stop wrapper

4da0c9f

Preparing tuning experiments

d33ce4c

Reward system change: -0.15 reward when car is mainly in the green side

ec37078

Add reporting to tune + recording video

0480872

Change experiment name

dfaadca

Fixup of the early-step wrapper

879efaa

Changing epsilon

8c0acd6

Giving more space to train

95b1f6e
