[4/11/2022[: testing and implementing various algorithms like PPO, IMPALA
[4/22/2022]: added basic reward shaping function to PPO which was used as a baseline
[5/20/2022]: added ARS reward shaping and curriculum learning
himadrir/grf-test-algorithms
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|