Make Continuous Mountain Car Environment same as Gym implementation #1394

sshkhr · 2018-05-15T11:22:37Z

This PR make the contiunous Mountain Car Environment streamlined with the implementation in OpenAI Gym (https://github.com/openai/gym/blob/master/gym/envs/classic_control/continuous_mountain_car.py) . The changes are :

Added a goal position
The terminal state is reached when car crosses goal position (earlier it was set as when car reaches maximum allowed position)
The reward for reaching the terminal state is 100 - cost of action (earlier it was just 100)
The next state calculation is more configurable via introducing the power member variable (This was requested by @zoq in Added Continuous Mountain Car to Reinforcement Learning Environment #1368 but not addressed)

I have also added my name to list of contributors following #1388

zoq · 2018-05-16T08:34:25Z

src/mlpack/methods/reinforcement_learning/environment/continuous_mountain_car.hpp

@@ -96,13 +96,17 @@ class ContinuousMountainCar
   * @param velocityMax Maximum legal velocity.
   */
  ContinuousMountainCar(const double positionMin = -1.2,
-                        const double positionMax = 0.5,
+                        const double positionMax = 0.6,


Can you add a comment for each new parameter? Also I think you did some good additions to this file, so feel free to add yourself as another author.

zoq · 2018-05-16T08:38:05Z

src/mlpack/methods/reinforcement_learning/environment/continuous_mountain_car.hpp

-      return 100.0;
-    return -pow(action.action[0], 2)*0.1;
+      reward = 100.0;
+    reward -= std::pow(action.action[0], 2) * 0.1;


Don't think this makes a huge difference, but I guess it's a good idea to get the same results as the gym env.

rcurtin

No comments from my side; thanks for the contribution! :)

zoq · 2018-05-17T11:24:32Z

@mlpack-jenkins test this please

zoq · 2018-05-20T01:57:28Z

Thanks again for the contribution 👍

sshkhr added 3 commits May 15, 2018 16:06

Added name to contributors

315839f

Fix Continuous Mountain Car environment

2d5c8e9

Fixed style issue

ea69b9a

zoq reviewed May 16, 2018

View reviewed changes

sshkhr added 2 commits May 16, 2018 16:23

Added description for parameters

46863e9

Added name to authors

02c6899

rcurtin approved these changes May 16, 2018

View reviewed changes

zoq merged commit 095f784 into mlpack:master May 20, 2018

sshkhr deleted the rl2 branch May 20, 2018 08:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make Continuous Mountain Car Environment same as Gym implementation #1394

Make Continuous Mountain Car Environment same as Gym implementation #1394

sshkhr commented May 15, 2018

zoq May 16, 2018

zoq May 16, 2018

rcurtin left a comment

zoq commented May 17, 2018

zoq commented May 20, 2018

Make Continuous Mountain Car Environment same as Gym implementation #1394

Make Continuous Mountain Car Environment same as Gym implementation #1394

Conversation

sshkhr commented May 15, 2018

zoq May 16, 2018

Choose a reason for hiding this comment

zoq May 16, 2018

Choose a reason for hiding this comment

rcurtin left a comment

Choose a reason for hiding this comment

zoq commented May 17, 2018

zoq commented May 20, 2018