Join GitHub today
GitHub is home to over 31 million developers working together to host and review code, manage projects, and build software together.Sign up
- Name: Pendulum-v0
- Category: Classic Control
- Leaderboard Page
- Old links:
Try to keep a frictionless pendulum standing up.
The precise equation for reward:
-(theta^2 + 0.1*theta_dt^2 + 0.001*action^2)
Theta is normalized between -pi and pi. Therefore, the lowest cost is
-(pi^2 + 0.1*8^2 + 0.001*2^2) = -16.2736044, and the highest cost is
0. In essence, the goal is to remain at zero angle (vertical), with the least rotational velocity, and the least effort.
Random angle from -pi to pi, and random velocity between -1 and 1
There is no specified termination. Adding a maximum number of steps might be a good idea.
None yet specified