A pole is attached by an un-actuated joint to a cart, which moves along a frictionless track. The system is controlled by applying a force of +1 or -1 to the cart. The pendulum starts upright, and the goal is to prevent it from falling over. A reward of +1 is provided for every timestep that the pole remains upright. The episode ends when the pole is more than 15 degrees from vertical, or the cart moves more than 2.4 units from the center.
-
Notifications
You must be signed in to change notification settings - Fork 0
A pole is attached by an un-actuated joint to a cart, which moves along a frictionless track. The system is controlled by applying a force of +1 or -1 to the cart. I solved it with the help of Deep_Q learning with Experience Replay and also with a variant of Reinforce Policy Gradient algorithm
Hojjat-Mokhtarabadi/CartPole
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
A pole is attached by an un-actuated joint to a cart, which moves along a frictionless track. The system is controlled by applying a force of +1 or -1 to the cart. I solved it with the help of Deep_Q learning with Experience Replay and also with a variant of Reinforce Policy Gradient algorithm
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published