buzzpranav/cartpole-ppo-ai

cartpole-ppo-ai

Description


The Jupyter notebook trains and evaluates an agent in the CartPole-v0 (OpenAI Gym) environment using the Proximal Policy Optimization (PPO) algorithm.
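As a rough illustration of the idea behind PPO, here is a minimal sketch of its clipped surrogate objective in plain Python. The function name `clipped_surrogate` and the default `clip_eps=0.2` are assumptions made for this example; they are not taken from the notebook, which may compute the objective differently (for instance with a deep learning framework).

```python
def clipped_surrogate(ratios, advantages, clip_eps=0.2):
    """Mean PPO clipped surrogate objective over a batch (to be maximized).

    ratios:     new_policy_prob / old_policy_prob for each sampled action
    advantages: advantage estimate for each sampled action
    clip_eps:   clipping range (hypothetical default, not from the notebook)
    """
    if len(ratios) != len(advantages):
        raise ValueError("ratios and advantages must have the same length")
    if not ratios:
        raise ValueError("batch must not be empty")

    total = 0.0
    for ratio, advantage in zip(ratios, advantages):
        # Restrict the probability ratio to [1 - eps, 1 + eps].
        clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * advantage, clipped_ratio * advantage)
    return total / len(ratios)


if __name__ == "__main__":
    # A large ratio with a positive advantage is capped by the clip range.
    print(clipped_surrogate([1.5], [1.0]))
    # A large ratio with a negative advantage is not capped (pessimistic bound).
    print(clipped_surrogate([1.5], [-1.0]))
```

Taking the minimum of the clipped and unclipped terms removes the incentive to move the new policy far from the old one in a single update, which is the property that gives PPO its name.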

A reward of +1 is provided for every step taken, including the termination step. The state space has 4 dimensions: cart position, cart velocity, pole angle, and pole velocity at the tip. Given this information, the agent has to learn how to select the best actions. Two discrete actions are available, corresponding to:

0 - 'Push cart to the left'
1 - 'Push cart to the right'

For more details about the CartPole environment, see https://github.com/openai/gym/wiki/CartPole-v0
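To make the reward structure concrete, the sketch below computes discounted returns for a short episode in which every step earns +1. It uses only the standard library; the helper name `discounted_returns`, the `ACTIONS` mapping, and the discount factor values are assumptions for illustration and are not taken from the notebook.

```python
# Action labels as listed in the description above.
ACTIONS = {0: "Push cart to the left", 1: "Push cart to the right"}


def discounted_returns(rewards, gamma=0.99):
    """Return the discounted reward-to-go for each step of one episode.

    gamma is a hypothetical discount factor chosen for this example.
    """
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step reuses the return of the step after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


if __name__ == "__main__":
    # An episode in which the pole stayed up for 5 steps, at +1 per step.
    episode_rewards = [1.0] * 5
    # With +1 per step, the undiscounted return equals the episode length.
    print(sum(episode_rewards))
    print(discounted_returns(episode_rewards, gamma=0.5))
```

Because each step is worth the same reward, maximizing the return amounts to keeping the pole balanced for as many steps as possible.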
