Skip to content

Using MS-Pacman visual input, training a reinforcement learning agent using proximal policy optimization algorithm

Notifications You must be signed in to change notification settings

jadhavhninad/Analysis-of-Proximal-Policy-Optimization-using-OpenAI-gym

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Analysis of Proximal Policy Optimization algorithm Using OpenAI Gym

Goals:

  • Comparing the algorithm performance with other baseline techniques for OpenAI game environment
  • Exploring performance based on input data preprocessing , using different Neural Network architectures & CPU vs GPU training
  • Modifying different hyperparameters to analyze their impact on the overall performance of the algorithm

Implementation:

  • The model is developed using TensorFlow and input data is collected from OpenAI GYM's MS-PACMAN environment.

  • Performance of different neural network architectures is explored:

"CNN vs LSTM - Reward function"

Output:

  • Different models based on the modified hyperparemeters, CPU training & GPU training.
  • Performance comparison(rewards & loss function) plots.

About

Using MS-Pacman visual input, training a reinforcement learning agent using proximal policy optimization algorithm

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages