Skip to content

viv92/convoptfinal

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Trust Region Policy Optimization

Description

This is the implementation code for Convex Optimization course project on "Reformulation and Analysis of Trust Region Policy Optimization" with its application on optimizing an industrial operation using a discrete event simulator. The final report can be found here (relative link)

Run command:

python DynaFork_Online_TRPO.py

Dependencies:

  • Python 2.7
  • Tensorflow 1.12.0

About

Convex Optimization course project on reformulation and analysis of Trust Region Policy Optimization (TRPO)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages