PG4LQR

This is an implementation of a model-free policy gradient algorithm for the Linear Quadratic Regulator (LQR) problem.

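For context, this is the standard discrete-time LQR objective that such a model-free policy gradient method optimizes over a static linear policy u_t = -K x_t; since only sampled costs are used, the system matrices A and B never need to be known explicitly (the formulation below is the textbook one, not copied from this repository):

```math
x_{t+1} = A x_t + B u_t, \qquad u_t = -K x_t, \qquad
C(K) = \mathbb{E}_{x_0 \sim \mathcal{D}} \Bigl[ \sum_{t=0}^{\infty} \bigl( x_t^\top Q x_t + u_t^\top R u_t \bigr) \Bigr]
```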
The structure of this project is as follows:

PG4LQR
|-- dynamics.py: implementation of the LQR system
|-- lqr.py: implementation of the model-free algorithm, plus an optional Adam optimizer
|-- parallel_lqr.py: the multi-process version (the algorithm is the same as in lqr.py)
|-- plot.py: plots figures from saved results
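As a rough illustration of what the LQR simulator in dynamics.py might look like (the class and method names here are hypothetical, not the repository's actual API), rolling out a linear policy u = -K x and summing the quadratic stage costs gives the model-free cost oracle that the policy gradient algorithm queries:

```python
import numpy as np

class LQRSystem:
    """Discrete-time LQR simulator: x_{t+1} = A x_t + B u_t with stage
    cost x' Q x + u' R u. A hypothetical sketch, not the repo's dynamics.py."""

    def __init__(self, A, B, Q, R, horizon=100):
        self.A, self.B, self.Q, self.R = A, B, Q, R
        self.horizon = horizon

    def rollout_cost(self, K, x0):
        """Roll out the linear policy u = -K x from x0 and return the total cost."""
        x, total = np.asarray(x0, dtype=float), 0.0
        for _ in range(self.horizon):
            u = -K @ x
            total += x @ self.Q @ x + u @ self.R @ u
            x = self.A @ x + self.B @ u
        return total

def average_cost(system, K, init_states):
    """Monte-Carlo estimate of C(K) over sampled initial states x0 ~ D."""
    return float(np.mean([system.rollout_cost(K, x0) for x0 in init_states]))
```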

Usage

python lqr.py --action_dim xx --state_dim xx --lr xx --epoch xx --r xx --natural
  • action_dim: dimension of the inputs (actions)
  • state_dim: dimension of the states
  • lr: step size, also known as the learning rate
  • epoch: total number of training iterations
  • r: the smoothing parameter, i.e. the perturbation radius used in the gradient estimate (see the sketch below)
  • natural: pass this flag to use natural policy gradient (plain gradient descent is the default)

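The smoothing parameter r is the radius of the random perturbations applied to K in the zeroth-order gradient estimate, and --natural additionally preconditions that estimate by the inverse of the state correlation matrix, following the standard smoothing-based scheme for LQR (Fazel et al., 2018). The sketch below shows the idea; the rollout function and its return values are assumptions, not the repository's exact interface:

```python
import numpy as np

def estimate_gradient(rollout, K, r, num_samples=100, natural=False):
    """Smoothing-based (zeroth-order) estimate of the policy gradient at K.

    `rollout(K_perturbed)` is assumed to return (total_cost, state_corr),
    where state_corr = sum_t x_t x_t^T along the trajectory; only sampled
    costs are used, so the dynamics matrices A and B are never accessed.
    """
    d1, d2 = K.shape                          # action_dim x state_dim
    dim = d1 * d2
    grad = np.zeros_like(K)
    sigma = np.zeros((d2, d2))                # state correlation, for natural PG only
    for _ in range(num_samples):
        U = np.random.randn(d1, d2)
        U *= r / np.linalg.norm(U)            # uniform direction, Frobenius norm r
        cost, corr = rollout(K + U)
        grad += (dim / (num_samples * r ** 2)) * cost * U
        sigma += corr / num_samples
    if natural:
        grad = grad @ np.linalg.inv(sigma)    # natural PG: right-multiply by Sigma_K^{-1}
    return grad

# One update with step size lr (what the --lr flag would control):
# K = K - lr * estimate_gradient(rollout, K, r, natural=True)
```

In this picture, --r sets the perturbation radius, --lr the step size of the update K ← K − lr · grad, and --natural toggles the Σ_K⁻¹ preconditioning.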
Currently, the single-process version is more efficient, so the parallel version can safely be ignored.
