Reinforcement learning for Control

Stochastic Policy Search

Policy network

In the same way as we used data-driven optimization to tune the gains $K_P,K_I,K_D$ in the PID controllers, we can use the same approach to tune (or train) the parameters (or weights) $\boldsymbol{\theta}$ of a neural network.

In the first part a relatively simple neural network controller in PyTorch is built.

Reinforce algorithm

In order to implement and apply the Reinforce algorithm, the following steps are performed:

Create a policy network that uses transfer learning
Create an auxiliary function that selects control actions out of the distribution
Create an auxilary function that runs multiple episodes per epoch
Finally, put all the pieces together into a function that computes the Reinforce algorithm

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
figure		figure
README.md		README.md
cstr_model.py		cstr_model.py
plotting.py		plotting.py
reinforcement.py		reinforcement.py
stochastic_search.py		stochastic_search.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement learning for Control

Stochastic Policy Search

Policy network

Reinforce algorithm

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Reinforcement learning for Control

Stochastic Policy Search

Policy network

Reinforce algorithm

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages