O-LSPI

A version of optimistic least-squares policy iteration (LSPI) for the classic discrete-time linear quaratic regulation (LQR) problem published in paper:

Bo Pang, and Zhong-Ping Jiang. "Robust reinforcement learning: A case study in linear quadratic regulation." Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 35. No. 10. 2021.

O-LSPI.m

Implements the main O-LSPI algorithm.

func_data_collect.m

Implements the data collection step for the learning algorithm.

data_collect_Noise_Mag.m & Noise_Mag_exp.m

Collects the data for the experiment in the paper.

draw_picture.m

Draws the Fig. 1 in the paper.

kronv.m, vec2sm.m & sm2vec.m

Auxilliary functions for vector/matrix conversions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Noise_Mag_exp.m

Noise_Mag_exp.m

O-LSPI.m

O-LSPI.m

README.md

README.md

data_collect_Noise_Mag.m

data_collect_Noise_Mag.m

draw_picture.m

draw_picture.m

func_data_collect.m

func_data_collect.m

kronv.m

kronv.m

sm2vec.m

sm2vec.m

vec2sm.m

vec2sm.m

Repository files navigation

O-LSPI

O-LSPI.m

func_data_collect.m

data_collect_Noise_Mag.m & Noise_Mag_exp.m

draw_picture.m

kronv.m, vec2sm.m & sm2vec.m

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Noise_Mag_exp.m		Noise_Mag_exp.m
O-LSPI.m		O-LSPI.m
README.md		README.md
data_collect_Noise_Mag.m		data_collect_Noise_Mag.m
draw_picture.m		draw_picture.m
func_data_collect.m		func_data_collect.m
kronv.m		kronv.m
sm2vec.m		sm2vec.m
vec2sm.m		vec2sm.m

bo-pang/O-LSPI

Folders and files

Latest commit

History

Repository files navigation

O-LSPI

O-LSPI.m

func_data_collect.m

data_collect_Noise_Mag.m & Noise_Mag_exp.m

draw_picture.m

kronv.m, vec2sm.m & sm2vec.m

About

Resources

Stars

Watchers

Forks

Languages