Solving infinite-horizon POMDPs with memoryless stochastic policies in state-action space

This repository contains the code used for the experiments in the extended abstract Solving infinite-horizon POMDPs with memoryless stochastic policies in state-action space, presented at RLDM 2022 (see https://arxiv.org/abs/2205.14098; for a theoretical discussion of the geometry of the optimization problem, see also https://arxiv.org/abs/2110.07409). It includes an implementation of the presented method, reward optimization in state-action space (ROSA), as well as the two baselines used for comparison.

Overview of the contents:

utilities.jl: Contains implementations of basic functions such as the reward, a solver for the Bellman constrained program (BCP) proposed by Amato et al. (see http://people.csail.mit.edu/camato/publications/OptimalPOMDP-aimath05.pdf), and the reward optimization in state-action space (ROSA). Both BCP and ROSA rely on the interior point solver Ipopt (see https://coin-or.github.io/Ipopt/). The file also contains code for generating random solvable mazes (see also https://rosettacode.org/wiki/Maze_generation) and for automatically generating the transition and observation matrices of the model.
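Code for the experiments: The experiments are run from the Julia notebooks ROSA_Discount_Fixed.ipynb and ROSA_Size_Fixed.ipynb, together with their BCP_ and DPO_ counterparts (BCP_Discount_Fixed.ipynb, BCP_Size_Fixed.ipynb, DPO_Discount_Fixed.ipynb, DPO_Size_Fixed.ipynb).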
Mazes used for computations: The mazes used in the experiments, which were generated by the maze() function provided in utilities.jl, can be found in the mazes folder. Here, mazesn.csv contains a list of the solved mazes with 2n^2-1 states.
Code used for plotting: Not provided for now.
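
For orientation, the reward of a memoryless stochastic policy can be written as a linear function of its discounted state-action frequencies, which is the viewpoint behind ROSA. The following is a minimal sketch in plain Python, not the Julia code from utilities.jl. All function and variable names are hypothetical, the (1 - gamma) normalization is an assumed convention, and the fixed-point iteration is just one simple way to obtain the discounted state distribution.

```python
def effective_policy(beta, pi):
    """State policy tau[s][a] = sum_o beta[s][o] * pi[o][a]."""
    n_obs, n_act = len(pi), len(pi[0])
    return [[sum(b_s[o] * pi[o][a] for o in range(n_obs)) for a in range(n_act)]
            for b_s in beta]


def state_action_frequencies(alpha, beta, pi, mu, gamma, tol=1e-12, max_iter=100_000):
    """Discounted state-action frequencies eta[s][a] of a memoryless policy.

    alpha[s][a][t]: probability of moving from state s to state t under action a
    beta[s][o]:     probability of observing o in state s
    pi[o][a]:       probability of choosing action a after observing o
    mu[s]:          initial state distribution
    gamma:          discount factor in [0, 1)
    """
    n = len(mu)
    tau = effective_policy(beta, pi)
    n_act = len(tau[0])
    # State-to-state transition kernel induced by the policy.
    p = [[sum(tau[s][a] * alpha[s][a][t] for a in range(n_act)) for t in range(n)]
         for s in range(n)]
    # Fixed-point iteration for rho = (1 - gamma) * mu + gamma * P^T rho
    # (assumed normalization: rho sums to one).
    rho = list(mu)
    for _ in range(max_iter):
        new = [(1 - gamma) * mu[t] + gamma * sum(rho[s] * p[s][t] for s in range(n))
               for t in range(n)]
        converged = max(abs(x - y) for x, y in zip(new, rho)) < tol
        rho = new
        if converged:
            break
    return [[rho[s] * tau[s][a] for a in range(n_act)] for s in range(n)]


def discounted_reward(eta, r):
    """Reward as the inner product of eta and the instantaneous reward r[s][a]."""
    return sum(e * x for e_s, r_s in zip(eta, r) for e, x in zip(e_s, r_s))


# Toy example (hypothetical): two states, two actions (0 = stay, 1 = switch),
# and a single observation, so the agent cannot tell the states apart.
alpha = [[[1.0, 0.0], [0.0, 1.0]],
         [[0.0, 1.0], [1.0, 0.0]]]
beta = [[1.0], [1.0]]
pi = [[0.5, 0.5]]             # uniform memoryless policy
mu = [1.0, 0.0]               # start in state 0
r = [[0.0, 0.0], [1.0, 1.0]]  # reward 1 whenever the agent is in state 1

eta = state_action_frequencies(alpha, beta, pi, mu, gamma=0.9)
reward = discounted_reward(eta, r)
print(reward)
```

In this toy example the uniform policy induces the discounted state distribution (0.55, 0.45), so the reward is approximately 0.45. Optimizing over eta directly, subject to the constraints that make eta a valid state-action frequency of some memoryless policy, is the kind of problem the repository hands to Ipopt; the sketch above only evaluates a fixed policy.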


