POMDPLearn

A library to learn MDP and POMDP models using a single dataset

For more information on what POMDPs and MDPs are, follow the links.

Installation & Setup

Clone the repository.
Linux is the preferred operating system.
Install Octave from Ubuntu Software.
Install the oct2py Python module using pip:

pip install oct2py

The BNT library uses MEX files. (From Wikipedia: A MEX file is a type of computer file that provides an interface between MATLAB or Octave and functions written in C, C++ or Fortran. It stands for "MATLAB executable".)
Certain .c files must be compiled into MEX files first.
In a bash shell, run the following commands:

sudo apt install liboctave-dev
cd ~/POMDPLearn/BNT/BNT/potentials/Tables/
mkoctfile -mex marg_table.c
mkoctfile -mex mult_by_table.c
mkoctfile -mex divide_by_table.c

Dependencies

import sys
from itertools import product
import numpy as np
import pandas as pd
from scipy.optimize import linprog
from tqdm import tqdm, tqdm_notebook

Usage

Learning an MDP or POMDP model

Steps:

Data preprocessing

The dataset used by the POMDPLearn library must satisfy the following criterion:
- States, actions, and observations must be in separate columns whose names start with the keywords "state_", "action_", and "obs_", each followed by the number of the epoch in the horizon.
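As an illustration of this naming convention, the sketch below builds a small made-up dataset and checks that every expected column is present. The values, the horizon of 2, the zero-based epoch numbering, and the helper `expected_columns` are all assumptions for the example and are not part of POMDPLearn.

```python
import pandas as pd

HORIZON = 2  # hypothetical number of epochs in the horizon

# Three made-up trajectories, one per row.
# Epoch numbering starting at 0 is an assumption; adjust to match your data.
df = pd.DataFrame(
    {
        "state_0": [0, 1, 0],
        "action_0": [1, 0, 1],
        "obs_0": [0, 1, 1],
        "state_1": [1, 1, 0],
        "action_1": [0, 0, 1],
        "obs_1": [1, 1, 0],
    }
)


def expected_columns(horizon, prefixes=("state_", "action_", "obs_")):
    """List the column names implied by the naming convention."""
    return [f"{prefix}{epoch}" for epoch in range(horizon) for prefix in prefixes]


# Report any column the convention requires but the dataframe lacks.
missing = [name for name in expected_columns(HORIZON) if name not in df.columns]
print("missing columns:", missing)
```

A check like this can be run before passing the dataframe to the library, so that naming problems surface early.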

Dataset and Model definition

Dataset

The preprocessed dataset is the input to the MDP or POMDP dataset object.
Using this dataset object, we define an MDP or POMDP object.

MDP or POMDP model definition

Calling the MDP class with the pandas DataFrame of our MDP dataset automatically instantiates an MDP dataset object. The POMDP class works the same way.

mdpModel = MDP(df=dfMDP_dataset)

Training and solving the MDP or POMDP model

MDP

Using the trainMDP() method of the MDP class, we train the MDP model.
Using the MDPsolve() method, we solve the MDP model using value iteration.
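For readers unfamiliar with value iteration, here is a minimal, generic NumPy sketch of the technique. It is not the code behind MDPsolve(); the function name `value_iteration`, the array layouts, the discount factor, and the two-state toy problem are assumptions made for illustration.

```python
import numpy as np


def value_iteration(P, R, gamma=0.9, tol=1e-8, max_iter=10_000):
    """Discounted value iteration (illustrative sketch).

    P: array of shape (A, S, S), P[a, s, s2] = probability of s -> s2 under a.
    R: array of shape (S, A), immediate reward for taking a in s.
    Returns the value function V (S,) and a greedy policy (S,).
    """

    def q_values(V):
        # Q[s, a] = R[s, a] + gamma * sum over s2 of P[a, s, s2] * V[s2]
        return R + gamma * np.einsum("asn,n->sa", P, V)

    V = np.zeros(P.shape[1])
    for _ in range(max_iter):
        V_new = q_values(V).max(axis=1)
        delta = np.max(np.abs(V_new - V))
        V = V_new
        if delta < tol:
            break
    policy = q_values(V).argmax(axis=1)
    return V, policy


# Toy problem (made up): action 0 stays in place, action 1 switches state.
P = np.array([
    [[1.0, 0.0], [0.0, 1.0]],  # action 0: stay
    [[0.0, 1.0], [1.0, 0.0]],  # action 1: switch
])
# Being in state 1 pays a reward of 1, whichever action is taken.
R = np.array([[0.0, 0.0], [1.0, 1.0]])

V, policy = value_iteration(P, R)
print(V, policy)
```

In this toy problem the greedy policy switches out of state 0 and stays in state 1, which matches the intuition that state 1 is the rewarding one.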

POMDP

Using the trainPOMDP() method of the POMDP class, we train the POMDP model.
Using the POMDPsolve() method, we solve the POMDP model using value iteration.

Policy execution

MDP

The policy obtained using the MDPsolve() method can be executed for any initial state using the policyExecution() method
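Conceptually, executing a policy means repeatedly looking up the action for the current state and then moving to the next state. The sketch below shows that loop in plain NumPy. It does not reproduce policyExecution(); the function `execute_policy`, the array layout of `P`, and the toy transition model are assumptions for illustration.

```python
import numpy as np


def execute_policy(policy, P, initial_state, horizon, rng=None):
    """Roll out a state -> action policy for `horizon` epochs (sketch).

    policy: sequence where policy[s] is the action chosen in state s.
    P: array of shape (A, S, S) with transition probabilities.
    Returns a list of (state, action) pairs, one per epoch.
    """
    rng = np.random.default_rng() if rng is None else rng
    state = initial_state
    trajectory = []
    for _ in range(horizon):
        action = int(policy[state])
        trajectory.append((state, action))
        # Sample the next state from the transition row for (action, state).
        state = int(rng.choice(P.shape[2], p=P[action, state]))
    return trajectory


# Made-up deterministic model: action 0 stays, action 1 switches state.
P = np.array([
    [[1.0, 0.0], [0.0, 1.0]],
    [[0.0, 1.0], [1.0, 0.0]],
])
policy = [1, 0]  # switch out of state 0, stay in state 1

trajectory = execute_policy(policy, P, initial_state=0, horizon=3)
print(trajectory)
```

Because the toy transitions are deterministic, the rollout is the same on every run; with a learned stochastic model, passing a seeded `rng` keeps runs reproducible.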

POMDP

Define an initial belief model to generate initial beliefs using baseline features.
Using the getRecActions() method, we get the policy that we should follow over the horizon.
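In a POMDP the policy acts on a belief, that is, a probability distribution over states, which is why an initial belief is needed. As background, the sketch below shows the standard Bayesian belief update after taking an action and receiving an observation. It is a generic illustration and not the internals of getRecActions(); the function `update_belief`, the array layouts, and the numbers are assumptions.

```python
import numpy as np


def update_belief(belief, action, observation, P, O):
    """Standard POMDP belief update (illustrative sketch).

    belief: array (S,), current probability of each state.
    P: array (A, S, S), P[a, s, s2] = transition probability.
    O: array (A, S, Z), O[a, s2, z] = probability of observing z in s2 after a.
    """
    # Prediction step: push the belief through the transition model.
    predicted = belief @ P[action]
    # Correction step: weight by the likelihood of the observation.
    unnormalized = O[action, :, observation] * predicted
    total = unnormalized.sum()
    if total == 0:
        raise ValueError("observation has zero probability under this belief")
    return unnormalized / total


# Made-up model with one action, two states, and two observations.
P = np.array([[[1.0, 0.0], [0.0, 1.0]]])  # the state never changes
O = np.array([[[0.8, 0.2], [0.3, 0.7]]])  # noisy readout of the state

initial_belief = np.array([0.5, 0.5])
new_belief = update_belief(initial_belief, action=0, observation=0, P=P, O=O)
print(new_belief)
```

Starting from a uniform belief, observation 0 shifts probability toward state 0, because that observation is more likely there.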

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Please make sure to update tests as appropriate.
