Skip to content

Latest commit

 

History

History
28 lines (19 loc) · 1.01 KB

README.md

File metadata and controls

28 lines (19 loc) · 1.01 KB

HS3MDP

POMCP enhancement for Hidden Semi-Markovian Mode MDP.

HS3MDP is based on POMCP 1.0 used in the NIPS 2010 paper Online Monte-Carlo Planning in Large POMDPs by David Silver and Joel Veness.

The original code can be found in this repository under the tag POMCP-1.0.

Traffic, Elevator and Sailboat problems are modified versions of those proposed by Samuel Ping-Man Choi in his thesis Reinforcement Learning in Non-stationary Environments.

The presentation on this work, in the 8th joint NII-LIP6 workshop, can be found here.

Requirements

  • git
  • automake, autoconf, etc.
  • C++ Boost

Installation

  1. git clone this repository
  2. run autoreconf -i
  3. run ./configure (possibly with --enable-assert)
  4. run make

You will find the executable pomcp in the src directory.

Usage

Simply run pomcp --help to see all possible parameters.