Skip to content

SharathRaparthy/nearest_sequence_memory

Repository files navigation

Nearest Sequence Memory for Hidden State Idenification

This repository is the reimplementation of the paper "Instance-Based State Identification for Reinforcement Learning" by R.Andrew McCallum

NSM is an instance-based algorithm for solving partially observable Markov decision problems (POMDPs). Here NSM algorithm is applies to a partially obsevable version of McCallum's grid-world presented in figure below.

Prerequisites

Matlab 2015b (or later version)
Ubuntu 14.04 (or later version)/Windows

Getting Started

After successful installation of matlab, clone this repository by using the following command

git clone https://github.com/SharathRaparthy/nearest_sequence_memory.git

Open your matlab and execute rndTrial.m script with the following MATLAB command:

plot(rndTrial(1000));

The result should like similar to figure below, but it should not be exactly the same.

Now run the NSMTrial function and plot the individual number of steps taken for 1000 episodes using the MATLAB command:

plot(NSMTrial(1000));

The result should like similar to Figure shown below but it should not be exactly the same.

About

Reimplementation of the paper "Instance-Based State Identification for Reinforcement Learning "

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages