Multi-Objective Reinforcement Learning components built on top of RL glue components
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.

README.md

MORL-Glue

MORL-Glue is a Multi-Objective Reinforcement Learning framework, adapted from RL-Glue by Brian Tanner and Adam White.

MORL-Glue logically separates the three main components of a reinforcement learning system, allowing the environment, experiment and agent to be implemented in different programming languages, on different systems.

Communication between the three is orchestrated by the MORL-Glue server, which is available for Linux and Windows systems. Windows 32-bit and 64-bit builds are available under Releases.

Most of the documentation for RL-Glue still applies. A good starting point is the RL-Glue overview, available under the docs directory.

Changes from RL-Glue

The main difference between RL-Glue and MORL-Glue is that MORL supports multiobjective reinforcement learning. That is, where RL-Glue is built around the assumption that agents should learn to optimize for a single scalar-valued reward, MORL-Glue allows representing vector-valued rewards. Agents can then combine these rewards into a single scalar value and apply standard single-objective RL techniques, or use explicitly multiobjective approaches.

Included MORL Agents

Several example MORL agents are included in the source distribution:

  • SkeletonAgent - A starting-point agent that selects a random action
  • UserControlledAgent - An example of an agent that uses user input to select an action, which is useful for debugging new environments.
  • TLO_Agent - a Q-Learning agent that uses Thresholded Lexicographic Ordering to determine how to prioritize objectives, as described by Gabor et al (1998) and Issabekov & Vamplew (2012).

Included MORL Environments

MORL-Glue includes a large number of standard RL and MORL benchmark problems:

  • DeepSeaTreasureEnv - The Deep Sea Treasure episodic multiobjective benchmark problem, with two objectives - a positive treasure and a negative time objective. The rewards in Deep Sea Treasure are such that the pareto front is concave. See Vamplew et al (2011) for details.
  • DeepSeaTreasureMixed - A variant of the above Deep Sea Treasure environment with a convex pareto front.
  • GeneralisedDeepSeaTreasureEnv - A generalised version of the Deep Sea Treasure benchmark, with configurable environment size, stochasticity and pareto shape.
  • LinkedRings and NonRecurrentRings - The Linked Rings benchmarks as described in Vamplew et al (2017a)
  • MOMountainCarDiscretised - A multiobjective version of the Mountain Car environment, as described in Vamplew et al (2011).
  • SpaceExploration - The Space Exploration benchmark as described in Vamplew et al (2017b)
  • BonusWorld - An episodic 2D grid-world with 3 objectives - 2 are terminal-only, and the other is the time objective. As described in Vamplew et al (2017b)
  • ResourceGathering - The resource collection problem from Barrett & Narayanan (2008)

Experiments

  • SkeletonExperiment - A starting-point experiment for MORL-Glue.
  • DemoExperiment - Designed for use in demonstrating the capabilities of MORL-Glue, in particular using the Generalised Deep Sea Treasure environment.

Codecs

There are several "codecs" available for RL-Glue which assist in writing agents, environments and experiments.

For MORL-Glue, only the Java codec is currently available. JavaRLGlueCodec.jar contains the Java codec classes, and is required for the included projects to run. A Python implementation is in development.

Running an experiment

MORL_Glue_Driver.java contains an example of how to connect an experiment, an agent and an environment to MORL-Glue. Importantly, any of these could be extracted to a separate process or on a separate machine.

Installation

Linux

MORL-Glue, like RL-Glue, uses the GNU autotools build framework.

If you have downloaded a source tarball, from the morlglue-server directory, run

configure && make && make install

Depending on your distribution, you may need to sudo make install.

You should now be able to run morlglue from the command prompt.

If you have cloned the Git repository, you may need to rerun the autotools. See the README for morlglue-server for more details.

Windows

The MORL-Glue server is available as a statically-linked stand-alone 32 or 64-bit executable for Windows (Vista or higher). Simply extract the executable into a directory and run it.

Visual Studio 2015 solutions are available if you wish to build your own version.