POTMMCP

This repository contains the implementation of the Partially Observable Type-based Meta Monte-Carlo Planning (POTMMCP) algorithm from the paper: Combining a Meta-Policy and Monte-Carlo Planning for Scalable Type-Based Reasoning in Partially Observable Environments.

Installation

The library is implemented in Python 3.8 and has the following main dependencies:

PyTorch >= 1.11
RLlib == 1.12
posggym (comes with repo)
posggym-agents (comes with repo)

As with any Python package, we recommend installing into a virtual environment (e.g. Conda).

Installing potmmcp requires cloning the repo and then installing each package, in order, using pip:

git clone git@github.com:Jjschwartz/potmmcp.git

# install posggym
cd potmmcp/posggym
pip install -e .

# install posggym-agents
cd ../posggym-agents
pip install -e .

# install potmmcp
cd ..
pip install -e .

This will install the potmmcp package along with the necessary dependencies.
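
To confirm the installation steps above worked, a quick check of the installed versions can help. The following is a minimal sketch, not part of the repository. The distribution names are assumptions: "torch" and "ray" are the PyPI names for PyTorch and RLlib, while "posggym", "posggym-agents" and "potmmcp" are guessed from the directory names in this repo.

```python
from importlib import metadata  # standard library in Python 3.8+

# Assumed distribution names (see the note above); adjust if pip reports
# different names for the locally installed packages.
EXPECTED = ["torch", "ray", "posggym", "posggym-agents", "potmmcp"]


def installed_versions(names):
    """Map each distribution name to its installed version, or None if absent."""
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    for name, version in installed_versions(EXPECTED).items():
        print(f"{name}: {version or 'NOT INSTALLED'}")
```

Any package reported as not installed, or a torch or ray version outside the ranges listed under the main dependencies, suggests that one of the pip steps was skipped or run in a different environment.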

The codebase

There are two main parts to the codebase:

The potmmcp directory containing the potmmcp python package
The experiments directory containing scripts and Jupyter notebooks for running and analysing the experiments used in the paper. The results are also stored here.

potmmcp

The potmmcp python package contains a few main parts:

baselines - implementation code for the different baselines used in the paper
plot - code used for generating plots and running analysis
run - code for running and tracking experiments
tree - the implementation of the POTMMCP algorithm
meta-policy.py - classes and functions implementing the meta-policy
policy_prior.py - classes and functions implementing the prior over policies

The main implementation of the POTMMCP algorithm is contained in the potmmcp/tree/policy.py file.

experiments

This directory contains scripts for running the experiments in each environment, Jupyter notebooks for analysing the results, and the results files themselves.

Results

By default, the experiment scripts save their results to the ~/potmmcp_results directory.
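
After a run, it can be handy to see what was written to the results directory. The snippet below is a small sketch for doing so. Only the default ~/potmmcp_results location comes from this README; the layout and file types inside it are not documented here, so the helper (a hypothetical name, list_result_files) simply lists every file it finds.

```python
from pathlib import Path

# Default location stated in this README; its internal layout is not assumed.
DEFAULT_RESULTS_DIR = Path.home() / "potmmcp_results"


def list_result_files(results_dir=DEFAULT_RESULTS_DIR):
    """Return every file under results_dir, sorted; empty list if it is missing."""
    results_dir = Path(results_dir)
    if not results_dir.is_dir():
        return []
    return sorted(p for p in results_dir.rglob("*") if p.is_file())


if __name__ == "__main__":
    for path in list_result_files():
        print(path)
```

Passing a different directory to list_result_files works the same way if the experiment scripts were configured to save elsewhere.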

Questions or Issues

If you have any questions or issues, please email jonathon.schwartz@anu.edu.au or create an issue in the Issues section on GitHub.

Authors

Jonathon Schwartz (primary author)
Hanna Kurniawati
Marcus Hutter

Please Cite

If you use the code in this repository or the POTMMCP algorithm, consider citing:

@article{schwartz2023combining,
  title = {Combining a Meta-Policy and Monte-Carlo Planning for Scalable Type-Based Reasoning in Partially Observable Environments}, 
  author = {Jonathon Schwartz and Hanna Kurniawati and Marcus Hutter},
  year = {2023},
  journal = {arXiv preprint arXiv:2306.06067},
  eprint = {2306.06067},
  eprinttype = {arxiv},
  archivePrefix = {arXiv}
}
