GitHub - crowdAI/marLo: Multi Agent Reinforcement Learning using MalmÖ

MarLÖ : Reinforcement Learning + Minecraft = Awesomeness

YOU-NEED-TO-READ-THIS : We are actively looking for maintainers for this library. If you are interested in helping maintain this library, please drop in a line [here](https://twitter.com/MeMohanty/) 😄

MarLÖ (short for Multi-Agent Reinforcement Learning in MalmÖ) is a high level API built on top of Project MalmÖ to facilitate Reinforcement Learning experiments with a great degree of generalizability, capable of solving problems in pseudo-random, procedurally changing single and multi agent environments withing the world of the mediatic phenomenon game Minecraft .

The Malmo platform provides an API which enables access to actions, observations (i.e. location, surroundings, video frames, game statistics) and other general data that Minecraft provides. Marlo, on the other hand, is a wrapper for Malmo that provides a higher level API and more standardized RL-friendly environment for scientific study.

The framework is written as an extension to OpenAI's Gym framework , which is a toolkit for developing and comparing reinforcement learning algorithms, thus providing an industry-standard and familiar platform for scientists, developers and popular RL frameworks.

The framework was used in the 2018 MarLo Challenge.

`MarLo-MazeRunner-v0`	`MarLo-CliffWalking-v0`	`MarLo-CatchTheMob-v0`
`MarLo-FindTheGoal-v0`	`MarLo-Attic-v0`	`MarLo-DefaultFlatWorld-v0`
`MarLo-DefaultWorld-v0`	`MarLo-Eating-v0`	`MarLo-Obstacles-v0`
`MarLo-TrickyArena-v0`	`MarLo-Vertical-v0`

Please consider citing the following paper if you find this work useful :

Diego Perez-Liebana, Katja Hofmann, Sharada Prasanna Mohanty, Noburu Kuno, Andre Kramer, Sam Devlin, Raluca D. Gaina “The Multi-Agent Reinforcement Learning in MalmÖ (MARLÖ) Competition”, 2019, Challenges in Machine Learning (NIPS Workshop), 2018; <http://arxiv.org/abs/1901.08129>.

Simple Example

#!/usr/bin/env python
# Please ensure that you have a Minecraft client running on port 10000
# by doing :
# $MALMO_MINECRAFT_ROOT/launchClient.sh -port 10000

import marlo
client_pool = [('127.0.0.1', 10000)]
join_tokens = marlo.make('MarLo-FindTheGoal-v0',
                          params={
                            "client_pool": client_pool
                          })
# As this is a single agent scenario,
# there will just be a single token
assert len(join_tokens) == 1
join_token = join_tokens[0]

env = marlo.init(join_token)

observation = env.reset()

done = False
while not done:
    _action = env.action_space.sample()
    obs, reward, done, info = env.step(_action)
    print("reward:", reward)
    print("done:", done)
    print("info", info)
env.close()

Authors

Sharada Mohanty

Name		Name	Last commit message	Last commit date
Latest commit History 318 Commits
agent		agent
docs		docs
examples		examples
marlo		marlo
results		results
source		source
tests		tests
.gitignore		.gitignore
.readthedocs.yml		.readthedocs.yml
LICENSE.md		LICENSE.md
Makefile		Makefile
README.rst		README.rst
README_Draft.md		README_Draft.md
README_MINIMAL.md		README_MINIMAL.md
acknowledgements.md		acknowledgements.md
make.bat		make.bat
setup.py		setup.py
two_agent_minecraft_launch.py		two_agent_minecraft_launch.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MarLÖ : Reinforcement Learning + Minecraft = Awesomeness

Contents

Simple Example

Authors

About

Releases

Packages

Contributors 9

Languages

License

crowdAI/marLo

Folders and files

Latest commit

History

Repository files navigation

MarLÖ : Reinforcement Learning + Minecraft = Awesomeness

Contents

Simple Example

Authors

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 9

Languages

Packages