Name	Name	Last commit message	Last commit date
parent directory ..
envs	envs
README.md	README.md
__init__.py	__init__.py
keyboard_play_demo_2d.py	keyboard_play_demo_2d.py
keyboard_play_demo_continous_3d.py	keyboard_play_demo_continous_3d.py
keyboard_play_demo_discrete_3d.py	keyboard_play_demo_discrete_3d.py
test.py	test.py

Introduction

MetaMaze is a powerful and efficient simulator for navigation in a randomly generated maze. You may use MetaMaze to generate nearly unlimited configuration of mazes, and nearly unlimited different tasks. We use MetaMaze to facilitate researches in Meta-Reinforcement-Learning.

There are 3 types of mazes:

meta-maze-2D-v0
--- Observation space: its surrounding $(2n+1)\times(2n+1)$ (n specified by view_grid parameter) grids
--- Action space: 4-D discrete N/S/W/E
meta-maze-discrete-3D-v0
--- Observation space: RGB image of 3D first-person view.
--- Action space: 4-D discrete TurnLeft/TurnRight/GoForward/GoBackward.
meta-maze-continuous-3D-v0
--- Observation space: RGB image of 3D first-person view.
--- Action space: 2-D continuous [Turn, Forward/Backward]

Each type of mazes support 2 modes:

ESCAPE mode
--- Reach an unknown goal as soon as possible
--- The goal is specified by task configuration
--- Each step the agent receives reward of step_reward
--- Acquire goal_reward when reaching the goal
--- Episode terminates when reaching the goal
SURVIVAL mode
--- The agent begins with initial_life specified by the task
--- Food is generated at fixed grids specified by the task
--- When agent reaches the food spot, its life is extended depending on the food
--- When food is cosumed, it will be refreshed following a fixed periodicity
--- The life slowly decreases with time, depeding on step_reward
--- Episode terminates when life goes below 0
--- The total reward is the food being consumed
--- The agent's current life is shown by a red bar at the top of its view in 3D mazes
--- The agent's current life is shown in the center of the $(2n+1)\times(2n+1)$ in 2D mazes

Demonstrations of 2D maze

Demonstrations of 3D mazes

Install

pip install metagym[metamaze]

For local installation, execute the following commands:

git clone https://github.com/PaddlePaddle/MetaGym
cd MetaGym
pip install .[metamaze]

Quick Start

Import

Import and create the meta maze environment with

import gym
import metagym.metamaze
from metagym.metamaze import MazeTaskSampler

maze_env_3D_Cont = gym.make("meta-maze-continuous-3D-v0", enable_render=True, task_type="SURVIVAL") # Running a continuous 3D Maze with SURVIVAL task
maze_env_3D_Disc = gym.make("meta-maze-discrete-3D-v0", enable_render=True, task_type="ESCAPE") # Running a discrete 3D Maze with ESCAPE task
maze_env_2D = gym.make("meta-maze-2D-v0", enable_render=True, task_type="ESCAPE") # Running a 2D Maze with ESCAPE task

Maze Generation

Use the following code to generate a random maze

#Sample a task by specifying the configurations
task = MazeTaskSampler(
    n            = 15,  # Number of cells = n*n
    allow_loops  = False,  # Whether loops are allowed
    crowd_ratio  = 0.40,   # Specifying how crowded is the wall in the region, only valid when loops are allowed. E.g. crowd_ratio=0 means no wall in the maze (except the boundary)
    cell_size    = 2.0, # specifying the size of each cell, only valid for 3D mazes
    wall_height  = 3.2, # specifying the height of the wall, only valid for 3D mazes
    agent_height = 1.6, # specifying the height of the agent, only valid for 3D mazes
    view_grid    = 1, # specifiying the observation region for the agent, only valid for 2D mazes
    step_reward  = -0.01, # specifying punishment in each step in ESCAPE mode, also the reduction of life in each step in SURVIVAL mode
    goal_reward  = 1.0, # specifying reward of reaching the goal, only valid in ESCAPE mode
    initial_life = 1.0, # specifying the initial life of the agent, only valid in SURVIVAL mode
    max_life     = 2.0, # specifying the maximum life of the agent, acquiring food beyond max_life will not lead to growth in life. Only valid in SURVIVAL mode
    food_density = 0.01,# specifying the density of food spot in the maze, only valid in SURVIVAL mode
    food_interval= 100, # specifying the food refreshing periodicity, only valid in SURVIVAL mode
    )

Running Mazes

#Set the task configuration to the meta environment
maze_env.set_task(task)
maze_env.reset()

#Start the task
done = False
while not done:
    action = maze_env.action_space.sample() 
    observation, reward, done, info = maze_env.step(action)
    maze_env.render()

Keyboard Demonstrations

2D Mazes Demonstration

For a demonstration of keyboard controlled 2D mazes, run

python metagym/metamaze/keyboard_play_demo_2d.py

3D Discrete Mazes Demonstration

For a demonstration of keyboard controlled discrete 3D mazes, run

python metagym/metamaze/keyboard_play_demo_discrete_3d.py

3D Continuous Mazes Demonstration

For a demonstration of keyboard controlled 3D mazes, run

python metagym/metamaze/keyboard_play_demo_continuous_3d.py

Writing your own policy

Specifying action with your own (Meta RL) policy without relying on keyboards and rendering, check

python metagym/metamaze/test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

metamaze

metamaze

README.md

Introduction

Install

For local installation, execute the following commands:

Quick Start

Import

Maze Generation

Running Mazes

Keyboard Demonstrations

2D Mazes Demonstration

3D Discrete Mazes Demonstration

3D Continuous Mazes Demonstration

Writing your own policy

Files

metamaze

Directory actions

More options

Directory actions

More options

Latest commit

History

metamaze

Folders and files

parent directory

README.md

Introduction

Install

For local installation, execute the following commands:

Quick Start

Import

Maze Generation

Running Mazes

Keyboard Demonstrations

2D Mazes Demonstration

3D Discrete Mazes Demonstration

3D Continuous Mazes Demonstration

Writing your own policy