Box-World

A simple implementation of the environment in Google Deepmind's paper "Relational Deep Reinforcement Learning". No rendering, only for computer training and testing.

Metadata

kwargs of gym.make

metadata	description	default
max_length	the number of boxes in the path to the goal	4
max_branch_num	the number of distractor branches	4
branch_length	the length of the distractor branches	1

Observation

Type: ndarray(14, 14, 3)

Max	Min
0.0	255.0

Type	Color
Wall	RGB(0.0, 0.0, 0.0)
Space	RGB(192.25, 191.25, 191.25)
Agent	RGB(127.5, 127.5, 127.5)
Gem	RGB(255.0, 255.0, 255.0)

Action

Type: Discrete(4)

Num	Action
0	Agent move up
1	Agent move left
2	Agent move down
3	Agent move right

Reward

action	reward
collecting the gem	10
opening a box in the solution path	1
opening a distractor box	-1
otherwise	0

Termination

Terminated when:

the gem is collected
a distractor box is opened

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
box_world		box_world
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Box-World

Metadata

Observation

Action

Reward

Termination

About

Releases

Packages

Languages

License

HTYISABUG/Box-World

Folders and files

Latest commit

History

Repository files navigation

Box-World

Metadata

Observation

Action

Reward

Termination

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages