Craft Environment

CraftEnv is a 2D crafting environment, adapted and extended from Andreas et al.,[1].

It supports a fully flexible setup of hierarchical tasks, with sparse rewards, in a fully procedural setting.

In this environment, an agent has to:

Move around
Pick up items and keeping them in an inventory
Transform things at workshops

Tasks are quite varied, and requires the agent to pick up components and create various items. Here is an example of what is required to perform the task pick up gold (see Figure below):

Get wood
Make plank: Get wood → Use workbench
Make bridge: Get wood → Get iron → Use factory
Get gold: Make bridge → Use bridge on water

Crafting environment example for a complex multi-step task.
When provided with a "get gold" task by the Teacher, the agent needs to first collect wood, transform it at a workshop to construct a bridge, in order to finally pass across the water and reach the gold.

This kind of multi-step action sequences can be really hard to learn, given that agents usually only performs random exploration.

In total, I have extended the environment and defined a collection of 17 tasks, ranging from simple "collection" tasks to more complex ones that requires crafting several items to succeed. See the next Figure for a complex list, along with an indication of their difficulty.

Full list of tasks considered in our crafting environment.
Tasks range from simple collection "Get X" tasks, slightly more difficult crafting tasks (requiring to collect items before), all the way to complex tasks requiring several crafted components which depend themselves on sub-tasks.

I wrote a visualisation tool to see how these environments look like and how agents interact with it. This is currently only used for post-hoc analysis or debugging, as the agent instead receives a flattened vector of features (1-hot indicators of the type of element in a given cell), a 5x5 egocentric view around the agent and its inventory as observation. But in future work, one could instead use this 2D view as observation for agents.

Visualisation of a random agent trying to solve the "Get grass" task.
The instruction is shown at the top, the 2D grid in the middle and the inventory is shown at the bottom. Each color correspond to a different object or workshop, the player is the red dot. When the player picks up a green square (grass), the screen flashes to indicate a positive reward.

Usage:

To visualise random agent, run with:

python random_agent.py

Dependencies:

Note:

This code should also be able to integrate with Gym environments with minor changes, which will be added soon. Currently the wrapper for the Craft Env mimics DMLab interface.

Acknowledgements

@jacobandreas for open-sourcing the mine-craft inspired Craft Environment used in Policy Sketches paper [1] Craft Environment which this codebase is heavily based on

References

[1] Modular Multitask Reinforcement Learning with Policy Sketches (Andreas et al., 2016)

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
assets		assets
misc		misc
resources		resources
README.md		README.md
__init__.py		__init__.py
cookbook.py		cookbook.py
craft.py		craft.py
env.py		env.py
env_factory.py		env_factory.py
run_random_agent.py		run_random_agent.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Craft Environment

Usage:

Dependencies:

Note:

Acknowledgements

References

About

Releases

Packages

Languages

Feryal/craft-env

Folders and files

Latest commit

History

Repository files navigation

Craft Environment

Usage:

Dependencies:

Note:

Acknowledgements

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages