NetHack

Description

NetHack is an environment for evaluating agents on the classic roguelike dungeon exploration game NetHack, built on the NetHack Learning Environment (NLE). Agents explore procedurally generated dungeons by sending individual keystrokes and receiving ASCII terminal screen observations along with structured player statistics. The goal is to maximize the in-game score through strategic exploration, combat, item management, and dungeon navigation.

Capabilities

Strategic exploration of procedurally generated dungeons
Combat decision-making against diverse monsters
Inventory and resource management under constraints
Long-horizon sequential decision-making under uncertainty
Reading and interpreting ASCII game displays

Compute Requirements

4 GB RAM, 2 CPUs. NLE compiles from source and requires cmake and build tools in the Docker image.

License

NetHack General Public License.

Tasks

There is one split:

train: 1,000 tasks (seeds 0-999)

Each task corresponds to a unique random seed that generates a distinct dungeon layout, starting character, and item placement. All tasks use the full NetHack action space (121 actions). Since every task is a procedurally generated dungeon with no structural difference between them, a single split is used.

Reward Structure

This is a dense, verifiable reward environment. After each keystroke action, the agent receives the raw reward from the NLE engine, which is the in-game score delta (change in score since the previous step). Score increases from exploration, combat, item identification, gold collection, and dungeon descent.

We do not use LLM graders for this environment.

Data

No external data is required. Game state is generated procedurally by the NetHack engine using seeded randomness. Data is stored on the OpenReward platform.

Tools

Agents are given two tools:

step(keystroke): Send a keystroke to the game and receive the updated terminal screen, player stats, and game messages. Accepts single characters (k, ,, <), control keys (ctrl+d), meta keys (meta+j), and special keys (enter, space, escape).
commands(): Display a reference of all available NetHack keystroke commands grouped by category.

Time Horizon

NetHack is a multi-turn environment with up to 1,000,000 steps per episode. Each step corresponds to one keystroke action. Episodes end on character death, ascension, or reaching the step limit.

Other Environment Requirements

There are no further environment requirements; NetHack works out of the box without any secrets or API keys.

Safety

Agents interact only with a roguelike game simulation. The environment has no access to external systems, the internet, or sensitive data.

Citations

@inproceedings{kuettler2020nethack,
  author    = {Heinrich K{\"{u}}ttler and Nantas Nardelli and
               Alexander H. Miller and Roberta Raileanu and
               Marco Selvatici and Edward Grefenstette and
               Tim Rockt{\"{a}}schel},
  title     = {{The NetHack Learning Environment}},
  booktitle = {Proceedings of the Conference on Neural Information
               Processing Systems (NeurIPS)},
  year      = {2020},
}

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
Dockerfile		Dockerfile
README.md		README.md
nethack_env.py		nethack_env.py
requirements.txt		requirements.txt
server.py		server.py
test_agent.py		test_agent.py
test_local.py		test_local.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NetHack

Description

Capabilities

Compute Requirements

License

Tasks

Reward Structure

Data

Tools

Time Horizon

Other Environment Requirements

Safety

Citations

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

NetHack

Description

Capabilities

Compute Requirements

License

Tasks

Reward Structure

Data

Tools

Time Horizon

Other Environment Requirements

Safety

Citations

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages