Skip to content

Hide & Seek 2.0 — v1.0.0

Choose a tag to compare

@GeFAA GeFAA released this 13 Jun 16:28
· 9 commits to main since this release

Hide & Seek 2.0 — v1.0.0

First public release: a modern JAX / Flax recreation and expansion of
OpenAI's 2019 Emergent Tool Use From Multi-Agent Autocurricula ("Hide-and-Seek").

Highlights

  • End-to-end on-GPU JAX (PureJaxRL / JaxMARL style): environment, physics and
    learner in one jit / scan graph, zero host/device copies in the hot loop.
  • MAPPO with CTDE — decentralized actors on local masked observations, a
    centralized critic on privileged global state; per-team parameter sharing.
  • Entity-centric Transformer encoder + scanned GRU memory (object
    permanence through fog and occlusion).
  • ELO historical self-play for a stable autocurriculum.
  • 2.0 mechanics: variable mass & cooperative physics, sensory decoys, fog of
    war & dynamic lighting, destructible walls & doors, and stamina.
  • Anti box-surfing fix: strict Newtonian ground-contact gating.
  • Clean 3D replay viewer (Three.js, OpenAI Emergent Tool Use visual style) —
    no build step, with a pure-stdlib synthetic demo that runs without JAX or a GPU.

Quick look (no training needed)

python viz/make_demo_trajectory.py
python -m viz.serve
# then open http://localhost:8000

Status

A research scaffold: engineered to be correct, importable and faithful to
current JAX idioms. Quoted performance is a design target, not a benchmark,
and the JAX execution paths require a JAX/GPU environment to run. Not affiliated
with or endorsed by OpenAI.