Hide & Seek 2.0 — v1.0.0
Hide & Seek 2.0 — v1.0.0
First public release: a modern JAX / Flax recreation and expansion of
OpenAI's 2019 Emergent Tool Use From Multi-Agent Autocurricula ("Hide-and-Seek").
Highlights
- End-to-end on-GPU JAX (PureJaxRL / JaxMARL style): environment, physics and
learner in onejit/scangraph, zero host/device copies in the hot loop. - MAPPO with CTDE — decentralized actors on local masked observations, a
centralized critic on privileged global state; per-team parameter sharing. - Entity-centric Transformer encoder + scanned GRU memory (object
permanence through fog and occlusion). - ELO historical self-play for a stable autocurriculum.
- 2.0 mechanics: variable mass & cooperative physics, sensory decoys, fog of
war & dynamic lighting, destructible walls & doors, and stamina. - Anti box-surfing fix: strict Newtonian ground-contact gating.
- Clean 3D replay viewer (Three.js, OpenAI Emergent Tool Use visual style) —
no build step, with a pure-stdlib synthetic demo that runs without JAX or a GPU.
Quick look (no training needed)
python viz/make_demo_trajectory.py
python -m viz.serve
# then open http://localhost:8000Status
A research scaffold: engineered to be correct, importable and faithful to
current JAX idioms. Quoted performance is a design target, not a benchmark,
and the JAX execution paths require a JAX/GPU environment to run. Not affiliated
with or endorsed by OpenAI.