Inference runtime for Causal GPT-RL policy bundles: decoder-only autoregressive transformers as RL agents.
reinforcement-learning transformers continuous-control mujoco autoregressive-models offline-rl policy-inference
-
Updated
May 24, 2026 - Python