Code from "How useful is quantilization for mitigating specification-gaming?"
reinforcement-learning
sklearn
atari2600
python3
pytorch
behavioral-cloning
hopper
imitation-learning
imitation
paper-implementations
mujoco-py
video-pinball
Updated Jun 17, 2024 - Python