Apollo DQN — reproducible DQN variants for LunarLander-v3 with a Residual MLP backbone, PER-lite replay, strict evaluation pipeline and human baselines.
-
Updated
Mar 3, 2026 - Python
Apollo DQN — reproducible DQN variants for LunarLander-v3 with a Residual MLP backbone, PER-lite replay, strict evaluation pipeline and human baselines.
Project for CENG567 Reinforcement Course that I took at IZTECH
Empirical study of over-estimation bias in DDPG vs TD3 on LunarLanderContinuous-v3.
Add a description, image, and links to the lunarlander-v3 topic page so that developers can more easily learn about it.
To associate your repository with the lunarlander-v3 topic, visit your repo's landing page and select "manage topics."