Reinforcement Learning: DQN on CartPole-v1: Baseline, Reward Shaping, and Hyperparameter Study
-
Updated
Jan 19, 2026 - Jupyter Notebook
Reinforcement Learning: DQN on CartPole-v1: Baseline, Reward Shaping, and Hyperparameter Study
MSc Dissertation Project. Slate-based reinforcement learning for recommender systems using RecSim NG. Implements and evaluates SlateQ and its variants against bandit and heuristic baselines.
This is a repo for a Reinforcement Learning research environment, customized algorithms and trainers for the RL course competition @uni-tuebingen
MSc Dissertation Project. Slate-based reinforcement learning for recommender systems using RecSim NG. Implements and evaluates SlateQ and its variants against bandit and heuristic baselines.
Add a description, image, and links to the rl-research topic page so that developers can more easily learn about it.
To associate your repository with the rl-research topic, visit your repo's landing page and select "manage topics."