This repository contains implementations of reinforcement learning algorithms and experiments with environments using the OpenAI Gym API.
- Monte Carlo on-policy evaluation
- Monte Carlo on-policy control
- Monte Carlo off-policy evaluation
- Monte Carlo off-policy control
- 2d grid world: A 2d rectangular grid world where agent can move deterministically up, down, left, right.
- 1d random walk: A 1d random walk reward-only process.