This repository contains an implementation of PPO for solving the CartPole-v1 environment using PyTorch. The code is adapted from a tutorial with significant enhancements.
- Modern implementation using Gymnasium (successor to OpenAI Gym)
- Performance monitoring with TensorBoard
- Experiment tracking via Weights & Biases
- Optimized hyperparameters for CartPole-v1
rl-video-step-400.mp4
As a fellow neanderthal, I needed to make notes for myself:
Here is a notion link with some of my findings: Notion Link
Environment dependencies are managed through Conda. To get started:
conda env create -f environment.yml