- Choho Yann Eric CHOHO
- Yedidia AGNIMO
February 2024.
The goal of this project is to implement the Rainbow algorithm and compare it to the DQN algorithm.
The Rainbow algorithm improves on DQN by combining six extensions to deep Q-learning:
- Double Q-learning
- Prioritized Experience Replay
- Dueling Network Architecture
- Multi-step Learning
- Distributional RL
- Noisy Nets
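As an illustration of the multi-step learning component above, here is a minimal sketch (plain Python, not the project's code; the function name and signature are hypothetical) of how an n-step return is accumulated before bootstrapping:

```python
GAMMA = 0.99

def n_step_return(rewards, bootstrap_value, gamma=GAMMA):
    """Discounted n-step return: r_0 + g*r_1 + ... + g^(n-1)*r_{n-1} + g^n * V(s_n)."""
    g = 0.0
    for i, r in enumerate(rewards):
        g += (gamma ** i) * r
    # Bootstrap from the value estimate of the state reached after n steps
    return g + (gamma ** len(rewards)) * bootstrap_value
```

With `gamma=1.0`, three rewards of 1 and a zero bootstrap give a return of 3.0.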
We implement the Rainbow algorithm and compare it to the DQN algorithm and several DQN extensions on the CartPole environment. The algorithms are compared in terms of the score defined in the paper.
The implementation of the Rainbow algorithm is based on the following steps:
- Importing the required libraries
- Defining the hyperparameters
- Defining the agent
- Defining the replay buffer
- Defining the network
- Training the agent
- Evaluating the agent
- Visualizing the agent's performance
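For the "Defining the replay buffer" step, a minimal uniform replay buffer can be sketched as follows (an illustration only; the project's actual buffers are prioritized and named differently):

```python
import random
from collections import deque

class ReplayBuffer:
    """Fixed-capacity ring buffer of (s, a, r, s', done) transitions."""

    def __init__(self, capacity):
        self.buffer = deque(maxlen=capacity)  # old transitions are evicted automatically

    def push(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform sampling; prioritized replay would weight by TD error instead
        return random.sample(self.buffer, batch_size)

    def __len__(self):
        return len(self.buffer)
```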
To run this code, install the required libraries:

- Open your terminal or command prompt.
- Create a virtual environment (optional but recommended): run `python -m venv env` to create a virtual environment named "env".
- Activate the virtual environment:
  - On Windows, run `env\Scripts\activate`.
  - On macOS and Linux, run `source env/bin/activate`.
- Install the required libraries by running `pip install -r requirements.txt`.
The main libraries are:
- torch
- gymnasium (the maintained fork of OpenAI's Gym)
We define the hyperparameters for the Rainbow algorithm.
```python
# Hyperparameters
BATCH_SIZE = 32
LR = 0.0005
EPSILON = 0.0005
GAMMA = 0.99
TARGET_UPDATE = 1000
REPLAY_MEMORY_SIZE = 15000
LEARNING_STARTS = 1000
N_ATOMS = 51
V_MIN = -10
V_MAX = 10
```
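`N_ATOMS`, `V_MIN`, and `V_MAX` define the support of the return distribution used by the distributional (C51) component. A small sketch of how the atom positions follow from these hyperparameters:

```python
N_ATOMS, V_MIN, V_MAX = 51, -10.0, 10.0

# Spacing between adjacent atoms of the categorical distribution
delta_z = (V_MAX - V_MIN) / (N_ATOMS - 1)

# Atom positions: evenly spaced values from V_MIN to V_MAX
support = [V_MIN + i * delta_z for i in range(N_ATOMS)]
```

The network then predicts a probability for each of the 51 atoms, and the Q-value is the expectation over this support.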
- To launch the code: use the `notebook.ipynb`.
- The `utils` folder contains the `Agent` class used for training each Q-algorithm (including Rainbow). The class name is written in uppercase (e.g., `AGENT`), while the neural network classes have names ending with `Network`. Additionally, there are specific buffer classes with names starting with `Buffer`.
- The `Result` folder contains testing on one episode of each algorithm in .mp4 format.
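As an illustration of the dueling architecture used by the network classes, here is a minimal PyTorch sketch (the class name and layer sizes are hypothetical, not the repository's actual code):

```python
import torch
import torch.nn as nn

class DuelingNetwork(nn.Module):
    """Dueling head: separate state-value and advantage streams."""

    def __init__(self, obs_dim, n_actions, hidden=128):
        super().__init__()
        self.feature = nn.Sequential(nn.Linear(obs_dim, hidden), nn.ReLU())
        self.value = nn.Linear(hidden, 1)          # V(s)
        self.advantage = nn.Linear(hidden, n_actions)  # A(s, a)

    def forward(self, x):
        h = self.feature(x)
        v = self.value(h)
        a = self.advantage(h)
        # Subtract the mean advantage so V and A are identifiable
        return v + a - a.mean(dim=1, keepdim=True)
```

For CartPole, `obs_dim=4` and `n_actions=2`, so a batch of states maps to a (batch, 2) tensor of Q-values.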
