The impact of data distribution on Q-learning with function approximation

This repository contains the code for the experimental results presented in: https://arxiv.org/abs/2111.11758

Installation

Set up and activate a virtual environment (tested with Python 3.8.10).
Clone this repo: git clone https://github.com/PPSantos/rl-data-distribution.git
Change into the repo directory: cd rl-data-distribution
Install requirements: pip install -r requirements.txt
Install the package in editable mode: pip install -e .
Test installation: python scripts/run.py

Concentrability coefficients

Code to estimate optimal data distributions from the point of view of concentrability (Sec. 3.2 of the paper):

concentrability_coefficients/opt-conc-coefs.ipynb: Jupyter notebook that runs our algorithm on a simple example MDP.
concentrability_coefficients/run.py: main script to run our algorithm.
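For readers unfamiliar with the quantity involved, the sketch below computes one common form of a concentrability coefficient: the largest ratio between a target state-action distribution and the data distribution. It is not the algorithm from the paper or from concentrability_coefficients/run.py; the function name, the 2-state, 2-action distributions, and the exact definition used are assumptions made for illustration.

```python
def concentrability(target, data):
    """Return max over (s, a) of target[s][a] / data[s][a].

    Both arguments are nested lists indexed as [state][action] and are
    assumed to be probability distributions over state-action pairs.
    Pairs with zero target mass are skipped; a pair with positive target
    mass but zero data mass makes the coefficient infinite.
    """
    worst = 0.0
    for target_row, data_row in zip(target, data):
        for t, d in zip(target_row, data_row):
            if t == 0.0:
                continue
            if d == 0.0:
                return float("inf")
            worst = max(worst, t / d)
    return worst


# Hypothetical distributions over 2 states x 2 actions (illustration only).
target = [[0.4, 0.1], [0.4, 0.1]]
uniform = [[0.25, 0.25], [0.25, 0.25]]
skewed = [[0.5, 0.5], [0.0, 0.0]]  # never visits state 1

print(concentrability(target, uniform))
print(concentrability(target, target))
print(concentrability(target, skewed))
```

Under this definition, a data distribution that matches the target gives a coefficient of 1, while a data distribution that misses any pair the target visits gives an infinite coefficient.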

Four-state MDP

Code for the four-state MDP experiments (Sec. 4.1 of the paper):

four_state_mdp/offline_plots.py: produces the plots for the offline experiments (oracle version and TD-error version).
four_state_mdp/run.py: contains the code for the online TD experiments (unlimited and limited replay capacity).
four_state_mdp/plots.py: produces the plots for both online TD versions.

Empirically assessing the impact of the data distribution in offline RL

Code for Sec. 4.2 of the paper:

scripts/run.py: the main script. It generates a dataset (using scripts/dataset.py) and feeds it to the RL algorithm for training (using scripts/train.py). See the RUN_ARGS global variable in scripts/run.py for a description of the available dataset types and options, as well as the RL algorithms and their hyperparameters.
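The generate-then-train flow described above can be illustrated with a small, self-contained sketch. It does not reproduce scripts/dataset.py or scripts/train.py: the toy two-state MDP, the function names generate_dataset and train, and all hyperparameters are hypothetical, and tabular Q-learning stands in for the function-approximation algorithms used in the paper.

```python
import random

GAMMA = 0.9
N_STATES, N_ACTIONS = 2, 2


def step(state, action):
    """Toy deterministic MDP (hypothetical): the action picks the next state."""
    reward = 1.0 if action == 1 else 0.5
    return action, reward


def generate_dataset(mu, n_samples, seed=0):
    """Sample (s, a) pairs from the data distribution mu, then query the MDP."""
    rng = random.Random(seed)
    pairs = [(s, a) for s in range(N_STATES) for a in range(N_ACTIONS)]
    weights = [mu[s][a] for s, a in pairs]
    dataset = []
    for s, a in rng.choices(pairs, weights=weights, k=n_samples):
        s_next, r = step(s, a)
        dataset.append((s, a, r, s_next))
    return dataset


def train(dataset, epochs=200, alpha=0.5):
    """Offline tabular Q-learning: repeated sweeps over a fixed dataset."""
    q = [[0.0] * N_ACTIONS for _ in range(N_STATES)]
    for _ in range(epochs):
        for s, a, r, s_next in dataset:
            td_target = r + GAMMA * max(q[s_next])
            q[s][a] += alpha * (td_target - q[s][a])
    return q


def greedy(q):
    """Greedy action per state (ties resolved toward the lowest index)."""
    return [max(range(N_ACTIONS), key=row.__getitem__) for row in q]


uniform = [[0.25, 0.25], [0.25, 0.25]]
only_action_0 = [[0.5, 0.0], [0.5, 0.0]]  # action 1 never appears in the data

q_uniform = train(generate_dataset(uniform, 200))
q_skewed = train(generate_dataset(only_action_0, 200))
print(greedy(q_uniform), greedy(q_skewed))
```

In this toy setting, action 1 is optimal in both states. The dataset drawn from the uniform distribution recovers that policy, while the dataset that never contains action 1 leaves its Q-values at their initial value of zero, so the learned greedy policy picks action 0.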

Compiled data:

parsed_results.csv: contains all the compiled data used to produce the article's plots.
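A minimal way to inspect the compiled data before plotting is sketched below. The column names of parsed_results.csv are not documented here, so the sketch only lists whatever columns it finds; the helper name load_results is hypothetical and not part of the repository.

```python
import csv
from pathlib import Path


def load_results(path):
    """Read a CSV file and return (column_names, rows).

    Each row is a dict mapping column name to the raw string value.
    """
    with open(path, newline="") as f:
        reader = csv.DictReader(f)
        columns = list(reader.fieldnames or [])
        rows = [dict(row) for row in reader]
    return columns, rows


# Assumes the script is run from the repository root.
results_path = Path("parsed_results.csv")
if results_path.exists():
    columns, rows = load_results(results_path)
    print(f"{len(rows)} rows, columns: {columns}")
```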

Run dashboard:

Install dash: pip install dash
Set the CSV_PATH global variable in analysis/dash_app/app.py to point to the parsed_results.csv file.
Run dashboard: python analysis/dash_app/app.py
