Exploiting Distributional Temporal Difference Learning
To Deal With Tail Risk

This repository contains both the environment and the agents used in the leptokurtosis project.

For a detailed description of the experiment, please refer to our paper.

Environment (envs):

  • Leptokurtosis (an illustrative sketch of such a task follows below)
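
Below is a minimal, hypothetical sketch of what a leptokurtic-reward task could look like: a two-armed bandit whose arms share the same mean reward but differ in tail risk (one thin-tailed Gaussian arm, one heavy-tailed Student-t arm). The class name LeptokurticBandit and every parameter here are illustrative assumptions, not the repository's actual Leptokurtosis environment.

# Hypothetical sketch only -- NOT the repository's environment.
import numpy as np

class LeptokurticBandit:
    def __init__(self, seed=0):
        self.rng = np.random.default_rng(seed)

    def step(self, action):
        if action == 0:
            # Thin-tailed arm: standard normal reward.
            return self.rng.normal(loc=0.0, scale=1.0)
        # Heavy-tailed arm: Student-t with few degrees of freedom, so the
        # mean is the same (zero) but rare, extreme outcomes occur.
        return self.rng.standard_t(df=3)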

Implemented algorithms (agents):

  • Tabular SARSA
  • Categorical Temporal Difference Learning (a tabular version of categorical distributional reinforcement learning; see the sketch after this list)
  • (Efficient) Distributional Temporal Difference Learning:
    • Integration over reward distribution (sample average) method.
    • Maximum Likelihood Estimator (EM-MLE) method.
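
As a concrete illustration of the categorical item above, here is a minimal tabular sketch of the projected categorical TD update on a fixed support of atoms (in the spirit of the C51 algorithm, restricted to a lookup table). All names, sizes, and hyper-parameters are illustrative assumptions and do not reflect the repository's code or the paper's exact algorithms.

# Hypothetical sketch of tabular categorical TD learning -- NOT the repository's code.
import numpy as np

N_STATES, N_ACTIONS = 10, 2
N_ATOMS, V_MIN, V_MAX = 51, -10.0, 10.0
GAMMA, ALPHA = 0.99, 0.1

support = np.linspace(V_MIN, V_MAX, N_ATOMS)             # atom locations z_i
delta_z = (V_MAX - V_MIN) / (N_ATOMS - 1)
# p[s, a, i] = probability mass on atom z_i of the return distribution Z(s, a)
p = np.full((N_STATES, N_ACTIONS, N_ATOMS), 1.0 / N_ATOMS)

def categorical_td_update(s, a, r, s_next, a_next):
    # Project r + GAMMA * Z(s_next, a_next) back onto the fixed support,
    # then move p[s, a] toward the projected target distribution.
    target = np.zeros(N_ATOMS)
    for i, z in enumerate(support):
        tz = np.clip(r + GAMMA * z, V_MIN, V_MAX)        # shifted atom
        b = (tz - V_MIN) / delta_z                       # fractional index
        lo, hi = int(np.floor(b)), int(np.ceil(b))
        if lo == hi:
            target[lo] += p[s_next, a_next, i]
        else:
            target[lo] += p[s_next, a_next, i] * (hi - b)
            target[hi] += p[s_next, a_next, i] * (b - lo)
    p[s, a] = (1 - ALPHA) * p[s, a] + ALPHA * target     # mixture update

Because the full return distribution is learned, a risk-sensitive agent could act on statistics of p[s, a] other than its mean (for example a lower quantile), which is one way the tail risk discussed in the paper can be taken into account.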

Prerequisites

See requirements.txt.


Results replication


To reference this repository

@misc{distributionalRLTailRisk,
  author = {Peter Bossaerts and Shijie Huang and Nitin Yadav},
  title = {Exploiting Distributional Temporal Difference Learning To Deal With Tail Risk},
  year = {2020}
}

Key references

Outliers and Leptokurtosis:

Statistics:

  • Casella, G., & Berger, R. L. (2002). Statistical inference (Vol. 2). Pacific Grove, CA: Duxbury.
  • Schervish, M. J. (2012). Theory of statistics. Springer Science & Business Media.

(Distributional) Reinforcement Learning:

Primary:

Others:
