This repository contains both the environment and the agents of the leptokurtosis project.
For a detailed description of the experiment, please refer to our paper.
- Leptokurtosis
- Tabular SARSA
- Categorical Temporal Difference Learning (a tabular version of Categorical distributional reinforcement learning)
- (Efficient) Distributional Temporal Difference Learning:
- Integration over reward distribution (sample average) method.
- Maximum Likelihood Estimator (EM-MLE) method.
see requirements.txt
- To replicate the simulation results, run
stats_and_plots/run_game_100_times.py
(warning: takes time) - Archived simulation results (in json format) are available in a zip file.
- The performance statistics can be calculated by
stats_and_plot/analysis.py
- To generate the data for plots in the paper, run
stats_and_plots/figures.py
- To see all the essential meta-parameters used in the paper, please refer to
utils/config.py
@misc{distributionalRLTailRisk,
author = {Peter Bossaerts, Shijie Huang and Nitin Yadav},
title = {Exploiting Distributional Temporal Difference Learning To Deal With Tail Risk},
year = {2020}
}
- Neural Mechanisms Behind Identification of Leptokurtic Noise and Adaptive Behavioral Response
- Supplementary Material: behavioural results and reinforcement learning
- Casella, G., & Berger, R. L. (2002). Statistical inference (Vol. 2). Pacific Grove, CA: Duxbury.
- Schervish, M. J. (2012). Theory of statistics. Springer Science & Business Media.