MeanQ Code base for paper: Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks Many implementations details of this project are adapted from SUNRISE https://github.com/pokaxpoka/sunrise. Thanks Kimin!