Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs.
dng-mctsis the code release of DNG-MCTS in the paper
- "Bayesian Mixture Modelling and Inference based Thompson Sampling in Monte-Carlo Tree Search", by Aijun Bai, Feng Wu, and Xiaoping Chen, Advances in Neural Information Processing Systems 26 (NIPS), Lake Tahoe, Nevada, United States, December 2013.
d2ng-pomcpis the code release of D2NG-POMCP in the paper
- "Thompson Sampling based Monte-Carlo Planning in POMDPs", Aijun Bai, Feng Wu, Zongzhang Zhang, and Xiaoping Chen, Proceedings of the 24th International Conference on Automated Planning and Scheduling (ICAPS), Portsmouth, New Hampshire, United States, June 2014.