Replicating DeepMind StarCraft II Reinforcement Learning Benchmark with Actor-Critic Methods
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
chapters
figures
README.md
bibliography.bib
main.tex
ring_roman_bsc_2018.pdf

README.md

Bachelor’s thesis

  • Title: Replicating DeepMind StarCraft II Reinforcement Learning Benchmark with Actor-Critic Methods
  • Author: Roman Ring
  • Submission date: 8th May, 2018
  • Supervisors: Ilya Kuzovkin, Tambet Matiisen
  • Institution: Institute of Mathematics and Statistics, Faculty of Science and Technology, University of Tartu
  • Abstract: Reinforcement Learning (RL) is a subfield of Artificial Intelligence (AI) that deals with agents navigating in an environment with the goal of maximizing total reward. Games are good environments to test RL algorithms as they have simple rules and clear reward signals. Theoretical part of this thesis explores some of the popular classical and modern RL approaches, which include the use of Artificial Neural Network (ANN) as a function approximator inside AI agent. In practical part of the thesis we implement Advantage Actor-Critic RL algorithm and replicate ANN based agent described in [Vinyals et al., 2017]. We reproduce the state-of-the-art results in a modern video game StarCraft II, a game that is considered the next milestone in AI after the fall of chess and Go.
  • Full text: Digital version
  • Implemented agent: https://github.com/inoryy/pysc2-rl-agent

[Vinyals et al., 2017] Vinyals, O., Ewalds, T., Bartunov, S., Georgiev, P., Sasha Vezhnevets, A., Yeo, M., Makhzani, A., Küttler, H., Agapiou, J., Schrittwieser, J., Quan, J., Gaffney, S., Petersen, S., Simonyan, K., Schaul, T., van Hasselt, H., Silver, D., Lillicrap, T., Calderone, K., Keet, P., Brunasso, A., Lawrence, D., Ekermo, A., Repp, J., and Tsing, R. (2017). StarCraft II: A New Challenge for Reinforcement Learning.