Skip to content

inoryy/bsc-thesis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Bachelor’s thesis

  • Title: Replicating DeepMind StarCraft II Reinforcement Learning Benchmark with Actor-Critic Methods
  • Author: Roman Ring
  • Submission date: 8th May, 2018
  • Supervisors: Ilya Kuzovkin, Tambet Matiisen
  • Institution: Institute of Mathematics and Statistics, Faculty of Science and Technology, University of Tartu
  • Abstract: Reinforcement Learning (RL) is a subfield of Artificial Intelligence (AI) that deals with agents navigating in an environment with the goal of maximizing total reward. Games are good environments to test RL algorithms as they have simple rules and clear reward signals. Theoretical part of this thesis explores some of the popular classical and modern RL approaches, which include the use of Artificial Neural Network (ANN) as a function approximator inside AI agent. In practical part of the thesis we implement Advantage Actor-Critic RL algorithm and replicate ANN based agent described in [Vinyals et al., 2017]. We reproduce the state-of-the-art results in a modern video game StarCraft II, a game that is considered the next milestone in AI after the fall of chess and Go.
  • Full text: Digital version
  • Implemented agent: https://github.com/inoryy/pysc2-rl-agent

[Vinyals et al., 2017] Vinyals, O., Ewalds, T., Bartunov, S., Georgiev, P., Sasha Vezhnevets, A., Yeo, M., Makhzani, A., Küttler, H., Agapiou, J., Schrittwieser, J., Quan, J., Gaffney, S., Petersen, S., Simonyan, K., Schaul, T., van Hasselt, H., Silver, D., Lillicrap, T., Calderone, K., Keet, P., Brunasso, A., Lawrence, D., Ekermo, A., Repp, J., and Tsing, R. (2017). StarCraft II: A New Challenge for Reinforcement Learning.

About

Replicating DeepMind StarCraft II Reinforcement Learning Benchmark with Actor-Critic Methods

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages