AlphaZero

Code for the project for the course "Reinforcement Learning" taught in 2020/2021 at École polytechnique. Implementation of AlphaZero [1].

The environment for Tic-Tac-Toe is the intellectual property of Kim Jeong Ju and was made available at the following repository:

https://github.com/haje01/gym-tictactoe

## Usage

Install the requirements using the requirements.txt file.

pip install -r requirements.txt
A demonstration of the algorithm for the game of Tic-Tac-Toe can be ran with the script demo.py.

python demo.py

Reference

[1] David Silver, Thomas Hubert, Julian Schrittwieser, Ioannis Antonoglou, Matthew Lai, Arthur Guez, Marc Lanctot, Laurent Sifre, Dharshan Kumaran, Thore Graepel, et al. Mastering chess and shogi by self-play with a general reinforcement learning algorithm. arXiv preprint arXiv:1712.01815, 2017.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
RLcode		RLcode
gym-tictactoe		gym-tictactoe
README.md		README.md
demo.py		demo.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AlphaZero

Reference

About

Uh oh!

Releases

Packages

Languages

lcsdn/RL-alphazero

Folders and files

Latest commit

History

Repository files navigation

AlphaZero

Reference

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages