AlphaGOZero (python tensorflow implementation)

This is a trial implementation of DeepMind's Oct19th publication: Mastering the Game of Go without Human Knowledge.

DeepMind release AlphaZero Teaching Go. It's a lot of fun!

From Paper

Pure RL has outperformed supervised learning+RL agent

SL evaluation

Download trained model

https://drive.google.com/drive/folders/1Xs8Ly3wjMmXjH2agrz25Zv2e5-yqQKaP?usp=sharing
Place under ./savedmodels/large20/

Set up

Install requirement

python 3.6 tensorflow/tensorflow-gpu (version 1.4, version >= 1.5 can't load trained models)

pip install -r requirement.txt

Download Dataset (kgs 4dan)

Under repo's root dir

cd data/download
chmod +x download.sh
./download.sh

Preprocess Data

It is only an example, feel free to assign your local dataset directory

python preprocess.py preprocess ./data/SGFs/kgs-*

Train A Model

python main.py --mode=train

Play Against An A.I.

python main.py --mode=gtp —-gtp_poliy=greedypolicy --model_path='./savedmodels/your_model.ckpt'

Play in Sabaki

In console:

which python

add result to the headline of main.py with #! prefix.

Add the path of main.py to Sabaki's manage Engine with argument --mode=gtp

TODO:

Credit (orderless):

*Brain Lee *Ritchie Ng *Samuel Graván *森下健 *yuanfengpang

Name		Name	Last commit message	Last commit date
Latest commit History 123 Commits
data		data
elo		elo
figure		figure
model		model
processed_data		processed_data
savedmodels		savedmodels
support		support
utils		utils
AlphaGo_Zero_mynotes.pdf		AlphaGo_Zero_mynotes.pdf
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
Network.py		Network.py
README.md		README.md
__init__.py		__init__.py
auto_restart.sh		auto_restart.sh
config.py		config.py
main.py		main.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt

License

yhyu13/AlphaGOZero-python-tensorflow

Folders and files

Latest commit

History

Repository files navigation

AlphaGOZero (python tensorflow implementation)

From Paper

SL evaluation

Download trained model

Set up

Install requirement

Download Dataset (kgs 4dan)

Preprocess Data

Train A Model

Play Against An A.I.

Play in Sabaki

TODO:

Credit (orderless):

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Languages