AlphaGOZero (python tensorflow implementation)

This is a trial implementation of DeepMind's Oct19th publication: Mastering the Game of Go without Human Knowledge.

DeepMind release AlphaZero Teaching Go. It's a lot of fun!

From Paper

Pure RL has outperformed supervised learning+RL agent

Nov20 SL evaluation

Download trained model

https://drive.google.com/drive/folders/1Xs8Ly3wjMmXjH2agrz25Zv2e5-yqQKaP?usp=sharing
Place under ./savedmodels/large20/

Set up

Install requirement

python 3.6 tensorflow/tensorflow-gpu

pip install -r requirement.txt

Download Dataset (kgs 4dan)

Under repo's root dir

cd data/download
chmod +x download.sh
./download.sh

Preprocess Data

It is only an example, feel free to assign your local dataset directory

python preprocess.py preprocess ./data/SGFs/kgs-*

Train A Model

python main.py --mode=train

Play Against An A.I. (currently only random A.I. is available)

python main.py --mode=gtp —-policy=randompolicy --model_path='./savedmodels/model--0.0.ckpt'

Play in Sabaki

In console:

which python

add result to the headline of main.py with #! prefix.

Add the path of main.py to Sabaki's manage Engine with argument --mode=gtp

TODO:

Credit (orderless):

*Brain Lee *Ritchie Ng *Samuel Graván *森下健 *yuanfengpang

Name		Name	Last commit message	Last commit date
Latest commit History 117 Commits
data		data
elo		elo
figure		figure
model		model
processed_data		processed_data
savedmodels		savedmodels
support		support
utils		utils
AlphaGo_Zero_mynotes.pdf		AlphaGo_Zero_mynotes.pdf
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
Network.py		Network.py
README.md		README.md
__init__.py		__init__.py
auto_restart.sh		auto_restart.sh
config.py		config.py
main.py		main.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt

License

chengstone/AlphaGOZero-python-tensorflow

Folders and files

Latest commit

History

Repository files navigation

AlphaGOZero (python tensorflow implementation)

From Paper

Nov20 SL evaluation

Download trained model

Set up

Install requirement

Download Dataset (kgs 4dan)

Preprocess Data

Train A Model

Play Against An A.I. (currently only random A.I. is available)

Play in Sabaki

TODO:

Credit (orderless):

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Languages