codingame-othello

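An AI for CodinGame's Othello task.

Building the CodinGame submission file

This assumes a trained SavedModel exists at model/debug_ch8_d7/sm_26. TVM is used to compile the model's inference code, including the weights, into a single model.a file, which is then linked with the search component.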

ln -sf $(pwd)/model/debug_ch8_d7/sm_26 model/savedmodel
make

build/codingame.py is the build artifact.

Legal move check

Data generation

At a commit known to work correctly:

./build/make_legal_move_test_data > dataset/legal_move_dataset.txt

Check

At the commit you want to verify:

./build/legal_move_test < dataset/legal_move_dataset.txt
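The check above compares the current build against data recorded from a known-good commit. As an independent cross-check, a slow reference implementation of Othello's legal-move rule can be sketched as follows. The board encoding (a 64-character row-major string with 'X', 'O', and '.') and the function names are assumptions made for this illustration; they are not the format of legal_move_dataset.txt or the repository's C++ code.

```python
# Minimal reference sketch of Othello legal-move generation.
# Assumed encoding (hypothetical): 64-char string, row-major,
# 'X' = black, 'O' = white, '.' = empty.
DIRECTIONS = [(-1, -1), (-1, 0), (-1, 1), (0, -1), (0, 1), (1, -1), (1, 0), (1, 1)]


def flips(board, player, row, col):
    """Return indices of opponent stones flipped by playing at (row, col)."""
    if board[row * 8 + col] != ".":
        return []
    opponent = "O" if player == "X" else "X"
    flipped = []
    for dr, dc in DIRECTIONS:
        line = []
        r, c = row + dr, col + dc
        # Walk over a contiguous run of opponent stones.
        while 0 <= r < 8 and 0 <= c < 8 and board[r * 8 + c] == opponent:
            line.append(r * 8 + c)
            r, c = r + dr, c + dc
        # The run only counts if it is closed off by one of our own stones.
        if line and 0 <= r < 8 and 0 <= c < 8 and board[r * 8 + c] == player:
            flipped.extend(line)
    return flipped


def legal_moves(board, player):
    """Return the sorted list of square indices where `player` may move."""
    return [i for i in range(64) if flips(board, player, i // 8, i % 8)]


# Standard starting position: white on d4/e5, black on e4/d5.
INITIAL = "." * 27 + "OX" + "." * 6 + "XO" + "." * 27

if __name__ == "__main__":
    print(legal_moves(INITIAL, "X"))
```

From the starting position, black's legal squares are d3, c4, f5, and e6, which correspond to indices 19, 26, 37, and 44 in this encoding.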

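Supervised learning

Training data generation

The data is generated by a simple AI that needs no training: alpha-beta search with the stone count as the evaluation function.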

mkdir -p dataset/alphabeta_train_1
./build/generate_training_data_1 dataset/alphabeta_train_1/raw_game_train.bin 10000
python -m othello_train.shuffle_train_data dataset/alphabeta_train_1/raw_game_train.bin dataset/alphabeta_train_1/train_shuffled.bin
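The simple AI used for data generation combines alpha-beta search with a stone-count evaluation. The following is a minimal Python sketch of that idea in negamax form, not the repository's C++ implementation. The board representation (a list of 64 integers with 1 for black, -1 for white, 0 for empty) and all function names are assumptions chosen for this illustration.

```python
# Sketch: negamax alpha-beta search with stone difference as the evaluation.
# Assumed board (hypothetical): list of 64 ints, 1 = black, -1 = white, 0 = empty.
DIRECTIONS = [(-1, -1), (-1, 0), (-1, 1), (0, -1), (0, 1), (1, -1), (1, 0), (1, 1)]


def flips(board, player, pos):
    """Indices flipped if `player` plays at `pos`; empty list if illegal."""
    if board[pos] != 0:
        return []
    row, col = divmod(pos, 8)
    flipped = []
    for dr, dc in DIRECTIONS:
        line = []
        r, c = row + dr, col + dc
        while 0 <= r < 8 and 0 <= c < 8 and board[r * 8 + c] == -player:
            line.append(r * 8 + c)
            r, c = r + dr, c + dc
        if line and 0 <= r < 8 and 0 <= c < 8 and board[r * 8 + c] == player:
            flipped.extend(line)
    return flipped


def play(board, player, pos):
    """Return a new board after `player` moves at `pos`."""
    new_board = list(board)
    new_board[pos] = player
    for i in flips(board, player, pos):
        new_board[i] = player
    return new_board


def stone_difference(board, player):
    """Own stones minus opponent stones, from `player`'s point of view."""
    return sum(board) * player


def alphabeta(board, player, depth, alpha=-65, beta=65, passed=False):
    """Return (score, best_move) from `player`'s point of view."""
    if depth == 0:
        return stone_difference(board, player), None
    moves = [p for p in range(64) if flips(board, player, p)]
    if not moves:
        if passed:  # both sides have no move: the game is over
            return stone_difference(board, player), None
        score, _ = alphabeta(board, -player, depth - 1, -beta, -alpha, True)
        return -score, None
    best_move = None
    for move in moves:
        score, _ = alphabeta(play(board, player, move), -player,
                             depth - 1, -beta, -alpha)
        score = -score
        if score > alpha:
            alpha, best_move = score, move
        if alpha >= beta:  # the opponent will avoid this line: prune
            break
    return alpha, best_move


def initial_board():
    board = [0] * 64
    board[27] = board[36] = -1  # white on d4, e5
    board[28] = board[35] = 1   # black on e4, d5
    return board


if __name__ == "__main__":
    print(alphabeta(initial_board(), 1, depth=4))
```

The default window of (-65, 65) lies strictly outside the possible score range of -64 to 64, so the root call always returns a move when one exists.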

Training

mkdir -p model
python -m othello_train.supervised_train_v1

Self-play

./build/random_match

The opponents are hard-coded in main_random_match.cpp.

Engines that use a DNN model require an evaluation server to be running beforehand:

python -m othello_train.eval_server_v1 model/alphabeta_supervised_model_v1

Reinforcement learning

This runs a loop: generate game records with the latest model, then use those records to update the model.

python othello_train/rl_loop.py model/debug
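The shape of such a generate-then-update loop can be sketched as below. This is an illustration only and does not reproduce othello_train/rl_loop.py: the two callbacks, the records_<epoch>.bin file names, and the assumption that an initial sm_0 model exists are all hypothetical. Only the sm_<epoch> naming follows this README.

```python
from pathlib import Path


def rl_loop(model_dir, epochs, generate_records, update_model):
    """Alternate between game-record generation and model updates.

    generate_records(model_path, records_path) and
    update_model(model_path, records_path, next_model_path) are
    hypothetical callbacks supplied by the caller.
    """
    model_dir = Path(model_dir)
    current = model_dir / "sm_0"  # assumed initial model
    for epoch in range(epochs):
        records = model_dir / f"records_{epoch}.bin"  # hypothetical file name
        # Step 1: play games with the latest model and store the records.
        generate_records(current, records)
        # Step 2: train on those records to produce the next model.
        next_model = model_dir / f"sm_{epoch + 1}"
        update_model(current, records, next_model)
        current = next_model
    return current


if __name__ == "__main__":
    # Dry run with callbacks that only print what they would do.
    rl_loop(
        "model/debug",
        2,
        lambda model, records: print("generate", model, "->", records),
        lambda model, records, out: print("update", model, "+", records, "->", out),
    )
```

Passing the two steps in as callbacks keeps the sketch independent of how games are actually played or how training is implemented.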

Self-play

Start the evaluation server:

python -m othello_train.eval_server_v1 model/debug/sm_9

The number after sm_ is the epoch count; a larger number means training has progressed further.

Run the match in another shell:
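Because the sm_ suffix is a number, sorting directory names as strings would place sm_9 after sm_10. A small helper that picks the most-trained model by numeric epoch might look like this. The function name latest_savedmodel is hypothetical and not part of the repository.

```python
import re
from pathlib import Path


def latest_savedmodel(model_dir):
    """Return the sm_<epoch> directory with the largest epoch, or None."""
    best_path = None
    best_epoch = -1
    for path in Path(model_dir).iterdir():
        match = re.fullmatch(r"sm_(\d+)", path.name)
        if match is None or not path.is_dir():
            continue
        epoch = int(match.group(1))  # compare numerically, not as text
        if epoch > best_epoch:
            best_epoch, best_path = epoch, path
    return best_path


if __name__ == "__main__":
    import sys

    print(latest_savedmodel(sys.argv[1] if len(sys.argv) > 1 else "model/debug"))
```

The printed path could then be passed to the evaluation server command in place of a hand-picked sm_ directory.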

./build/random_match

The opponents are hard-coded in main_random_match.cpp.

For production matches

make
