mcts-reversi

Reversi (Othello) AI experiments in C++.

Implemented algorithms:

Minimax / alpha-beta pruning
UCB1 (bandit algorithm)
UCT (upper confidence bounds applied to trees, a Monte Carlo tree search algorithm)
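
As an illustration of the first entry, here is a minimal sketch of minimax with alpha-beta pruning on a toy game tree. The `Node`, `Stats`, `alphabeta`, and `example_tree` names are hypothetical and are not taken from this repository; the real search works on Reversi boards rather than a hand-built tree.

```cpp
#include <algorithm>
#include <cassert>
#include <limits>
#include <utility>
#include <vector>

// Hypothetical toy tree: a node with no children is a leaf holding a static value.
struct Node {
    int value = 0;               // only meaningful for leaves in this sketch
    std::vector<Node> children;  // empty for leaves
};

// Counts evaluated leaves so the effect of pruning can be observed.
struct Stats {
    int leaves_evaluated = 0;
};

int alphabeta(const Node& node, int depth, int alpha, int beta,
              bool maximizing, Stats& stats) {
    assert(depth >= 0);
    if (depth == 0 || node.children.empty()) {
        ++stats.leaves_evaluated;
        return node.value;  // a real engine would call a board heuristic here
    }
    if (maximizing) {
        int best = std::numeric_limits<int>::min();
        for (const Node& child : node.children) {
            best = std::max(best, alphabeta(child, depth - 1, alpha, beta, false, stats));
            alpha = std::max(alpha, best);
            if (alpha >= beta) break;  // the minimizer already has a better option elsewhere
        }
        return best;
    }
    int best = std::numeric_limits<int>::max();
    for (const Node& child : node.children) {
        best = std::min(best, alphabeta(child, depth - 1, alpha, beta, true, stats));
        beta = std::min(beta, best);
        if (alpha >= beta) break;  // the maximizer already has a better option elsewhere
    }
    return best;
}

Node leaf(int v) {
    Node n;
    n.value = v;
    return n;
}

Node inner(std::vector<Node> children) {
    Node n;
    n.children = std::move(children);
    return n;
}

// Two-ply example: a maximizing root over three minimizing children.
Node example_tree() {
    return inner({inner({leaf(3), leaf(12), leaf(8)}),
                  inner({leaf(2), leaf(4), leaf(6)}),
                  inner({leaf(14), leaf(5), leaf(2)})});
}
```

Searching `example_tree()` to depth 2 from a maximizing root gives the same value as plain minimax. The second subtree is cut off after its first leaf, because that leaf is already worse for the maximizer than the first subtree's result.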

Tournament

Inspired by tom7's chess algorithm adventures (and as a sanity check on my implementations), I run the strategies against each other in a head-to-head tournament to evaluate them.

Setup

Each strategy plays 100 games against every other strategy as black, and again as white. Win rates for black are plotted below.
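
The pairing scheme described above can be sketched as follows. This is an illustrative helper only, not the repository's actual tournament code; `Pairing`, `schedule`, and `total_games` are assumed names.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// One entry per ordered pair: `black` moves first against `white`.
struct Pairing {
    std::string black;
    std::string white;
    int games;
};

// Every strategy meets every other strategy once as black and once as white,
// so k strategies yield k * (k - 1) ordered pairings.
std::vector<Pairing> schedule(const std::vector<std::string>& names,
                              int games_per_pairing) {
    assert(games_per_pairing > 0);
    std::vector<Pairing> out;
    for (std::size_t b = 0; b < names.size(); ++b) {
        for (std::size_t w = 0; w < names.size(); ++w) {
            if (b != w) {  // a strategy does not play itself
                out.push_back({names[b], names[w], games_per_pairing});
            }
        }
    }
    return out;
}

// Total number of games across the whole schedule.
int total_games(const std::vector<Pairing>& pairings) {
    int total = 0;
    for (const Pairing& p : pairings) total += p.games;
    return total;
}
```

With three strategies and 100 games per pairing, this yields six ordered pairings and 600 games in total.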

Players

Random: chooses a random valid move.
Greedy: chooses the move which will convert the most pieces.
Generous: chooses the move which will convert the fewest pieces.
Uniform sampling (n): for each valid move, plays n random games and chooses the move resulting in the most wins.
UCB1 (n): plays a total of n × (number of valid moves) random games, but distributes them over the valid moves using the UCB1 bandit algorithm (i.e. each valid move is an arm of a multi-armed bandit).
UCT (n): implements the upper confidence bounds applied to trees (UCT) algorithm. Simulates a total of n × (number of valid moves) games.
MiniMax (d): deterministic tree search using the minimax algorithm with alpha-beta pruning. Evaluates the game tree to depth d at each step. Leaves are valued by counting the number of pieces on the board.
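
To make the difference between uniform sampling and UCB1 concrete, here is a minimal sketch of UCB1 arm selection using the textbook formula (mean reward plus sqrt(2 ln N / n_i)). The `Arm`, `ucb1_score`, and `select_arm` names are hypothetical, and the exploration constant is an assumption; the repository's implementation may differ.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <limits>
#include <vector>

// Statistics for one candidate move (one "arm" of the bandit).
struct Arm {
    int plays = 0;      // number of random games started with this move
    double wins = 0.0;  // number of those games that were won
};

// Textbook UCB1 score: observed win rate plus an exploration bonus that
// shrinks as the arm is played more often.
double ucb1_score(const Arm& arm, int total_plays) {
    if (arm.plays == 0) {
        return std::numeric_limits<double>::infinity();  // try every arm once first
    }
    double mean = arm.wins / arm.plays;
    double bonus = std::sqrt(2.0 * std::log(static_cast<double>(total_plays)) / arm.plays);
    return mean + bonus;
}

// Returns the index of the arm to simulate next (first arm wins ties).
std::size_t select_arm(const std::vector<Arm>& arms) {
    assert(!arms.empty());
    int total_plays = 0;
    for (const Arm& a : arms) total_plays += a.plays;

    std::size_t best = 0;
    double best_score = -std::numeric_limits<double>::infinity();
    for (std::size_t i = 0; i < arms.size(); ++i) {
        double score = ucb1_score(arms[i], total_plays);
        if (score > best_score) {
            best_score = score;
            best = i;
        }
    }
    return best;
}
```

Uniform sampling would give every move the same n games. With this rule, moves that look promising receive more simulations, while rarely tried moves are still revisited because their bonus term stays large. UCT applies the same idea at every node of the search tree rather than only at the root.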

Results

We see a logical progression in win rates with the complexity of the stochastic methods, from uniform sampling to UCB1 to UCT.

Strangely, the worst strategy (generous) managed one win in 100 games against the best (UCT 100).

Note: MiniMax, Greedy, and Generous are fully deterministic, so results between them reflect only one played game.
