Abalone-AI

Altin Rexhepaj, 2018

Abstract

Abalone-AI is an intelligent game-playing agent for the strategic board game Abalone. I discuss the implementation of the game-playing agent using several artificial intelligence algorithms.

Complexity

Abalone's game tree complexity falls between that of Chess and Go.

Game	Board size	State complexity (log₁₀)	Game tree complexity (log₁₀)	Average plies
Checkers	32	21	31	70
Chess	64	46	123	80
Abalone	61	23	154	87
Go	361	172	360	150

A game tree is a tree whose nodes are positions in a game and whose branches (edges) are moves. The complete game tree for a game is the game tree starting at the initial position and containing all possible moves from each position. The number of leaf nodes in the complete game tree represents the number of possible ways to play the game.

The branching factor is the number of children of each node. In Abalone, the average branching factor is 60.

A ply is a half-move: one turn taken by one player. After 20 full moves of an Abalone game, for example, 40 plies have been completed. An average Abalone game lasts 87 plies, so the game tree complexity can be estimated by raising the game's average branching factor to the power of the number of plies in an average game:

60⁸⁷ ≈ 5.0 × 10¹⁵⁴
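
The estimate can be checked in a few lines of Python. This is only a sanity check of the arithmetic; the constant names are mine and do not come from the repository.

```python
import math

AVERAGE_BRANCHING_FACTOR = 60  # from the text above
AVERAGE_PLIES = 87             # from the table above

# log10(b^d) = d * log10(b); working in log space avoids handling a 155-digit integer.
log10_complexity = AVERAGE_PLIES * math.log10(AVERAGE_BRANCHING_FACTOR)

# Split into mantissa and exponent for scientific notation.
exponent = math.floor(log10_complexity)
mantissa = 10 ** (log10_complexity - exponent)

print(f"60^87 ≈ {mantissa:.1f} × 10^{exponent}")
```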

Artificial intelligence algorithms

In total, I have implemented six adversarial search algorithms that are commonly used in zero-sum games like Abalone.

1. Minimax
2. Alpha-Beta Pruning
3. Alpha-Beta Pruning with Transposition Table Optimization
4. Principal Variation Search
5. Principal Variation Search with Transposition Table Optimization
6. Monte-Carlo Tree Search
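
To give a feel for how the pruning variants differ from plain Minimax, here is a minimal, generic sketch of Alpha-Beta Pruning on a toy tree. It is not the code from this repository: the `children` and `evaluate` callbacks and the nested-list tree are placeholders chosen for illustration.

```python
import math

def alphabeta(node, depth, alpha, beta, maximizing, children, evaluate):
    """Return the minimax value of `node`, skipping branches that cannot matter."""
    kids = children(node)
    if depth == 0 or not kids:
        return evaluate(node)
    if maximizing:
        value = -math.inf
        for child in kids:
            value = max(value, alphabeta(child, depth - 1, alpha, beta,
                                         False, children, evaluate))
            alpha = max(alpha, value)
            if alpha >= beta:
                break  # the minimizer already has a better option elsewhere
        return value
    value = math.inf
    for child in kids:
        value = min(value, alphabeta(child, depth - 1, alpha, beta,
                                     True, children, evaluate))
        beta = min(beta, value)
        if beta <= alpha:
            break  # the maximizer already has a better option elsewhere
    return value

# Toy game tree: inner nodes are lists, leaves are static scores.
def toy_children(node):
    return node if isinstance(node, list) else []

def toy_evaluate(node):
    return node

tree = [[3, 5], [2, 9], [0, 1]]
best = alphabeta(tree, 2, -math.inf, math.inf, True, toy_children, toy_evaluate)
print(best)  # 3
```

In the second and third subtrees the search stops after the first leaf (2 and 0), because neither can beat the 3 already guaranteed by the first subtree.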

State representation

Unlike many board games, Abalone uses a hexagonal grid, which cannot be represented as conveniently with the Cartesian or matrix coordinates used for square boards.

Special thanks to https://github.com/umazalakain for her solid implementation of the game logic and APIs. Her implementation served as the foundation for the game agent.

The coordinates are read in terms of the diagonal columns (x), and the horizontal rows (z). The columns increase diagonally, while the rows increase vertically. The coordinates of a single marble are expressed as a named tuple with two entries:

Hex(x=−3, z=1)

To move a marble, a direction tuple in the same format is added to its coordinate, which shifts the coordinate in that direction:

Hex(−3, 1) + Direction(1, 0) = Hex(−2, 1)
Single marble moving in direction (1,0)
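
A minimal sketch of such a coordinate type is shown below. A plain named tuple would concatenate on `+`, so this sketch assumes the type overrides addition to work component-wise; the actual class in the repository may be defined differently.

```python
from collections import namedtuple

class Hex(namedtuple("Hex", ["x", "z"])):
    """Coordinate on the hexagonal grid (illustrative, not the repository's class)."""
    __slots__ = ()

    def __add__(self, other):
        # Component-wise addition instead of the default tuple concatenation.
        return Hex(self.x + other[0], self.z + other[1])

marble = Hex(x=-3, z=1)
direction = Hex(1, 0)

moved = marble + direction
print(moved)  # Hex(x=-2, z=1)
```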

Similarly, groups of marbles (those which are adjacent to each other and can be moved together) are represented the same way, except as a list:

True: [Hex(0,0),Hex(0,1),Hex(0,2)], where True = white piece
False: [Hex(1,0),Hex(1,1),Hex(1,2)], where False = black piece

Finally, for an entire state representation of the board, we add each group of marbles to a hash table:

{
True: [Hex(0,0),Hex(0,1),Hex(0,2)],
False: [Hex(1,0),Hex(1,1),Hex(1,2)]
}
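
As a rough illustration, the same structure can be written as a Python dictionary keyed by player. The helper `owner_at` is hypothetical and only shows how the structure can be queried; the `WHITE = True` and `BLACK = False` values follow the defaults described under Configuring the game agent.

```python
from collections import namedtuple

Hex = namedtuple("Hex", ["x", "z"])
WHITE, BLACK = True, False

board = {
    WHITE: [Hex(0, 0), Hex(0, 1), Hex(0, 2)],
    BLACK: [Hex(1, 0), Hex(1, 1), Hex(1, 2)],
}

def owner_at(board, position):
    """Return the player whose marble occupies `position`, or None if it is empty."""
    for player, marbles in board.items():
        if position in marbles:
            return player
    return None

print(owner_at(board, Hex(1, 1)))  # False (black)
print(owner_at(board, Hex(3, 3)))  # None (empty cell)
```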

Heuristics and evaluation function

The heuristics used by some of the algorithms are outlined below:

Center proximity (h₁)

Abalone rewards defensive play. The center of the board is the strongest position because it forces the opponent towards the edges, where marbles are most vulnerable. The distance from each marble to the center of the grid is calculated for both players, and the difference between the two is taken (maximizer favours low h₁).
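
One way to compute such a score is sketched below. It assumes the (x, z) pairs are axial hex coordinates with the center at (0, 0), and it averages the distances per player before taking the difference; both are my assumptions, and the function names are illustrative rather than taken from the repository.

```python
def distance_to_center(x, z):
    """Hex-grid distance from (x, z) to (0, 0), assuming axial coordinates."""
    return (abs(x) + abs(z) + abs(x + z)) // 2

def center_proximity(own_marbles, opponent_marbles):
    """Difference of mean center distances; lower is better for the owner."""
    own = sum(distance_to_center(x, z) for x, z in own_marbles) / len(own_marbles)
    opp = sum(distance_to_center(x, z) for x, z in opponent_marbles) / len(opponent_marbles)
    return own - opp

print(distance_to_center(-3, 1))                               # 3
print(center_proximity([(0, 0), (1, 0)], [(2, 0), (-3, 1)]))   # -2.0
```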

Cohesion (h₂)

Marbles that are adjacent to one another form a population. More cohesion means fewer populations. Ideally, a player keeps their marbles together in as few populations as possible. Cohesion is measured by the distance between all marbles (maximizer favours low h₂).
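
To make the notion of a population concrete, the sketch below counts populations with a flood fill over adjacent cells. It illustrates the definition only; it is not the distance-based measure described above, and the neighbour offsets assume axial hex coordinates.

```python
# The six neighbours of a cell in axial hex coordinates (assumed layout).
NEIGHBOUR_OFFSETS = [(1, 0), (-1, 0), (0, 1), (0, -1), (1, -1), (-1, 1)]

def count_populations(marbles):
    """Count groups of mutually adjacent marbles (connected components)."""
    remaining = set(marbles)
    populations = 0
    while remaining:
        # Start a new population from any unvisited marble and flood outwards.
        stack = [remaining.pop()]
        populations += 1
        while stack:
            x, z = stack.pop()
            for dx, dz in NEIGHBOUR_OFFSETS:
                neighbour = (x + dx, z + dz)
                if neighbour in remaining:
                    remaining.remove(neighbour)
                    stack.append(neighbour)
    return populations

print(count_populations([(0, 0), (0, 1), (0, 2)]))  # 1
print(count_populations([(0, 0), (2, 0)]))          # 2
```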

Marbles on board (h₃)

The last heuristic is the difference in the number of marbles the two players have on the board (maximizer favours high h₃). Unlike the previous two, it is an offensive heuristic rather than a defensive one.

The evaluation function, then, is the sum of the scores given by all the heuristics: eval = h₁ + h₂ + h₃

Through trial and error, I found that h₂ should be used when the marbles are far from the center (|h₁| > 2) and that h₃ should be used when the marbles are near the center (|h₁| < 1.8). When h₃ is used, it is scaled by a factor of 100 so that attacking moves are favoured.
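
One possible reading of these rules is sketched below. It assumes that h₁, h₂ and h₃ have already been computed and signed so that a higher value is better for the maximizer, and that h₁ is always included; neither assumption is confirmed by the text, and the function name is illustrative.

```python
def evaluate(h1, h2, h3):
    """Combine the three heuristic scores using the thresholds from the text."""
    score = h1
    if abs(h1) > 2:        # marbles far from the center: add cohesion
        score += h2
    if abs(h1) < 1.8:      # marbles near the center: add the marble difference
        score += 100 * h3  # scaled so that attacking moves dominate
    return score

print(evaluate(2.5, 1.0, 1))  # 3.5   (cohesion included)
print(evaluate(1.0, 1.0, 1))  # 101.0 (marble difference included, scaled)
print(evaluate(1.9, 4.0, 1))  # 1.9   (between the thresholds: h1 only)
```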

Optimizations

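Move ordering and transposition tables are extremely important: move ordering lets the search examine the more promising paths first, and a transposition table avoids searching the same state more than once.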

Three in a row

The number of ways to have three marbles in a row is also a useful indicator of a strong position. This heuristic will be used for move ordering.

Push moves

States that lead to potentially more pushes reduce the number of moves it takes to win the game, so this heuristic is also well suited to move ordering.

Transposition table

The transposition table I used is a hash map whose keys are hashed versions of the state. Because the search performs many lookups to check whether a state is already in the table, a fast hashing function is needed to compare states. I used Zobrist hashing, a common hashing function for board games like Chess. It maps the state to a 64-bit signed integer; two different states can in principle share a hash, but such collisions are rare.
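
The general idea of Zobrist hashing can be sketched as follows: every (player, cell) pair gets a fixed random 64-bit number, and the hash of a state is the XOR of the numbers for all occupied cells. The names, the fixed seed, the radius of 4 and the use of unsigned values from `random.getrandbits` are assumptions made for this illustration, not details of the repository.

```python
import random

GRID_RADIUS = 4  # assumption: gives the standard 61-cell board
rng = random.Random(2018)  # fixed seed so hashes are reproducible

# All cells of a hexagonal board in axial coordinates.
CELLS = [
    (x, z)
    for x in range(-GRID_RADIUS, GRID_RADIUS + 1)
    for z in range(-GRID_RADIUS, GRID_RADIUS + 1)
    if abs(x + z) <= GRID_RADIUS
]

# One random 64-bit key per (player, cell) pair.
ZOBRIST = {
    (player, cell): rng.getrandbits(64)
    for player in (True, False)
    for cell in CELLS
}

def zobrist_hash(board):
    """XOR together the keys of every occupied cell."""
    h = 0
    for player, marbles in board.items():
        for cell in marbles:
            h ^= ZOBRIST[(player, tuple(cell))]
    return h

board = {True: [(0, 0), (0, 1)], False: [(1, 0)]}
h = zobrist_hash(board)

# Moving one marble only needs two XORs: remove the old cell, add the new one.
h_after_move = h ^ ZOBRIST[(True, (0, 1))] ^ ZOBRIST[(True, (0, 2))]
```

Because XOR is its own inverse, the hash can be updated incrementally after each move instead of being recomputed from scratch, which is what makes the scheme attractive for a transposition table.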

Results

Overall, Principal Variation Search with the Transposition Table Optimization performed best, while Minimax performed worst. Monte-Carlo Tree Search was not testable on my machine.

Running the code

Prerequisites

Python 3.8*

Generating a state file

Open dump.py, fill out the game state template, and run the program with the Python interpreter. The program will output a JSON file, which can be loaded in the main program and used either to generate the next best move or to simulate a game starting from that state.

Configuring the game agent

Open abalone/config.py and set the initial starting board by filling in values of your choosing (default is mini for a miniaturized version of the game for demo purposes)
The grid size can be changed using GRID_RADIUS
The number of marbles required to win can be changed using GAME_OVER
The maximum number of attacking marbles in a row can be changed using GROUP_LENGTHS (default is 3)
The states of the players can be changed as well using BLACK and WHITE (default: BLACK = False, WHITE = True)

Running the game agent

Install dependencies: python setup.py develop
Run: start.py and follow the command-line interface to either load a JSON state (see Generating a state file) or simulate a game of Abalone between two AI players
