Battleships AI

Introduction

This is a version of the game Battleships, in which two players each try to sink all of their opponent's ships first. Here, you play against an AI that chooses its moves by Monte Carlo simulation. The project was originally meant to build a neural reinforcement learning agent to play the game, but I have been unsuccessful in that goal so far. If anyone reading this has any ideas about how to go about such a thing, I would love to hear them!

Monte Carlo Simulation

Description

The basics of how it works are as follows:

Take current board state as input
Create copy of board state
Simulate a set number of samples (given by --monte_carlo_samples), each of which is a random placement of a remaining ship type
Stack all of the simulations and sum the total number of ships in each square, giving extra weight to ships that overlap existing hits
Take the mean for each square, giving us a frequency matrix, or heatmap
Pick the legal move with the largest value in the matrix
Repeat
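
The steps above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code in ai.py: the cell codes, the function names, the hit_weight value and the use of plain lists instead of numpy are all hypothetical choices made to keep the example self-contained.

```python
import random

# Hypothetical cell codes for this sketch (not taken from the project).
UNKNOWN, MISS, HIT = 0, 1, 2


def random_placement(board, length, rng, retries=100):
    """Return the cells of one random placement that avoids known misses, or None."""
    n = len(board)
    if length > n:
        return None
    for _ in range(retries):
        horizontal = rng.random() < 0.5
        row = rng.randrange(n if horizontal else n - length + 1)
        col = rng.randrange(n - length + 1 if horizontal else n)
        cells = [(row, col + i) if horizontal else (row + i, col)
                 for i in range(length)]
        if all(board[y][x] != MISS for y, x in cells):
            return cells
    return None


def heatmap(board, ship_sizes, samples, hit_weight=5.0, seed=None):
    """Average many random placements into a per-square frequency matrix."""
    rng = random.Random(seed)
    n = len(board)
    totals = [[0.0] * n for _ in range(n)]
    for _ in range(samples):
        cells = random_placement(board, rng.choice(ship_sizes), rng)
        if cells is None:
            continue
        # Assumed weighting: placements covering an existing hit count for more.
        overlaps_hit = any(board[y][x] == HIT for y, x in cells)
        weight = hit_weight if overlaps_hit else 1.0
        for y, x in cells:
            totals[y][x] += weight
    return [[value / samples for value in row] for row in totals]


def best_move(board, heat):
    """Pick the not-yet-targeted square with the largest heatmap value."""
    n = len(board)
    legal = [(y, x) for y in range(n) for x in range(n) if board[y][x] == UNKNOWN]
    return max(legal, key=lambda cell: heat[cell[0]][cell[1]])
```

In this sketch a square that is already a known miss always ends up with a value of zero, and squares that are already hits are excluded from the legal moves, so the argmax only ever lands on an untargeted square.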

Heatmap

There are two heatmap GIFs in this project (in the heatmap_gifs folder) that show the probability matrix for every square as a heatmap after each move in the game.

Prerequisites

Packages

Written entirely with the Python 3.6 standard library, with the exception of NumPy, which is required.

Instructions

The main.py file takes 3 arguments as follows:

--board_size: The size of the board, default: 10
--ship_sizes: Array of ship sizes to randomly place, default: 5,4,3,3,2
--monte_carlo_samples: The number of samples for the algorithm to simulate, default: 10000

If you have a slow computer, choose a lower number of samples, but generally 10,000 should give good results in decent time. Make sure not to put spaces between the integers in ship_sizes. Extremely large or small board sizes may cause unexpected behaviour. The safe range is generally 5-10, but with appropriate adjustments to the other parameters, sizes outwith this range will work fine.
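
As an illustration of why spaces in ship_sizes matter, here is a minimal sketch of how the three arguments could be parsed with argparse. It is an assumption about the approach rather than the contents of main.py; build_parser and parse_ship_sizes are hypothetical names.

```python
import argparse


def parse_ship_sizes(text):
    """Turn a string such as '5,4,3,3,2' into a list of ints."""
    return [int(part) for part in text.split(",")]


def build_parser():
    # Flag names and defaults are the ones listed above; the rest is assumed.
    parser = argparse.ArgumentParser(description="Battleships vs a Monte Carlo AI")
    parser.add_argument("--board_size", type=int, default=10)
    parser.add_argument("--ship_sizes", type=parse_ship_sizes,
                        default=[5, 4, 3, 3, 2])
    parser.add_argument("--monte_carlo_samples", type=int, default=10000)
    return parser


# Example invocation on a smaller board with fewer samples:
#   python main.py --board_size 8 --ship_sizes 4,3,2 --monte_carlo_samples 5000
```

With a parser like this, writing the sizes as `4, 3, 2` would make the shell pass `4,` as the value and `3` and `2` as separate, unrecognised arguments, which is why the list has to be given without spaces.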

Once run, the game will initialise two boards of ships randomly (choosing ship locations is not implemented currently), one for you and one for the computer. The key for square types is as follows:

Sea: ■
Hit: X
Miss: □
Destroyed: *
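
A small sketch of how this key could be applied when printing a board row. The state names and the render_row helper are hypothetical; only the four symbols come from the key above.

```python
# Symbols from the key above; the state names are assumptions for this sketch.
SYMBOLS = {
    "sea": "■",
    "hit": "X",
    "miss": "□",
    "destroyed": "*",
}


def render_row(states):
    """Render one board row as space-separated symbols."""
    return " ".join(SYMBOLS[state] for state in states)
```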

The player goes first and specifies a move by giving first a letter and then a number to determine the target square.
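
For illustration, a move such as B7 could be turned into board indices like this. The sketch assumes the letter picks the column and the number picks the row, counting from 1; the game may use the opposite convention, and parse_move is a hypothetical name.

```python
def parse_move(text, board_size=10):
    """Convert input like 'B7' into zero-based (row, col) indices.

    Assumes letter = column and number = row (1-based); raises ValueError
    for malformed input or squares outside the board.
    """
    text = text.strip().upper()
    if len(text) < 2 or not text[0].isalpha() or not text[1:].isdigit():
        raise ValueError("expected a letter followed by a number, e.g. B7")
    col = ord(text[0]) - ord("A")
    row = int(text[1:]) - 1
    if not (0 <= row < board_size and 0 <= col < board_size):
        raise ValueError("square is outside the board")
    return row, col
```

Under these assumptions, `parse_move("B7")` gives `(6, 1)`, and a square such as K1 is rejected on the default 10x10 board.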

License

This project is released under the MIT license, see LICENSE.md for more details.
