Game to demonstrate reinforcement learning
Latest commit c27d239 Jul 18, 2019


chomp

Chomp is a pencil-and-paper territorial game for two players, conventionally played on a 3x4 grid representing a delicious chocolate bar. Players take it in turns to bite off rectangular regions from the lower right corner, aiming for the other player to eat the last, poisoned square.
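The biting rule can be sketched in a few lines. This is an illustrative model and not the code in chomp.py: it assumes the bar is stored as a tuple holding the number of squares left in each row, and the name `bite` is hypothetical.

```python
ROWS, COLS = 3, 4  # the conventional 3x4 bar described above


def bite(board, row, col):
    """Return the board after biting at (row, col), both counted from 0.

    `board` is a tuple with the number of squares left in each row (an
    assumed representation). A bite removes the chosen square and every
    square below it and to its right.
    """
    if not 0 <= row < len(board) or not 0 <= col < board[row]:
        raise ValueError("that square is off the bar or already eaten")
    # Rows above the bite are untouched; rows at or below it are cut to `col`.
    return tuple(min(n, col) if r >= row else n for r, n in enumerate(board))


start = (COLS,) * ROWS      # full bar
after = bite(start, 1, 2)   # bite at 2nd row, 3rd column
```

Biting at `(0, 0)` eats the poisoned square and leaves an empty bar, which ends the game with a loss for the player who made that bite.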

GIF of the game of chomp

With 3 rows and 4 columns, the game has a total of 34 states, which makes it a candidate for a simple machine-learning demonstration, in the same vein as the MENACE machine.
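The figure of 34 can be checked by brute force. The sketch below assumes a state is described by the number of squares left in each row, that row lengths never increase going down the bar, and that the empty bar (game over) is not counted; `all_states` is a hypothetical helper, not a function from chomp.py.

```python
from itertools import product


def all_states(rows=3, cols=4):
    """List every reachable bar shape as a tuple of row lengths.

    A shape is valid when each row is no longer than the row above it.
    The empty bar is excluded because the game is already over.
    """
    return [
        b
        for b in product(range(cols + 1), repeat=rows)
        if all(b[i] >= b[i + 1] for i in range(rows - 1)) and b[0] > 0
    ]


states = all_states()
print(len(states))
```

Under these assumptions the count is 34, matching the figure above. The list comes out in sorted order with the full bar `(4, 4, 4)` last, though the repository's own state numbering may differ.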

Clone the repository and run chomp.py to run a simulation of chomp.

The 50GamesHuman.pkl file contains the state of the machine after 50 games against a human, starting with 2 beads in each box. The 2000Games#.pkl files contain the machine state after facing a random opponent, or another 'intelligent' opponent using the same strategy. perfect.pkl is almost certain to win when playing first.
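A MENACE-style learner of this kind can be sketched as follows. Only the starting count of 2 beads comes from the description above; the board representation, the function names, and the reward scheme (add one bead after a win, remove one after a loss) are assumptions made for illustration and may not match chomp.py.

```python
import random

INITIAL_BEADS = 2  # starting beads per move, as stated above


def legal_moves(board):
    """Every remaining square except the poisoned top-left one."""
    return [(r, c) for r, n in enumerate(board) for c in range(n) if (r, c) != (0, 0)]


def choose_move(boxes, board, rng=random):
    """Draw a move from the box for `board`, weighted by bead count.

    Returns None when no beads are left (or only the poisoned square
    remains), which a caller can treat as a forced loss.
    """
    box = boxes.setdefault(board, {m: INITIAL_BEADS for m in legal_moves(board)})
    moves = [m for m, beads in box.items() if beads > 0]
    if not moves:
        return None
    return rng.choices(moves, weights=[box[m] for m in moves], k=1)[0]


def reinforce(boxes, history, won, reward=1, penalty=1):
    """Adjust beads for each (board, move) the machine played this game.

    The reward and penalty sizes are assumed values, not taken from the repository.
    """
    for board, move in history:
        box = boxes[board]
        if won:
            box[move] += reward
        else:
            box[move] = max(0, box[move] - penalty)
```

A dictionary like `boxes` can be saved and restored with Python's `pickle` module, which is one plausible way files such as those listed above could hold the machine state.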

State transition probability diagram

This image shows the relative probability of choosing each square on a colour scale, after 10000 games of self-play. White squares cannot be played (they are already eaten, or they are the poisoned top-left square). State 34 (lower right corner of the image) is the starting state of the game. The program has determined that playing in square number 6 (3rd column, 2nd row) is the best first move.
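The relative probabilities in such a diagram follow directly from bead counts. The sketch below assumes squares are numbered row by row starting from 0, which is consistent with square 6 being the 3rd column of the 2nd row on a 4-column bar. The bead counts are invented for the example and are not taken from any of the .pkl files.

```python
COLS = 4


def square_to_row_col(square, cols=COLS):
    """Convert a square number to (row, col), assuming 0-based row-major numbering."""
    return divmod(square, cols)


def move_probabilities(beads):
    """Turn a {square: bead count} box into relative probabilities."""
    total = sum(beads.values())
    if total == 0:
        return {}
    return {square: count / total for square, count in beads.items()}


example_box = {5: 1, 6: 6, 7: 1}  # invented counts, for illustration only
probs = move_probabilities(example_box)
best = max(probs, key=probs.get)
print(best, square_to_row_col(best), probs[best])
```

With these invented counts, square 6 holds 6 of the 8 beads, so it is drawn three times out of four on average.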

The physical version of Chomp, which uses plastic containers filled with coloured beads to represent game states, was premiered at a Bringing Research to Life roadshow event at a school in south-west England.

Chomp at the Bringing Research to Life Roadshow

Including a short training period for three demonstrators to learn how to host the game, 98 games were played, in sets of seven. The results were marked on the chart below, with human wins marked from the top of the diagram and chomp wins from the bottom. The plot shows that over time chomp learns how to play and win.

Chomp's win record

Chomp was reset, then taken to Cheltenham Science Festival for two days. We played 143 games over this time and saw Chomp learn somewhat more erratically, consistent with it picking up non-optimal strategies from its opponents, which it had to unlearn before it could progress.

Chomp at Cheltenham Science Festival

James and Ed teach the machine

Pepper takes a look at Chomp

Chomp's next outing was at the Southampton Science and Engineering Day. We played 86 games over the day, and Chomp learned steadily, improving its win rate from 1/7 to 6/7.

Chomp at Southampton Science and Engineering Day

We were invited to Winchester Science Centre and had a great time. This time we asked everyone who played to give us a one-word review, and we've compiled them into a Wordle.

Wordle of Chomp reviews after Winchester Science Centre Visit

It's safe to say that Chomp is a winner!

Chomp on the top step of the podium at Winchester Science Centre
