grid-X

Grid game using the Microchip PIC18F87K22 microcontroller.

Demonstration of Q-learning using the microcontroller on three different levels

The algorithm converges to a steady state after a few iterations: the agent repeatedly takes the same path. These GIFs were converted from screen recordings of the oscilloscope display.

Level 1	Level 2	Level 3

Initialisation instructions

$ git clone https://github.com/ricktjwong/grid-X.git

MPLAB X IDE is recommended for viewing and building the project.

Linux
$ wget https://www.microchip.com/mplabx-ide-linux-installer

Windows
$ wget https://www.microchip.com/mplabx-ide-windows-installer

Mac
$ wget https://www.microchip.com/mplabx-ide-osx-installer

Repository branches

There are three branches: master, helper-modules_python and q-learning_python.

master contains the main assembly code
q-learning_python contains the Grid-X implementation in Python
helper-modules_python contains the Python scripts for map generation and PNG-to-hexadecimal voltage conversion

Top-level directory layout

The folders shown below are logical folders: they are organised only within the MPLAB X IDE project view.

.
├── main.asm                   # Main program: handles setup and displays screens according to game state
├── constants.inc              # Constant values such as item reward and movement penalty
├── Graphics                   # Graphics files for output onto an oscilloscope in x-y mode
│   ├── digits                 # Graphics files for displaying digits 0-9 for game scores
│   ├── grid_sprites           # Graphics for the grid sprites (wall, player, item, fire, goal)
│   ├── splash_screens         # Graphics for the start screen and end screen
│   ├── graphics.asm           # Logic to handle the checks and rendering of graphics
│   └── score_display.asm      # Decomposes a two's complement number into digits for display
├── Keypad
│   ├── actions.asm            # Checks how the player's move interacts with map elements and updates the score accordingly
│   ├── keypad.asm             # Handles key press and release events and performs the related checks
│   ├── keypad_editor.asm      # Handles keypad checks for the map builder mode, which has different controls
│   └── keypad_input.asm       # Abstracted subroutine which handles recording of keypad bytes
├── Qlearning
│   ├── agent.asm              # Reinforcement learning agent: updates the Q-table using the Q-learning algorithm
│   ├── findmax.asm            # Subroutine to find the maximum number in a list; returns the number and its index
│   ├── q_table.asm            # Initialises a 49x4 table of Q-values which the agent uses to make decisions
│   └── q_learning_mode.asm    # Checks the various game states to activate Q-learning mode for different levels
├── Tables                     # Initialises tables for different levels and the mapmatrix for 7x7 or 9x9 map size
└── Utils
    ├── delay.asm              # Subroutines to introduce small or large delays
    └── interrupt.asm          # Initialises interrupts for graphics rendering and keypad checks
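
As an illustration of what the score_display.asm entry describes, the following Python sketch splits a two's complement score into a sign flag and decimal digits. It is not taken from the repository: the function name score_to_glyphs is hypothetical, and the 8-bit score width is an assumption made for the example.

```python
def score_to_glyphs(raw_byte):
    """Split an 8-bit two's complement score into (is_negative, digits).

    Assumption: the score fits in one byte. The real register width
    used by score_display.asm is not stated in this README.
    """
    if not 0 <= raw_byte <= 0xFF:
        raise ValueError("expected a single byte (0-255)")

    # In two's complement, a set top bit marks a negative value.
    negative = raw_byte >= 0x80
    magnitude = 0x100 - raw_byte if negative else raw_byte

    # Peel off decimal digits, least significant first, by repeated
    # division by 10, then reverse them into display order.
    digits = []
    while True:
        magnitude, remainder = divmod(magnitude, 10)
        digits.append(remainder)
        if magnitude == 0:
            break
    digits.reverse()
    return negative, digits
```

For example, the byte 0xE2 represents -30, so score_to_glyphs(0xE2) yields a negative flag and the digits 3 and 0, which would map onto the minus sign and digit sprites.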

Game play

Grid-X has three different levels. The aim of the game is to move the player from the start position to the goal, accumulating the maximum number of points possible.

Level 1
Score: ___

  1 2 3 4 5 6 7
1 W W W W W W W
2 W - I - - G W
3 W - - - - W W
4 W W W W - - W
5 W - I - W - W
6 W X - - - - W
7 W W W W W W W

Legend:
W - wall
I - item
G - goal
X - character
F - fire

Goal:
Move the character X from the start point to the goal (G), accumulating as many points as possible.

Rules:

Each movement incurs a penalty of 3 points
Each item (I) collected will gain you 9 points
Walking into the fire (F) will lose you 10 points
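
The rules above can be checked by hand with a short Python sketch that scores a sequence of moves on the Level 1 map. The penalty and reward values come from the rules; everything else is an assumption for illustration: the names score_path, LEVEL_1 and STEPS are hypothetical, collected items are assumed to disappear, the game is assumed to end on reaching the goal, and moves into walls are simply rejected because their scoring is not described here.

```python
MOVE_PENALTY = 3   # from the rules: each movement costs 3 points
ITEM_REWARD = 9    # from the rules: each item gains 9 points
FIRE_PENALTY = 10  # from the rules: fire loses 10 points

# Level 1 map as drawn in this README (W wall, I item, G goal, X start).
LEVEL_1 = [
    "WWWWWWW",
    "W-I--GW",
    "W----WW",
    "WWWW--W",
    "W-I-W-W",
    "WX----W",
    "WWWWWWW",
]

# Row/column offsets for each move letter (hypothetical encoding).
STEPS = {"U": (-1, 0), "D": (1, 0), "L": (0, -1), "R": (0, 1)}


def score_path(level, moves):
    """Return (score, reached_goal) after applying moves from the start X."""
    grid = [list(row) for row in level]
    row, col = next(
        (r, c)
        for r, line in enumerate(grid)
        for c, cell in enumerate(line)
        if cell == "X"
    )
    score = 0
    for move in moves:
        d_row, d_col = STEPS[move]
        new_row, new_col = row + d_row, col + d_col
        cell = grid[new_row][new_col]
        if cell == "W":
            # Assumption: wall moves are rejected rather than scored.
            raise ValueError("move %r runs into a wall" % move)
        row, col = new_row, new_col
        score -= MOVE_PENALTY
        if cell == "I":
            score += ITEM_REWARD
            grid[row][col] = "-"  # assumption: an item can be collected once
        elif cell == "F":
            score -= FIRE_PENALTY
        elif cell == "G":
            return score, True    # assumption: reaching the goal ends the game
    return score, False
```

Under these assumptions, the shortest route score_path(LEVEL_1, "RRRRUULUUR") takes 10 moves and scores -30, while detouring through the lower item with score_path(LEVEL_1, "URRDRRUULUUR") takes 12 moves and scores -27. Collecting an item is only worthwhile when the detour costs fewer than three extra moves.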

Features

Grid-X has a normal game play mode, a map builder mode, and a Q-learning mode where the agent uses Q-learning to find the optimal path to reach the goal.
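
For readers unfamiliar with Q-learning, the following Python sketch shows the standard tabular update on a 49x4 table, matching the table size listed for q_table.asm (49 cells of a 7x7 map, 4 moves). It is a minimal illustration rather than a port of agent.asm: the function names, the learning rate ALPHA, the discount GAMMA, the epsilon-greedy exploration and the use of floating point are all assumptions.

```python
import random

N_STATES = 49   # one state per cell of the 7x7 map (state = row * 7 + col)
N_ACTIONS = 4   # up, down, left, right
ALPHA = 0.5     # learning rate (assumed value)
GAMMA = 0.9     # discount factor (assumed value)


def make_q_table():
    """Create a 49x4 table of Q-values initialised to zero."""
    return [[0.0] * N_ACTIONS for _ in range(N_STATES)]


def find_max(values):
    """Return (maximum, index of its first occurrence) for a non-empty list."""
    best_index = 0
    for index in range(1, len(values)):
        if values[index] > values[best_index]:
            best_index = index
    return values[best_index], best_index


def choose_action(q_table, state, epsilon, rng):
    """Epsilon-greedy choice: explore with probability epsilon, else exploit."""
    if rng.random() < epsilon:
        return rng.randrange(N_ACTIONS)
    _, best_action = find_max(q_table[state])
    return best_action


def q_update(q_table, state, action, reward, next_state, done):
    """Apply Q(s,a) += ALPHA * (reward + GAMMA * max Q(s',.) - Q(s,a))."""
    future = 0.0 if done else find_max(q_table[next_state])[0]
    target = reward + GAMMA * future
    q_table[state][action] += ALPHA * (target - q_table[state][action])
    return q_table[state][action]


# Usage: one step to the right from the Level 1 start cell (row 5, col 1).
rng = random.Random(0)
q_table = make_q_table()
start_state = 5 * 7 + 1
q_update(q_table, start_state, 3, -3, start_state + 1, done=False)
```

After the single update in the usage lines, the Q-value for moving right from the start cell is -1.5, since the target is the movement penalty of -3 and the learning rate is 0.5. Repeating such updates over many episodes is what lets the greedy choice settle on a fixed path.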

