Pacman-AI-agent-for-stochastic-environment

A Markov Decision Process (MDP) based implementation of a Pacman agent, to survive and battle through a handicapped stochastic environment

Handicap on AI

If AI chooses to move to a particular direction:

It moves to the direction intended 80% of the time
However, in 20% of the time, it may move to a direction which is perpendicular to the original intended direction.

For instance, if PACMAN chooses to move UP, there is a 80% chance it will do so, however, a 10% chance it may move left, and 10% to the right (not originally intended).

Value Iteration Algorithm

The Value Iteration algorithm is used in each time step of the game to calculate the max expected utility of all possible action.It uses the bellman equation to calculate utility of each possible action from a given state, by taking into account non‐determinism of the game and outputs the actions which yields maximum expected utility along with its value. Once this is computed, the action with the max value is chosen.

Evaluation of the AI Agent

Discount factor

Discount factor represents to what degree of importance the AI places on the future for each decision it makes

Reward values

Blank states represents blank cells within the board (have neither of the following: capsule, food, ghost, wall, ghost). Assigning a very small negative reward to such a state may incentivize the Pacman agent to pursue its goal quicker and discourage it from lingering around blank cells which do not have any reward.

The chart below demonstrates different reward values of blank states,tested against win percentages.

Number of iterations in the Algorithm

It can be noticed that iterations as small as 100 is good enough for small grid, however for larger layouts such as ‘mediumClassic’, it can be noticed that there is a lot to gain from a higher number of iterations, as increasing iterations from 1000 to 2000 results in dramatic increase of about 15% win.

Instructions for running the project:

python pacman.py -q -n 1 -p MDPAgent -l <grid name>

# example
python pacman.py -q -n 1 -p MDPAgent -l mediumClassic

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
.idea		.idea
.vs		.vs
layouts		layouts
test_cases		test_cases
BlankState_Reward_Evaluation_Chart.jpg		BlankState_Reward_Evaluation_Chart.jpg
Discount_Factor_Evaluation_Chart.jpg		Discount_Factor_Evaluation_Chart.jpg
Iteration_Evaluation_Chart.jpg		Iteration_Evaluation_Chart.jpg
README.md		README.md
VERSION		VERSION
VERSIONS		VERSIONS
api.py		api.py
api.pyc		api.pyc
commands.txt		commands.txt
eightpuzzle.py		eightpuzzle.py
game.py		game.py
game.pyc		game.pyc
ghostAgents.py		ghostAgents.py
ghostAgents.pyc		ghostAgents.pyc
graphicsDisplay.py		graphicsDisplay.py
graphicsDisplay.pyc		graphicsDisplay.pyc
graphicsUtils.py		graphicsUtils.py
graphicsUtils.pyc		graphicsUtils.pyc
keyboardAgents.py		keyboardAgents.py
keyboardAgents.pyc		keyboardAgents.pyc
layout.py		layout.py
layout.pyc		layout.pyc
mdpAgents.py		mdpAgents.py
mdpAgents.pyc		mdpAgents.pyc
pacman.py		pacman.py
pacman.pyc		pacman.pyc
pacmanAgents.py		pacmanAgents.py
projectParams.py		projectParams.py
sampleAgents.py		sampleAgents.py
searchTestClasses.py		searchTestClasses.py
textDisplay.py		textDisplay.py
textDisplay.pyc		textDisplay.pyc
util.py		util.py
util.pyc		util.pyc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pacman-AI-agent-for-stochastic-environment

Handicap on AI

Value Iteration Algorithm

Evaluation of the AI Agent

Discount factor

Reward values

Number of iterations in the Algorithm

Instructions for running the project:

About

Releases

Packages

Languages

Naharul98/Pacman-AI-agent-for-stochastic-environment

Folders and files

Latest commit

History

Repository files navigation

Pacman-AI-agent-for-stochastic-environment

Handicap on AI

Value Iteration Algorithm

Evaluation of the AI Agent

Discount factor

Reward values

Number of iterations in the Algorithm

Instructions for running the project:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages