GitHub - JordanMSchall/Project3: This is A.I. Berkeley Project 3

Branches Tags

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.idea		.idea
layouts		layouts
test_cases		test_cases
.gitignore		.gitignore
README.txt		README.txt
VERSION		VERSION
analysis.py		analysis.py
autograder.py		autograder.py
crawler.py		crawler.py
environment.py		environment.py
featureExtractors.py		featureExtractors.py
game.py		game.py
ghostAgents.py		ghostAgents.py
grading.py		grading.py
graphicsCrawlerDisplay.py		graphicsCrawlerDisplay.py
graphicsDisplay.py		graphicsDisplay.py
graphicsGridworldDisplay.py		graphicsGridworldDisplay.py
graphicsUtils.py		graphicsUtils.py
gridworld.py		gridworld.py
keyboardAgents.py		keyboardAgents.py
layout.py		layout.py
learningAgents.py		learningAgents.py
mdp.py		mdp.py
pacman.py		pacman.py
pacmanAgents.py		pacmanAgents.py
projectParams.py		projectParams.py
qlearningAgents.py		qlearningAgents.py
reinforcementTestClasses.py		reinforcementTestClasses.py
submission_autograder.py		submission_autograder.py
testClasses.py		testClasses.py
testParser.py		testParser.py
textDisplay.py		textDisplay.py
textGridworldDisplay.py		textGridworldDisplay.py
util.py		util.py
valueIterationAgents.py		valueIterationAgents.py

Repository files navigation

Quick Commands
python autograder.py -q q2
python autograder.py -t test_cases/q2/1-bridge-grid

Edit Files
valueIterationAgents.py	= A value iteration agent for solving known MDPs.
qlearningAgents.py = Q-learning agents for Gridworld, Crawler and Pacman.
analysis.py	= A file to put your answers to questions given in the project.


Getting Familiar
python gridworld.py -m
python gridworld.py -h
--------------------------------------------------------
  -h, --help            show this help message and exit
  -d DISCOUNT, --discount=DISCOUNT
                        Discount on future (default 0.9)
  -r R, --livingReward=R
                        Reward for living for a time step (default 0.0)
  -n P, --noise=P       How often action results in unintended direction
                        (default 0.2)
  -e E, --epsilon=E     Chance of taking a random action in q-learning
                        (default 0.3)
  -l P, --learningRate=P
                        TD learning rate (default 0.5)
  -i K, --iterations=K  Number of rounds of value iteration (default 10)
  -k K, --episodes=K    Number of epsiodes of the MDP to run (default 1)
  -g G, --grid=G        Grid to use (case sensitive; options are BookGrid,
                        BridgeGrid, CliffGrid, MazeGrid, default BookGrid)
  -w X, --windowSize=X  Request a window width of X pixels *per grid cell*
                        (default 150)
  -a A, --agent=A       Agent type (options are 'random', 'value' and 'q',
                        default random)
  -t, --text            Use text-only ASCII display
  -p, --pause           Pause GUI after each time step when running the MDP
  -q, --quiet           Skip display of any learning episodes
  -s S, --speed=S       Speed of animation, S > 1.0 is faster, 0.0 < S < 1.0
                        is slower (default 1.0)
  -m, --manual          Manually control agent
  -v, --valueSteps      Display each step of value iteration


--------------------------------------------------------


Reference Files
mdp.py	Defines methods on general MDPs.
learningAgents.py	Defines the base classes ValueEstimationAgent and QLearningAgent, which your agents will extend.
util.py	Utilities, including util.Counter, which is particularly useful for Q-learners.
gridworld.py	The Gridworld implementation.
featureExtractors.py	Classes for extracting features on (state,action) pairs. Used for the approximate Q-learning agent (in qlearningAgents.py).

Q4 Notes
update, computeValueFromQValues, getQValue, and computeActionFromQValues methods.
python autograder.py -q q4